mirror of
https://github.com/rspeer/wordfreq.git
synced 2024-12-23 17:31:41 +00:00
a0893af82e
* Remove marks from more languages
* Add Korean tokenization, and include MeCab files in data
* add a Hebrew tokenization test
* fix terminology in docstrings about abjad scripts
* combine Japanese and Korean tokenization into the same function
Former-commit-id:
|
||
---|---|---|
.. | ||
test_chinese.py | ||
test_japanese.py | ||
test_korean.py | ||
test.py |