Mirror of https://github.com/rspeer/wordfreq.git (synced 2024-12-23 09:21:37 +00:00)
0a2bfb2710
* Remove marks from more languages
* Add Korean tokenization, and include MeCab files in data
* add a Hebrew tokenization test
* fix terminology in docstrings about abjad scripts
* combine Japanese and Korean tokenization into the same function
Former-commit-id: fec6eddcc3
9 lines
285 B
Plaintext
recursive-include wordfreq/data *.gz
include README.md
recursive-include wordfreq/data *.txt
recursive-include wordfreq/data *.bin
recursive-include wordfreq/data *.def
recursive-include wordfreq/data *.dic
recursive-include wordfreq/data dicrc
recursive-include wordfreq/data COPYING