wordfreq/wordfreq
2015-09-22 16:54:39 -04:00
..
data load the Chinese character mapping from a .msgpack.gz file 2015-09-22 16:32:33 -04:00
__init__.py replace the literal 10 with the constant INFERRED_SPACE_FACTOR 2015-09-22 16:46:07 -04:00
chinese.py actually, still delay loading the Jieba tokenizer 2015-09-22 16:54:39 -04:00
japanese.py WIP: Traditional Chinese 2015-09-04 18:52:37 -04:00
tokens.py tokenize Chinese using jieba and our own frequencies 2015-09-05 03:16:56 -04:00