wordfreq/wordfreq
2015-09-24 12:50:06 -04:00
data         load the Chinese character mapping from a .msgpack.gz file  2015-09-22 16:32:33 -04:00
__init__.py  don't apply the inferred-space penalty to Japanese          2015-09-24 12:50:06 -04:00
chinese.py   actually, still delay loading the Jieba tokenizer           2015-09-22 16:54:39 -04:00
japanese.py  WIP: Traditional Chinese                                    2015-09-04 18:52:37 -04:00
tokens.py    tokenize Chinese using jieba and our own frequencies        2015-09-05 03:16:56 -04:00