fix duplication

Former-commit-id: d4d7b2f72e
This commit is contained in:
Rob Speer 2015-07-07 18:18:27 -04:00
parent 66d5d0002f
commit 20299b30b2

View File

@ -230,10 +230,9 @@ def word_frequency(word, lang, wordlist='combined', minimum=0.):
- Full text of Wikipedia
- A sample of 72 million tweets collected from Twitter in 2014,
divided roughly into languages using automatic language detection
- Google Books Syntactic Ngrams 2013
- Frequencies extracted from OpenSubtitles
- The Leeds Internet Corpus
- Google Books Ngrams and Google Books Syntactic Ngrams
- Google Books Syntactic Ngrams 2013
Another available wordlist is 'twitter', which uses only the data from
Twitter.