Access a database of word frequencies in various natural languages.

Tools for working with word frequencies from various corpora.

Author: Rob Speer

## License

`wordfreq` is freely redistributable under the MIT license. See
`MIT-LICENSE`.

It contains data extracted from Google Books Ngrams
(http://books.google.com/ngrams). The terms of use of this data are:

    Ngram Viewer graphs and data may be freely used for any purpose, although
    acknowledgement of Google Books Ngram Viewer as the source, and inclusion
    of a link to http://books.google.com/ngrams, would be appreciated.

It also contains data from the University of Leeds Centre for Translation
Studies (see http://corpus.leeds.ac.uk/list.html), which is freely
redistributable under a Creative Commons Attribution license.