Commit Graph

24 Commits

Author SHA1 Message Date
Robyn Speer
313306f12e try to match the wordlist metanl actually uses
Former-commit-id: 90772e33fb
2013-10-31 15:13:22 -04:00
Robyn Speer
773f6b9843 The metanl scale is not what I thought it was.
Former-commit-id: 0d2fb21726
2013-10-31 14:38:01 -04:00
Robyn Speer
351378e318 Don't download the DB if the right version is already there
Former-commit-id: e931062b5a
2013-10-31 14:12:04 -04:00
Robyn Speer
16bc844841 try being really nonspecific about functools32 versions
Former-commit-id: c1564908f2
2013-10-31 14:06:06 -04:00
Robyn Speer
8690ac3f57 be less specific about the functools32 version
Former-commit-id: 2542cf9e35
2013-10-31 14:02:40 -04:00
Robyn Speer
9163a67a9f Add wordfreq_data files.
Now the build process is repeatable from scratch, even if something goes
wrong with the download server.


Former-commit-id: 26c0d7dd28
2013-10-31 13:39:02 -04:00
Robyn Speer
101e767ad9 When strings are inconsistent between py2 and 3, don't test them on py2. 2013-10-31 13:11:13 -04:00
Robyn Speer
52bcb99c48 add util.py, which provides standardize_word 2013-10-30 18:14:43 -04:00
Robyn Speer
5b31bd415f and of course this changes the metanl constant 2013-10-30 18:14:34 -04:00
Robyn Speer
4bda3e6b6f Turns out we need to change the metanl constant after normalizing words. 2013-10-30 16:58:10 -04:00
Robyn Speer
8f00846117 Normalize words when storing them or looking them up. 2013-10-30 14:59:57 -04:00
Robyn Speer
ea5de7cb2a Revise the build test to compare lengths of wordlists.
The test currently fails on Python 3, for some strange reason.
2013-10-30 13:22:56 -04:00
Lance
74cfb69f5a Another Py3 change, this one for functools32 2013-10-30 12:06:41 -04:00
Lance
de41143159 Py3 tweak to urllib import 2013-10-30 11:57:50 -04:00
Robyn Speer
68f7b25cf7 Change default values to offsets. 2013-10-29 18:06:47 -04:00
Robyn Speer
8a48e57749 now this package has tests 2013-10-29 17:21:55 -04:00
Robyn Speer
a95d88d1b9 Implement the data uploady downloady stuff in setup. 2013-10-29 16:44:13 -04:00
Robyn Speer
91a62dbee5 Deal with database connections more consistently 2013-10-29 16:43:58 -04:00
Robyn Speer
4fc1971b0f Add a couple of useful statistics about wordlists 2013-10-29 16:42:38 -04:00
Robyn Speer
67fefa5dd5 add query.iter_wordlist, to visit all words in a list 2013-10-29 12:44:16 -04:00
Robyn Speer
36344d3737 prepare to write custom commands in setup.py 2013-10-29 12:43:41 -04:00
Robyn Speer
c0ed89c015 revise config.py, clarify some of query.py 2013-10-29 12:18:38 -04:00
Robyn Speer
a92fed80cf better default parameters and better log messages in building 2013-10-29 12:04:17 -04:00
Robyn Speer
e8273e47a1 Initial version.
Noticeably missing: data files or any way to get them.
2013-10-28 19:26:44 -04:00