Robyn Speer
|
56e811be19
|
Fix Dutch lists
- Use surface forms consistently, not stems
- Count all instances of words on Wikipedia, not one per article
Former-commit-id: 3507d8b630
|
2015-03-12 16:00:03 -04:00 |
|
Robyn Speer
|
ca944e54aa
|
new Dutch data, bump version to 0.6
Former-commit-id: 377336bcdc
|
2015-03-03 15:54:45 -05:00 |
|
Robyn Speer
|
ad22387a53
|
add surface forms from Twitter 2014 data
Former-commit-id: ffdaa82b11
|
2015-02-17 15:06:11 -05:00 |
|
Robyn Speer
|
f4280dcad0
|
add twitter-stems-2014 wordlist data
Former-commit-id: 6ab72201cd
|
2015-02-11 13:29:32 -05:00 |
|
Robyn Speer
|
313306f12e
|
try to match the wordlist metanl actually uses
Former-commit-id: 90772e33fb
|
2013-10-31 15:13:22 -04:00 |
|
Robyn Speer
|
9163a67a9f
|
Add wordfreq_data files.
Now the build process is repeatable from scratch, even if something goes
wrong with the download server.
Former-commit-id: 26c0d7dd28
|
2013-10-31 13:39:02 -04:00 |
|