We can put the cutoff back now

I took it out when a step in the English SUBTLEX process was outputting
frequencies instead of counts, but I've fixed that now.
This commit is contained in:
Rob Speer 2015-09-04 16:16:52 -04:00
parent 56318a3ca3
commit 5c7a7ea83e

View File

@ -280,7 +280,7 @@ def combine_lists(languages):
output_file = wordlist_filename('combined', language)
add_dep(lines, 'merge', input_files, output_file,
extra='wordfreq_builder/word_counts.py',
params={'cutoff': 0})
params={'cutoff': 2})
output_cBpack = wordlist_filename(
'combined-dist', language, 'msgpack.gz')