Calculates word frequencies in the segmented ACTib corpus.
Run make download to download the whole corpus.
frequencies.py populates output/ with:
- a folder per collection, containing one frequency file per volume in that collection
- one file per collection that sums the frequencies of all volumes in that collection
- total_freqs.txt, which contains the overall frequencies for the whole of ACTib
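
The three levels of output described above (per volume, per collection, whole corpus) could be produced roughly as in the sketch below. This is not the code of frequencies.py. It assumes a hypothetical layout of one folder per collection with one .txt file per volume, tokens separated by whitespace, and a tab-separated "word, count" output format; the function names are made up for illustration.

```python
from collections import Counter
from pathlib import Path


def count_words(text):
    """Count tokens in already-segmented text (assumed whitespace-separated)."""
    return Counter(text.split())


def write_freqs(freqs, path):
    """Write one 'word<TAB>count' line per word, most frequent first (assumed format)."""
    path.parent.mkdir(parents=True, exist_ok=True)
    lines = [f"{word}\t{count}" for word, count in freqs.most_common()]
    path.write_text("\n".join(lines) + "\n", encoding="utf-8")


def build_frequencies(corpus_dir, output_dir):
    """Assumed layout: corpus_dir/<collection>/<volume>.txt (hypothetical)."""
    corpus_dir, output_dir = Path(corpus_dir), Path(output_dir)
    total = Counter()
    for collection in sorted(p for p in corpus_dir.iterdir() if p.is_dir()):
        collection_total = Counter()
        for volume in sorted(collection.glob("*.txt")):
            freqs = count_words(volume.read_text(encoding="utf-8"))
            # One frequency file per volume, inside the collection's folder.
            write_freqs(freqs, output_dir / collection.name / volume.name)
            collection_total += freqs
        # One file per collection, summing all of its volumes.
        write_freqs(collection_total, output_dir / f"{collection.name}.txt")
        total += collection_total
    # Overall frequencies for the whole corpus.
    write_freqs(total, output_dir / "total_freqs.txt")
    return total
```

Calling build_frequencies("corpus", "output") on such a layout would fill output/ with the per-volume files, the per-collection files, and total_freqs.txt. Using Counter keeps the summing step to a single += per level.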