api/
contains the main "pipeline" command line toolcorpora/
contains scripts for reading corpora into predictable data structures from various source formats.emr/
contains scripts for running jobs on Elastic Map Reducelib/
holds miscellaneous helpers or basic text manipulation tools.ml/
contains all the machine learning and natural language processing tools.models/
holds the database schema and relationship definitions.ui/
contains the web application.__init__.py
contains extensive log configuration.
After installing, tweedr
can be used as a Python package:
import tweedr
print tweedr.__version__