mimic-tokenize

My heuristic script for sentence tokenization of MIMIC notes.

Installation

    pip install -r requirements.txt
    python -m nltk.downloader punkt

Usage

    python build_corpus.py small
    python heuristic-tokenize.py data/4.txt
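
The README does not say which heuristics the script applies, so the following is only a rough sketch of what heuristic sentence splitting on clinical notes can look like. It is not the code in heuristic-tokenize.py, and it uses only the standard library rather than the NLTK punkt model installed above. The names `ABBREVIATIONS` and `split_sentences` and the abbreviation list itself are assumptions made for illustration.

```python
import re
from typing import List

# Hypothetical abbreviation list; the real script may use a different one or none.
ABBREVIATIONS = {"dr.", "mr.", "mrs.", "ms.", "pt.", "vs.", "e.g.", "i.e."}


def split_sentences(note: str) -> List[str]:
    """Split a note into sentences using two simple heuristics."""
    sentences = []
    # Heuristic 1: blank lines separate sections, so never join text across them.
    for paragraph in re.split(r"\n\s*\n", note):
        # Hard-wrapped lines inside a paragraph are collapsed into one line.
        text = " ".join(paragraph.split())
        if not text:
            continue
        current = []
        for token in text.split(" "):
            current.append(token)
            # Heuristic 2: end a sentence at ., ? or ! unless the token
            # is a known abbreviation such as "Dr." or "Pt.".
            if token.endswith((".", "?", "!")) and token.lower() not in ABBREVIATIONS:
                sentences.append(" ".join(current))
                current = []
        # Keep trailing text that has no terminal punctuation (e.g. headers).
        if current:
            sentences.append(" ".join(current))
    return sentences


if __name__ == "__main__":
    example = "Pt. seen by Dr. Smith today. No acute\ndistress noted.\n\nPlan: continue meds"
    for sentence in split_sentences(example):
        print(sentence)
```

Run on the example string, this prints three lines: "Pt. seen by Dr. Smith today.", "No acute distress noted." and "Plan: continue meds". A trained model such as punkt would normally handle abbreviations statistically; the fixed list here just keeps the sketch self-contained.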