A Node.js port of splitta, Dan Gillick's statistical sentence boundary
detector. It does not provide all of splitta's functionality. It
requires a lightly patched build of svmlight to run. The patch simply
adds a binary called svm_classifyd, a classifier that runs as a daemon
and communicates over stdin/stdout, which avoids spawning new
svm_classify processes. See Model.coffee for the details.
To install the patched svmlight using Homebrew:
$ brew tap contours/homebrew-alt
$ brew install svm_light
To install node-splitta:
$ git clone https://github.com/contours/node-splitta.git
$ cd node-splitta
$ npm install
$ npm test