-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor IOB #64
base: master
Are you sure you want to change the base?
Refactor IOB #64
Conversation
a later PR will add crfsuite base on the newly python-crfsuite(https://github.com/tpeng/python-crfsuite)
It is better to put models somewhere else, and notebooks were broken.
add base classifier and global ngrams feature functions
1. rename DEFAULT_TAGSET to EXAMPLE_TAGSET; 2. rename DEFAULT_FEATURES to EXAMPLE_TOKEN_FEATURES; 3. make token_features empty by default in create_wapiti_pipeline.
except model_filename must be kwargs now. Also, this fixes the example from the tutorial.
…v/webstruct into speed_up_text_tokenizer
Speed up text tokenizer
Wapiti for webstruct
Ignore all non-text tags
Non-recursive implementation of algorithm
fix boolean bug
update travis to run different python versions
Codecov Report
@@ Coverage Diff @@
## master #64 +/- ##
==========================================
+ Coverage 81.02% 81.08% +0.06%
==========================================
Files 40 40
Lines 2092 2104 +12
==========================================
+ Hits 1695 1706 +11
- Misses 397 398 +1 |
c, tg = self.sequence_encoder.split(c) | ||
chains.append((c, tree, is_tail)) | ||
tags.append(tg) | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
with this implementation I don't think it would make sense to create BILOU as a class as it would be just a function that translates IOB tags. I would add here a check for Bilou flag that if True would call the bilou_translator function from sequence_encoding.py
No description provided.