TheCedarPrince changed the title from "Interest Improving Sentence Tokenization" to "Interest in Improving Sentence Tokenization" on Jan 18, 2021
I am interested in investigating and improving the sentence tokenizers part of WordTokenizers.jl. Would that be of interest to you if I work on a PR regarding this?
Sure. Contributions are welcome.
I am not familiar with how SpaCy handles sentence splitting. Maybe we could have something similar in this package as well.
Do you have any ideas on how you want to improve the sentence tokenizer? Could you also share some samples (if possible) from your PDF that weren't working well with these tokenizers?
Hi @Ayushk4 - @oxinabox and @aviks suggested that I ping you.
I am interested in investigating and improving the sentence tokenizers part of WordTokenizers.jl. Would that be of interest to you if I work on a PR regarding this? Thanks!