This project focuses specifically on using a probabilistic language model combined with techniques like the n-gram model and edit distance, particularly for the English language. The bigram language model and Damerau-Levenshtein distance algorithm are employed to develop a spelling checker. This system uses edit distance and word probabilities from a medical and Covid-19 related corpus. A user-friendly Graphical User Interface (GUI) is created with Tkinter. While the system shows promising results, it has limitations such as handling contextual ambiguity, homophones, and plural words. It is also limited to the English language and does not address misspellings in other languages.
-
Notifications
You must be signed in to change notification settings - Fork 0
A probabilistic language model combined with techniques like the n-gram model and edit distance, particularly for the English language.
License
yanshenp/nlp-spell-checker
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A probabilistic language model combined with techniques like the n-gram model and edit distance, particularly for the English language.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published