GitHub - roddar92/linguistics_problems: Natural language processing in examples and games

Computational linguistics

Welcome to the main page of my project! This repository stores examples of linguistics problems.

My name is Daria, I'm a software engineer with skills in natural language processing. My general scientific interests are knowledge bases and facts extraction. There are very important analysis tools that provides semantic analysis and text mining.

Project has next sections:

Pre-morphology
Phonology
Morphology
Knowledge engineering
N-grams applications
Games

In the source code three languages is supported now: English, Russian and Finnish. I hope that very soon next publishing problems will implement NLP-algorithms for more languages.

Source code:

Pre-morphology

Phonology

Morphology

Syntax

Syntax analyzer for simple sentences

Knowledge engineering

N-grams applications

N-gram dictionary (for spelling/for language modeling)
Simple English word filler
N-gram language model
Collocations
Russian diminutive names generator with RNN
Russian character RNN (non-smoothing)
Russian joking language model (PI Day)
Simple spell-checker (based on n-grams and Damerau-Levenstein distance)
Advanced spell-checker based on:
- dictionary of words from good texts with 2-3-gram index;
- train language model with 2-grams on good texts;
- retrieval candidates with Damerau-Levenstein distance;
- find candidate with max probability of bigram max{ P(prev_word, candidate), candidate in candidates}

Name		Name	Last commit message	Last commit date
Latest commit History 729 Commits
.idea		.idea
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computational linguistics

Pre-morphology

Phonology

Morphology

Syntax

Knowledge engineering

N-grams applications

Games

About

Releases

Packages

Contributors 2

Languages

roddar92/linguistics_problems

Folders and files

Latest commit

History

Repository files navigation

Computational linguistics

Pre-morphology

Phonology

Morphology

Syntax

Knowledge engineering

N-grams applications

Games

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages