English-Phoneme-ngrams

The repository lets you make phoneme level ngram model of English. To use it, run build_PDF.py, and then pass the output of that into the input of process_PFD.py. The first script will download two text files, combine them, and save the result. The second script will build an ngram model out of that.

The two text files are a pronunciation dictionary and a frequecny dictionary. Combining these gives a pronunciation-frequency dictionary (PFD). From that we can make an ngram model. Here is some example output:

wʌtɚmʌðɚ, sɛlʌm, paʊntɚnmʌnd, brɪkjʌŋ, sʌbʌt, dɪskul

And here is how I would probably spell those words:

Whatermother, Selum, Pounternmund, Brikyung, Suhbut, Diskul

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitattributes		.gitattributes
README.md		README.md
build__PFD.py		build__PFD.py
process_PFD.py		process_PFD.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

English-Phoneme-ngrams

About

Releases

Packages

Languages

DavidK0/English-Phoneme-ngrams

Folders and files

Latest commit

History

Repository files navigation

English-Phoneme-ngrams

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages