Skip to content

A neural network featuring convolutional layers designed to classify eight different languages. Class activation maps are used to investigate similarities in activation patterns across and within languages.

License

Notifications You must be signed in to change notification settings

nick-monto/LangNet_CNN

Repository files navigation

LangNet_CNN

Hrayr Harutyunyan showed that CNNs are very good at identifying what language is being spoken give multiple languages.

This repository is for my exploration of CNNs in language recognition. I am currently using file from several Shtooka databases spanning eight languages. I am still searching for other freely available spoke word databases that contain recordings across many languages and multiple speakers within each language.

The purpose of this work is to examine the features that the CNN architecture deems as important for distinguishing different languages.

During the preprocessing, there are two other folders. One folder containing the Flac files from Shtooka and one folder containing the converted flac - wav files. I have not uploaded these as they each contain over 6000 files.

About

A neural network featuring convolutional layers designed to classify eight different languages. Class activation maps are used to investigate similarities in activation patterns across and within languages.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages