asr-lord/speech-to-text-voxforge

 
 

speech-to-text-voxforge

Download the speech corpus

To download the speech corpus, run:

python downloader.py "voxforge-corpus"

You can also set the number of speaker directories to download with -n, the number of download threads with -w, and the source URL with -url:

python downloader.py "voxforge-corpus" -n 20000 -w 15 -url http://www.repository.voxforge1.org/downloads/SpeechCorpus/Trunk/Audio/Main/8kHz_16bit/
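
To illustrate how a speaker limit and a thread count might interact, here is a minimal sketch of a bounded, multi-threaded download loop. It is not the code in downloader.py: the names download_all and fake_fetch, the archive names, and the injected fetch callable are all hypothetical, and the real script may select and fetch archives differently.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, Optional


def download_all(
    names: Iterable[str],
    fetch: Callable[[str], str],
    limit: Optional[int] = None,
    workers: int = 4,
) -> List[str]:
    """Fetch up to `limit` items from `names` using `workers` threads.

    `limit` plays the role of -n and `workers` the role of -w (assumption).
    """
    selected = list(names)
    if limit is not None:
        selected = selected[:limit]
    # pool.map returns results in the same order as the input names.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch, selected))


def fake_fetch(name: str) -> str:
    # Hypothetical stand-in for an HTTP download; returns the target path only.
    return f"voxforge-corpus/{name}"


if __name__ == "__main__":
    archives = [f"speaker-{i}.tgz" for i in range(10)]
    print(download_all(archives, fake_fetch, limit=3, workers=2))
```

Passing the fetch function as a parameter keeps the sketch runnable without network access; a real fetch would request each archive from the corpus URL and unpack it into the target directory.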

Generate training data

To generate a training data file for the speech recognition tool, run generator.py with two arguments: the path to the directory the voxforge corpus was downloaded to, and the path of the new file to write the training data to. The data is stored as JSON.

python generator.py "voxforge-corpus" "training_data.json"
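
Once the file exists, a quick sanity check is to load it and look at its top-level shape. The sketch below assumes nothing about the schema generator.py produces, only that the file is valid JSON; summarize_training_data is a hypothetical helper name.

```python
import json
from pathlib import Path


def summarize_training_data(path):
    """Return the top-level JSON type name and its length (None for scalars)."""
    with Path(path).open(encoding="utf-8") as handle:
        data = json.load(handle)
    size = len(data) if isinstance(data, (list, dict)) else None
    return type(data).__name__, size


if __name__ == "__main__":
    target = Path("training_data.json")
    # Only inspect the file if the generator has already been run.
    if target.exists():
        print(summarize_training_data(target))
```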

About

Downloader for the voxforge corpus
