Erdos_2022_05_Audio_Project

Erdos Institute's May Data Science Boot Camp, 2022

Data source:

common-voice2 - Kaggle
https://www.kaggle.com/datasets/danielgraham1997/commonvoice2

OVERVIEW - LENA Foundation inspired audio project

Count spoken words in audio clips

Given an audio clip, we want to count the number of spoken words it contains. There are parameters used to split audio clips at "silences" which we optimize. We use machine learning models to find the features which effect the accuracy of our counter (for example, if our counter more accurate for females than males, we will account for that).

Later, we want to make a model which can associate word counts with people (e.g. 100 words by child, 300 words by mother, 500 words by teacher in a specific day).

Most helpful link:

Split audio files using silence detection - StackOverflow

>*https://stackoverflow.com/questions/45526996/split-audio-files-using-silence-detection*

Other links to consider:

Audio signal split at word level boundary - StackOverflow

*https://stackoverflow.com/questions/64153590/audio-signal-split-at-word-level-boundary*

Split speech audio file on words in python - StackOverflow

*https://stackoverflow.com/questions/36458214/split-speech-audio-file-on-words-in-python*

Using pyDub to chop up a long audio file - StackOverflow

*https://stackoverflow.com/questions/23730796/using-pydub-to-chop-up-a-long-audio-file*

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.ipynb_checkpoints		.ipynb_checkpoints
commonvoice		commonvoice
.gitattributes		.gitattributes
00_Optimized_Counting_Function.ipynb		00_Optimized_Counting_Function.ipynb
01_1_EDA_DongJoanne.ipynb		01_1_EDA_DongJoanne.ipynb
01_2_Augment_DF_MESSY.ipynb		01_2_Augment_DF_MESSY.ipynb
02_1_ML_Linear_Regression.ipynb		02_1_ML_Linear_Regression.ipynb
02_2_ML_Multiple_Linear_Regression.ipynb		02_2_ML_Multiple_Linear_Regression.ipynb
02_3_ML_Multiclass_Logistic_Regression.ipynb		02_3_ML_Multiclass_Logistic_Regression.ipynb
03_1_ValTest_of_ML_Models.ipynb		03_1_ValTest_of_ML_Models.ipynb
03_2_ValTest_of_MLR_Model.ipynb		03_2_ValTest_of_MLR_Model.ipynb
Imports_and_Functions.ipynb		Imports_and_Functions.ipynb
LICENSE		LICENSE
README.md		README.md
Summary.ipynb		Summary.ipynb
TE_MED_df.tsv		TE_MED_df.tsv
TR_MED_df.tsv		TR_MED_df.tsv
TR_df_DongJoanne.csv		TR_df_DongJoanne.csv
VA_MED_df.tsv		VA_MED_df.tsv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Erdos_2022_05_Audio_Project

Data source:

OVERVIEW - LENA Foundation inspired audio project

Count spoken words in audio clips

Most helpful link:

Other links to consider:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Erdos_2022_05_Audio_Project

Data source:

OVERVIEW - LENA Foundation inspired audio project

Count spoken words in audio clips

Most helpful link:

Other links to consider:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages