Environmental-Sound-Classification-ESC-using-neural-networks-and-other-classifiers

Audio feature extraction and classification with the ECS-10 data set audio dataset
ECS-10 audio data is included. It consists of 10 classes of different environmental sounds (sea waves, kids playing, etc.)
The main goal is to compare classification accuracies for the 6 tested classifiers.

Dependencies

Librosa (audio loading, audio visualization and feature extraction)
Sci-kit learn
Keras (Theano backend)
Numpy, Matplotlib
Pandas (data visualization)

Google Colab Notebook

A Google Colab Notebook (Python 3.7 Kernel) is added to illustrate the workflow.

The scripts for feature extraction and classification have been added as .ipynb files and are all loaded in the Jupyter Notebook sequentally.

Running feature_extraction.py creates a numpy array for features (feature.npy) and one for labels (label.npy). These files will be saved in the current directory.

Audio features extracted

MFCC
Chroma
Mel spectrogram
Tonal centroid feature
Spectral contrast

Classifiers implemented

Convolutional Neural Network (CNN)
Multilayer Perceptron (MLP)
Recurrent Neural Network (RNN)
Support Vector Machine (SVM)
Random Forest (RF)
Naive Bayes (NB)
KNearestNeighbors(KNN)

Accuracies obtained

Note: Direct comparison between classifiers can't be done yet since their parameters haven't been tuned to optimize accuracy yet. Out of 400 audio samples, the test set consisted on the 33% of this.

CNN: 78.125% (100 epochs)
MLP: 79.125 (100 epochs)
RNN: 72% (100 epochs)
SVM: 81.7%
RF: 83%
NB: 69.7%
KNN: 67%

Approaches to improve accuracy

Compute other features: MFCC + ZCR features improve classification accuracy for speech, noise and music labels. See if it also works for the 10 classes.
Tune optimization hyperparameters (for every classifier): Weight initialization, decaying learning rate.
Data scaling and feature normalization (MFCC)

Name	Name	Last commit message	Last commit date
Latest commit MarkMburu Created using Colaboratory Jul 30, 2020 4f9c3d7 · Jul 30, 2020 History 4 Commits
audio-data	audio-data	initial commit	Jul 20, 2020
.README.md.swp	.README.md.swp	added readme	Jul 20, 2020
Esc.ipynb	Esc.ipynb	initial commit	Jul 20, 2020
README.md	README.md	added readme	Jul 20, 2020
RecurrentNeuralNetwork.ipynb	RecurrentNeuralNetwork.ipynb	initial commit	Jul 20, 2020
conformal_predictor.ipynb	conformal_predictor.ipynb	Created using Colaboratory	Jul 30, 2020
convolutionNeuralNetwork.ipynb	convolutionNeuralNetwork.ipynb	initial commit	Jul 20, 2020
feat.npy	feat.npy	initial commit	Jul 20, 2020
feature_extraction.py	feature_extraction.py	initial commit	Jul 20, 2020
knn.ipynb	knn.ipynb	initial commit	Jul 20, 2020
label.npy	label.npy	initial commit	Jul 20, 2020
multilayer_perceptron.ipynb	multilayer_perceptron.ipynb	initial commit	Jul 20, 2020
thesis.pdf	thesis.pdf	added thesis	Jul 27, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Environmental-Sound-Classification-ESC-using-neural-networks-and-other-classifiers

Dependencies

Google Colab Notebook

Audio features extracted

Classifiers implemented

Accuracies obtained

Approaches to improve accuracy

About

Releases

Packages

Languages

MarkMburu/Environmental-Sound-Classification-ESC-using-neural-networks-and-other-classifiers

Folders and files

Latest commit

History

Repository files navigation

Environmental-Sound-Classification-ESC-using-neural-networks-and-other-classifiers

Dependencies

Google Colab Notebook

Audio features extracted

Classifiers implemented

Accuracies obtained

Approaches to improve accuracy

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages