Generating Sound

In this repository, audio files from the FSDD (Free Spoken Digit Dataset) are preprocessed with a preprocessing pipeline (see Audio Signal Processing for ML) and used to train a Variational Autoencoder (VAE) model. The trained model generates new audio, which is written to the /Audio directory.
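
The preprocessing notebook itself is not reproduced here. As a rough illustration of what such a pipeline typically does before a VAE sees the data, the sketch below pads each clip to a fixed length, computes a log-magnitude spectrogram, and min-max normalises it. Everything in it is an assumption made for illustration: the function names, `frame_size`, `hop_length`, and the NumPy-only STFT are hypothetical and may differ from the repository's notebooks, which may use a dedicated audio library instead.

```python
import numpy as np

def pad_or_trim(signal, num_samples):
    """Cut or right-pad with zeros so every clip has the same length."""
    signal = signal[:num_samples]
    return np.pad(signal, (0, num_samples - len(signal)))

def log_spectrogram(signal, frame_size=512, hop_length=256):
    """Log-magnitude STFT in decibels, shape (freq_bins, frames).

    frame_size and hop_length are illustrative values, not the repo's.
    """
    window = np.hanning(frame_size)
    num_frames = 1 + (len(signal) - frame_size) // hop_length
    frames = np.stack([
        signal[i * hop_length: i * hop_length + frame_size] * window
        for i in range(num_frames)
    ])
    magnitude = np.abs(np.fft.rfft(frames, axis=1)).T
    # Floor the magnitude so silent frames do not produce log(0).
    return 20.0 * np.log10(np.maximum(magnitude, 1e-10))

def min_max_normalise(array):
    """Scale to [0, 1]; return lo and hi so the step can be undone later."""
    lo, hi = array.min(), array.max()
    scale = hi - lo if hi > lo else 1.0  # guard against constant input
    return (array - lo) / scale, lo, hi

def denormalise(norm, lo, hi):
    """Invert min_max_normalise using the stored lo and hi."""
    scale = hi - lo if hi > lo else 1.0
    return norm * scale + lo

# Synthetic stand-in for one recording: a 0.5 s, 440 Hz tone at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate // 2) / sample_rate
clip = pad_or_trim(np.sin(2 * np.pi * 440 * t), sample_rate)
features, lo, hi = min_max_normalise(log_spectrogram(clip))
print(features.shape, float(features.min()), float(features.max()))
```

Keeping `lo` and `hi` for each clip matters on the generation side: a spectrogram produced by the model is in the normalised range and has to be mapped back to decibels before it can be converted to a waveform.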

Some Notes:

This repo is for demonstration only, so the quality of the output audio isn't the best.
This repo was initially written without the intent of being published, so the code may be unorganized in places; it will be restructured later.

References:

Generating Sound with Neural Networks, playlist on YouTube by Valerio Velardo.
Generative Deep Learning, 2nd Edition, by David Foster, Chapter 3: Variational Autoencoders.

Folders and files:

samples
#1-Audio_pre_processing_for_generation.ipynb
#2-train_vae.ipynb
#3-generate_sound_script.ipynb
#3.1-sound_generator_class.ipynb
.gitignore
README.md
sound_generator_class.py
vae_class.py
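
The contents of `vae_class.py` and the training notebook are not shown here. As a hedged sketch of the two pieces a VAE generally relies on, the code below implements reparameterised sampling from the latent space and a loss that combines reconstruction error with a KL term. It is written in plain NumPy for readability; the function names and the `reconstruction_weight` value are hypothetical and are not taken from the repository.

```python
import numpy as np

def sample_latent(mu, log_var, rng):
    """Reparameterisation trick: z = mu + sigma * eps with eps ~ N(0, I)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_divergence(mu, log_var):
    """KL(N(mu, sigma^2) || N(0, I)) per example, summed over latent dims."""
    return -0.5 * np.sum(1.0 + log_var - mu ** 2 - np.exp(log_var), axis=1)

def reconstruction_loss(target, prediction):
    """Mean squared error per example over all non-batch axes."""
    axes = tuple(range(1, target.ndim))
    return np.mean((target - prediction) ** 2, axis=axes)

def vae_loss(target, prediction, mu, log_var, reconstruction_weight=1000.0):
    """Weighted reconstruction error plus KL term, one value per example.

    reconstruction_weight is an illustrative hyperparameter, not the repo's.
    """
    return (reconstruction_weight * reconstruction_loss(target, prediction)
            + kl_divergence(mu, log_var))

# Toy batch: 4 "spectrograms" of shape (8, 8) and a 2-dimensional latent space.
rng = np.random.default_rng(0)
target = rng.random((4, 8, 8))
prediction = rng.random((4, 8, 8))
mu = rng.standard_normal((4, 2))
log_var = rng.standard_normal((4, 2))
z = sample_latent(mu, log_var, rng)
print(z.shape, vae_loss(target, prediction, mu, log_var).shape)
```

In a sketch like this, the weight on the reconstruction term controls the trade-off: a larger value favours faithful spectrograms, while a smaller one pulls the latent distribution closer to the standard normal prior that new samples are drawn from.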

ziadasem/generate_audio_using_vae
