🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Jun 15, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
A high-quality speech analysis, manipulation and synthesis system
General Speech Restoration
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Vietnamese Text to Speech library
PyTorch Implementation of FastDiff (IJCAI'22)
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
A fast, high-quality neural vocoder.
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
A vocoder framework which had been widely used in research community since 1999.
Fatcord's Alternative WaveRNN (Faster training)
Add a description, image, and links to the vocoder topic page so that developers can more easily learn about it.
To associate your repository with the vocoder topic, visit your repo's landing page and select "manage topics."