Zero-Shot Multi Speaker GradTTS

This repository provides audio samples of Zero-Shot Grad-TTS.

Audio Samples

The audio samples for Zero-Shot Grad-TTS can be found in link.

Comparison Dataset

For the comparison of synthesis performance for the Seen speaker, we randomly selected speakers from the LibriTTS dataset used in the learning and performed speech synthesis.

For the comparison of synthesis performance for Unseen Speakers, a total of 11 speakers were selected from the VCTK dataset and speech synthesis was performed. The 11 selected speakers are as follows.

VCTK: p225, p234, p245, p302

Comparison Model

For model comparison, we perform comparisons with the flow-based Zero-shot Multi Speaker voice synthesis models SC-GlowTTS and YourTTS.

Composite audio samples from SC-GlowTTS, YourTTS were used by downloading voice samples provided by the authors. You can download the authors' voice samples from the link below.

SC-GlowTTS Audio Sample.

YourTTS Audio Sample.

Acknowledgment

This work was partially supported by the Artificial Intelligence Industry Cluster Agency(AICA) grant funded by the Korea government(MSIT) (K-Digital Challenge : AI Startup Foundation Competition, 2023), and by the research fund from Chosun University, 2023.

License

These audio samples are MIT-licensed.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zero-Shot Multi Speaker GradTTS

Audio Samples

Comparison Dataset

Comparison Model

Acknowledgment

License

About

Releases

Packages

License

cjchun3616/zero_shot_gradtts

Folders and files

Latest commit

History

Repository files navigation

Zero-Shot Multi Speaker GradTTS

Audio Samples

Comparison Dataset

Comparison Model

Acknowledgment

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages