-
Notifications
You must be signed in to change notification settings - Fork 42
TTS generates only noise for new languages #18
Copy link
Copy link
Open
Description
I’m fine-tuning MiraTTS on indian language, and although training completes successfully, inference produces only random/noisy audio instead of intelligible speech. Prompts, codec, context, and semantic tokens are verified correct, so the issue seems to be the model failing to learn proper text → semantic alignment for this language/script. What are the recommended approaches to fix this, e.g., phoneme-based training, tokenizer modifications, or text preprocessing, and what common pitfalls could cause noise-only outputs despite successful training?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels