Mimic3 TTS models failing to load with INVALID_PROTOBUF error #58

Saisei2004 · 2024-06-02T17:01:55Z

I am encountering an issue with Mimic3 TTS models failing to load with the error "INVALID_PROTOBUF". Despite following the installation steps and ensuring the correct setup, the models are not loading correctly and the error persists.

Steps to reproduce the behavior:

Download and install the Mimic3 TTS models using the provided instructions.
Attempt to use the Mimic3 TTS models with the following command:

mimic3 --voice en_US/ljspeech_low "This is a test of the ljspeech voice model." > test_output_ljspeech.wav

Observe the error message: "Load model from /home/user/.local/share/mycroft/mimic3/voices/en_US/ljspeech_low/generator.onnx failed: Protobuf parsing failed."

The expected behavior is for the TTS model to load successfully and generate the audio output without errors.

Here are the relevant log files and error messages:

[ONNXRuntimeError] : 7 : INVALID_PROTOBUF : Load model from /home/user/.local/share/mycroft/mimic3/voices/en_US/ljspeech_low/generator.onnx failed: Protobuf parsing failed.

Environment:

Device type: Desktop
OS: Ubuntu 20.04
Mycroft-core version: N/A (using Mimic3 standalone)
Other versions: Mimic3 TTS version 0.2.5, Python 3.10

Additional context:
I have tried downloading and using multiple voice models (en_US/ljspeech_low, en_US/cmu-arctic_low, en_US/vctk_low) and encountered the same error with each. I followed the installation and usage instructions carefully but have not been able to resolve the issue. Any guidance or suggestions to resolve this problem would be greatly appreciated.

Thank you for your assistance.

The text was updated successfully, but these errors were encountered:

Takuchan · 2024-06-09T14:58:40Z

I tried cloning the mimic3 repository from GitHub and running the Dockerfile, but I encountered the same error.

synesthesiam · 2024-06-09T15:08:57Z

Inspect the contents of the onnx file. It is probably error text from a failed download due to GitHub.
You may need manually download voices from Huggingface: https://huggingface.co/mukowaty/mimic3-voices

Xeraster · 2024-07-09T03:34:11Z

Where do you put the downloaded files? Mimic3 doesn't run off the of the files cloned from the repository if installed based on the installation instructions which suggests the downloaded voice files need to be put somewhere special.

note to self: this is the least broken one ive tried so far (im going try every ai tts on github until i find one that actually works)

magiausde · 2024-07-24T23:59:09Z

Since I also ran into this issue today, I had a look at the problem. It is indeed a problem with downloading the generator.onnx file. That file is so large in size that it gets stored in Github's LFS (https://docs.github.com/repositories/working-with-files/managing-large-files/about-git-large-file-storage). This has the side-effect that a direct fetch via the URL

mimic3/mimic3_tts/const.py

Line 23 in be72c18

"https://github.com/MycroftAI/mimic3-voices/raw/master/voices/{lang}/{name}"

does not return the file-contents, but rather a so called pointer file. So currently after downloading, inside the filesystem generator.onnx contains for example:

version https://git-lfs.github.com/spec/v1
oid sha256:3330372429b25fe3a38b10bbe914862a49b2cd0a58da332bbe30fa123035a067
size 76340831

It requires multiple steps to get such an LFS-file, I found this very useful gist: https://gist.github.com/fkraeutli/66fa741d9a8c2a6a238a01d17ed0edc5

So, to fix this, it would be necessary to add this functionality to the downloader. In the meantime one can download the file manually for each voice.

magiausde · 2024-07-25T00:00:04Z

Where do you put the downloaded files? Mimic3 doesn't run off the of the files cloned from the repository if installed based on the installation instructions which suggests the downloaded voice files need to be put somewhere special.

note to self: this is the least broken one ive tried so far (im going try every ai tts on github until i find one that actually works)

The voices are stored at /home/mimic3/.local/share/mycroft/mimic3/voices/

mephi42 · 2024-08-22T17:59:31Z

I get the following when trying to fetch voices from LFS: batch response: This repository is over its data quota. Account responsible for LFS bandwidth should purchase more data packs to restore access.

Saisei2004 added the bug Something isn't working label Jun 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mimic3 TTS models failing to load with INVALID_PROTOBUF error #58

Mimic3 TTS models failing to load with INVALID_PROTOBUF error #58

Saisei2004 commented Jun 2, 2024

Takuchan commented Jun 9, 2024

synesthesiam commented Jun 9, 2024

Xeraster commented Jul 9, 2024 •

edited

Loading

magiausde commented Jul 24, 2024

magiausde commented Jul 25, 2024

mephi42 commented Aug 22, 2024

Mimic3 TTS models failing to load with INVALID_PROTOBUF error #58

Mimic3 TTS models failing to load with INVALID_PROTOBUF error #58

Comments

Saisei2004 commented Jun 2, 2024

Takuchan commented Jun 9, 2024

synesthesiam commented Jun 9, 2024

Xeraster commented Jul 9, 2024 • edited Loading

magiausde commented Jul 24, 2024

magiausde commented Jul 25, 2024

mephi42 commented Aug 22, 2024

Xeraster commented Jul 9, 2024 •

edited

Loading