Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Voice / Speaker keeps changing mid sentence #423

Open
lowl7 opened this issue Apr 29, 2023 · 6 comments
Open

Voice / Speaker keeps changing mid sentence #423

lowl7 opened this issue Apr 29, 2023 · 6 comments

Comments

@lowl7
Copy link

lowl7 commented Apr 29, 2023

No matter what I do, I don't even have any comma or period but the voice still changes completely to a different speaker mid sentence.

Something like "Hello there are you having a great day today on this most joyfull day" will have two speakers with completely different tone and pitch.

Tortoise is pretty cool, but it's basically useless when I have to sit through the generation only for it to generate something like that with two voices.

Is there any solution?

@G-force78
Copy link

I had the same , but it didnt do it a few weeks ago so Im not sure what has changed.

@CodexOmega
Copy link

How many audio clips are you using? I had the same issue when only using 3 or 4 clips. I'm using 62 10 second clips now and it sounds pretty good.

@n8bot
Copy link
Contributor

n8bot commented Apr 30, 2023

Yeah, some voices are better or worse than others. I have made a pull request to add more voices: #425

Each voice has dozens of audio clips.

Try them out yourself and tell me how you like them. They are from https://dillonbecker.itch.io/sdap

@lowl7
Copy link
Author

lowl7 commented May 10, 2023

How many audio clips are you using? I had the same issue when only using 3 or 4 clips. I'm using 62 10 second clips now and it sounds pretty good.

I've tried as low as 2 clips and as high as 30 clips, no luck... voice changes every now and then.

@Florencehinder
Copy link

I've had the same problem using one of the fine tuned voiced train_kennard.

I'm generally struggling to keep the voice consistent one sentence against another, it seems to change substantially. Any ways to reduce this?

@G-force78
Copy link

I've had the same problem using one of the fine tuned voiced train_kennard.

I'm generally struggling to keep the voice consistent one sentence against another, it seems to change substantially. Any ways to reduce this?

i think this is just one of the limitations of the model/code as it seems to start iterating again with new metrics in a new sentence, reading from a text file seems better with stop tokens but still there is a change, or no change at all making it a monotone parrotting of words.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants