zero-shot voice not similar to the target #197

Open
genadik opened this issue Sep 30, 2024 · 1 comment

genadik commented Sep 30, 2024

Hi all,
I am generating zero-shot synthetic speech, but the output is not even close to the reference speaker's voice (sometimes it is even a different gender).
I pass the name of the reference speaker file to the tts.synthesise function. The produced audio differs from one reference speaker to another, but it is never similar to the target.
Any idea what could be wrong?

For reference, I use more than 1 minute of audio from the Multilingual LibriSpeech dataset (English part).

genadik commented Sep 30, 2024

I found the problem: the path to the reference speaker audio must be absolute. In my case the code could not find the reference file and, without any warning, fell back to a random speaker voice.
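
For anyone hitting the same silent fallback, here is a minimal sketch of a guard that can run before synthesis. It turns the reference path into an absolute one and fails loudly if the file is missing. `resolve_reference_audio` is a hypothetical helper written for this comment, not part of the library, and the `tts.synthesise` call is left as a comment because its exact arguments are not shown in this thread.

```python
from pathlib import Path


def resolve_reference_audio(path):
    """Return an absolute path to the reference audio, or raise if it is missing.

    Hypothetical helper for illustration; not part of the TTS library.
    """
    # expanduser() handles "~", resolve() makes the path absolute
    resolved = Path(path).expanduser().resolve()
    if not resolved.is_file():
        # Fail loudly instead of letting the model pick a random speaker
        raise FileNotFoundError(f"Reference speaker audio not found: {resolved}")
    return resolved


# Usage (argument layout of tts.synthesise is an assumption):
# ref = resolve_reference_audio("speakers/ref.wav")
# audio = tts.synthesise(text, str(ref))
```

With this check in place, a wrong or relative path that does not resolve raises `FileNotFoundError` right away, so a mismatched voice is no longer the only symptom.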
