Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Possible way to avoide English accent while training on other languages #52

Open
andreibezborodov opened this issue Mar 13, 2023 · 3 comments

Comments

@andreibezborodov
Copy link

Hello!

Thanks for great work! I was trying to finetune a model using non English datasets (Russian, etc.). The resulting voice is really good, but I keep getting the result with super strong English accent even after long training. Are there any possible ways to reduce the accent (or ideally get rid of it)?
I guess that the problem is because of the fine-tuning process using English model..

@HobisPL
Copy link

HobisPL commented Mar 13, 2023

You can try this.
#51

@andreibezborodov
Copy link
Author

You can try this. #51

Thank you! I would also mention that for training on Cyrillic letters it is also required to change the english_cleaners to basic_cleaners. I've made a new tokenizer and started training, but the results so far are not good.

Can you please tell how big was your dataset and for how long did you train? I wonder how big should be a dasatet for fine-tuning on a new language.

@pivolan
Copy link

pivolan commented Oct 17, 2023

@andreibezborodov hi, can you help me with start finetuning on another languages? @cherpekat telegram. Cannot connect with you by email in your github profile.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants