Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GREAT 💟 #1

Open
manhph2211 opened this issue May 18, 2022 · 21 comments
Open

GREAT 💟 #1

manhph2211 opened this issue May 18, 2022 · 21 comments

Comments

@manhph2211
Copy link

Well, for me, this repo deserves thousands of stars 👍

@LegendRGC
Copy link

Well, for me, this repo deserves thousands of stars 👍

Is the model available,please?

@tuanh123789
Copy link
Owner

Yes i have pretrained model training on Vietnamese and English

@yiwei0730
Copy link

yiwei0730 commented May 31, 2022

@tuanh123789
Can I ask for the loss graph in the tensorboard?
I have some problem in phoneme_level_prediction, the loss is always high, when i running the code which i wrote.

@tuanh123789
Copy link
Owner

@yiwei0730 Yes, I will update loss graph in tensorboard soon. In my experiments with both English and Vietnamese phoneme_level_prediction loss is high, but it still working. When phoneme_level_prediction loss convergence, it's about 1.5

@yiwei0730
Copy link

yiwei0730 commented May 31, 2022

@tuanh123789 Thanks for your apply. Is your loss convergence says for the Validation data or the training data
For my experiments the training acoustic loss is in 3 and the validation acoustic loss is in 1.8, but maybe the difference with i did'nt use the masked part in the phoneme level loss

@yiwei0730
Copy link

There is an another problem with tour training, In AdaSpeech paper, the acoustic modeling part is list in phoneme level -> utterance level -> speaker embedding,but in your code you inheritance the list in "rishikksh github" using utterance level -> phoneme level -> speaker embedding,maybe this is a little bug in your code.

@LegendRGC
Copy link

Yes i have pretrained model training on Vietnamese and English

thank you!

@LegendRGC
Copy link

Yes i have pretrained model training on Vietnamese and English

Could you upload your current project include the "text grid folder" as a example,please?I can't run the "preprocess.py" because I don't know what I lack at present.thank you very much!

@LegendRGC
Copy link

about english

@tuanh123789
Copy link
Owner

@LegendRGC ok, i will update data sample soon

@LegendRGC
Copy link

@LegendRGC ok, i will update data sample soon
thank you very much,sir!

@LegendRGC
Copy link

Yes i have pretrained model training on Vietnamese and English

Excuse me,how to get the "checkpoint" file,please?

@LegendRGC
Copy link

Yes i have pretrained model training on Vietnamese and English

Excuse me,how to get the "checkpoint" file,please?

the "vocoder checkpoint"

@tuanh123789
Copy link
Owner

Yes i have pretrained model training on Vietnamese and English

Could you upload your current project include the "text grid folder" as a example,please?I can't run the "preprocess.py" because I don't know what I lack at present.thank you very much!

I uploaded data sample in preprocessed_data and raw_data folder, please check and try again ^^

@tuanh123789
Copy link
Owner

@tuanh123789 Can I ask for the loss graph in the tensorboard? I have some problem in phoneme_level_prediction, the loss is always high, when i running the code which i wrote.

I upload tensorboard for pretrain and finetune in Readme, please check ^^

@yiwei0730
Copy link

Yes, I check your tensorboard, I've seen something interesting.
the phoneme level loss validation have the same result in my training. but the training loss is not the same.
But I don't used the mask in this loss, since it make our acoustic loss more bigger.
And your acoustic loss maybe is the correct, since my acoustic loss is increasing in training but your acoustic loss is decreasing.

@tuanh123789
Copy link
Owner

Yes, I check your tensorboard, I've seen something interesting. the phoneme level loss validation have the same result in my training. but the training loss is not the same. But I don't used the mask in this loss, since it make our acoustic loss more bigger. And your acoustic loss maybe is the correct, since my acoustic loss is increasing in training but your acoustic loss is decreasing.

Yes, you can try masking to compute loss, hope it working ^^

@yiwei0730
Copy link

Thank you, I want to ask for the phoneme level loss, did you have some idea for decline the loss value?

@arampacha
Copy link

Yes i have pretrained model training on Vietnamese and English

Hi, thanks for your work!
Are there pretrained models available already? Could you point me to checkpoints pls?

@zaverichintan
Copy link

I started training (pretraining) using LibriTTS dataset.
I see the following behaviour:
Screenshot 2022-08-16 at 03 47 13

Should I change the batch size or any other parameters?

@vedantk-b
Copy link

Hey, if anyone has some pretrained weights, can they share a drive link or something to that?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants