Can anyone share a pre-trained model for AISHELL3? #18

Open
Dyongh613 opened this issue Mar 29, 2022 · 4 comments

Comments

@Dyongh613

Can anyone share a pre-trained model for AISHELL3?

@keonlee9420
Owner

Great suggestion, @qw1260497397! I'm not familiar with Chinese, so I hope someone can train on the AISHELL3 dataset and share the results with the community.

@Dyongh613
Author

After 5000 training iterations on AISHELL3, training fails with the following error:
  File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 222, in forward
    duration_w_rounded, src_w_len, mel_mask))
  File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 140, in add_position_enc
    pos_enc = coef.unsqueeze(-1) * pos_enc
RuntimeError: The size of tensor a (1298) must match the size of tensor b (1001) at non-singleton dimension 1

@keonlee9420
Owner

I see. I think you need to increase max_seq_len in model.yaml so that it is greater than the value recorded in preprocessed_data/AISHELL3/stats.json. For example, the value was 870 for LJSpeech, so I set max_seq_len to 1000.
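
A minimal sketch of that check, for anyone who wants to pick the new value programmatically. The helper names `suggest_max_seq_len` and `longest_len_from_stats`, the 10% headroom, and the rounding to a multiple of 100 are illustrative choices, not part of the repository. The key name `"max_seq_len"` inside stats.json is also an assumption; open the file and adjust the key if it is stored differently.

```python
import json
from pathlib import Path


def suggest_max_seq_len(longest_len: int, margin_pct: int = 10, round_to: int = 100) -> int:
    """Return a value strictly greater than longest_len, with some headroom.

    The headroom and rounding are arbitrary illustrative defaults.
    """
    if longest_len <= 0:
        raise ValueError("longest_len must be positive")
    # Add at least 1 so the result is always strictly greater than longest_len.
    padded = longest_len + max(1, (longest_len * margin_pct) // 100)
    # Round up to the next multiple of round_to using integer arithmetic.
    return -(-padded // round_to) * round_to


def longest_len_from_stats(stats_path: str, key: str = "max_seq_len") -> int:
    """Read the longest sequence length from a stats.json file.

    ASSUMPTION: the length is stored as a top-level entry under `key`.
    """
    stats = json.loads(Path(stats_path).read_text(encoding="utf-8"))
    return int(stats[key])


if __name__ == "__main__":
    # 870 is the LJSpeech value mentioned above; 1298 is the length
    # reported in the traceback, which may not be the dataset maximum.
    print(suggest_max_seq_len(870))   # 1000
    print(suggest_max_seq_len(1298))  # 1500
```

With the defaults, 870 maps to 1000, which matches the LJSpeech setting described above. For AISHELL3, pass the value read from preprocessed_data/AISHELL3/stats.json rather than the 1298 from the traceback, since a later batch could contain a longer sequence.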

@Dyongh613
Author

Dyongh613 commented Apr 3, 2022 via email
