Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问如何用 fairseq 训练 wenetspeech #25

Open
panpan-wu opened this issue Oct 20, 2022 · 2 comments
Open

请问如何用 fairseq 训练 wenetspeech #25

panpan-wu opened this issue Oct 20, 2022 · 2 comments

Comments

@panpan-wu
Copy link

大佬,能把 fairseq 训练 wenetspeech 的脚本、配置文件上传一下吗?我想复现这个过程,但是一些参数不知道应该设为多少,比如 kmeans 聚类时的 cluster 数量,dataset 的 max_tokens 等。

@pengchengguo
Copy link
Collaborator

你好,

除了 min_sample_size: 10000 以外剩下的所有配置和 librispeech 默认配置一样。

@GUOhm230
Copy link

请问大佬们,在训练fairseq中hubert的时候,dict.km.txt都是设置成1嘛?kmeans fit时nshard设置为多少?微调的时候dict.ltr.txt是怎么来的呢?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants