Questions Regarding the Tasks Used for Model Training #171

Open
carter54 opened this issue Jan 6, 2025 · 0 comments

carter54 commented Jan 6, 2025

Hello,

Nice work, guys. I have been looking into your speech model and have a few questions about the training setup.

From the code, it seems that during training, only translation, text continuation, and ASR tasks were used. Could you share the rationale behind selecting these tasks for training?

Besides the datasets uploaded to Hugging Face, were any other datasets used during training? If so, could you provide more details on them?

Thank you for your time, and I look forward to your insights!
