Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to pretrain the model from scratch? #216

Open
veelion opened this issue Jan 14, 2025 · 1 comment
Open

How to pretrain the model from scratch? #216

veelion opened this issue Jan 14, 2025 · 1 comment

Comments

@veelion
Copy link

veelion commented Jan 14, 2025

Hi @rajatsen91

Thanks for sharing the finetuning code, I have finetuned the 2.0 model with my own data and get the better result.

I want to train the model from scratch, will you have the plan to release the train code from scratch?

Thanks!

@Mhdaw
Copy link

Mhdaw commented Jan 22, 2025

I think sharing the dataset used to training is more important than the code, And I don't know if they plan to share the dataset, If they do pre-training from scratch is probably like fine-tuning(without checkpoint loading and some other parts).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants