task: Ichigo LLM v0.5 Training #122
Comments
LoRA for pretraining, ref: https://unsloth.ai/blog/contpretraining |
@hahuyhoang411 - Can we combine this into #116? |
Pretraining phase of Ichigo v0.5
Methodology
Hyperparams
Results
Learnings
Quicklinks
|
Instruction tuning phase of Ichigo v0.5
Methodology
Hyperparams
Results
Learnings
Quicklinks
|
Data Quality Issue Resolution and Pipeline Update
Issue
The text-to-semantic model is undertrained on English, causing noise in a small fraction of the output data.
Analysis
Resolution
Quick link |
Can you add an example of affected data, @bachvudinh? |
Instruction tuning phase of Ichigo v0.5 (Second Attempt)
Methodology
Hyperparams
Results
Learnings
Quicklinks
|
Training Issues and Budget Request Report
We are currently training a large language model using 8x H100 GPUs via RunPod. The initial budget estimate was $800 for a 32-hour training run.
Technical Issues Encountered
Solution
Financial Impact
Additional Budget Request
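As a rough sanity check on the initial estimate quoted in this report ($800 for a 32-hour run on 8x H100), the implied hourly rates can be worked out as in the sketch below. Only the budget, duration, and GPU count come from the report; the function names and the `extra_hours` value are hypothetical and for illustration only, and this does not reflect actual RunPod pricing or the real overrun.

```python
# Figures quoted in the report: $800 budget, 32-hour run, 8x H100.
INITIAL_BUDGET_USD = 800.0
PLANNED_HOURS = 32.0
NUM_GPUS = 8


def implied_rates(budget_usd: float, hours: float, gpus: int) -> tuple[float, float]:
    """Return (cost per node-hour, cost per GPU-hour) implied by a budget."""
    node_rate = budget_usd / hours
    gpu_rate = node_rate / gpus
    return node_rate, gpu_rate


def overrun_cost(extra_hours: float, node_rate: float) -> float:
    """Cost of running the whole node for extra_hours beyond the plan."""
    return extra_hours * node_rate


node_rate, gpu_rate = implied_rates(INITIAL_BUDGET_USD, PLANNED_HOURS, NUM_GPUS)
print(f"Implied node rate: ${node_rate:.2f}/hour")      # $25.00/hour
print(f"Implied GPU rate:  ${gpu_rate:.3f}/GPU-hour")   # $3.125/GPU-hour

# Hypothetical example only: 4 extra hours is NOT a figure from the report.
extra_hours = 4.0
print(f"Cost of {extra_hours:.0f} extra hours: ${overrun_cost(extra_hours, node_rate):.2f}")
```

Under these assumptions, every hour lost to a restart or a failed run costs about $25 at the planned rate, which is one way to frame an additional budget request.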
|
Issues report
We are facing some issues with Ichigo v0.5. In short, it is really bad.
Issues:
Root causes:
Mitigation plans:
|
Goal
Experiment with the Ichigo model on multilingual data.
Tasklist