Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

使用Qwen-7B使用Qlora时报错在阿里的PAI-DSW pending This problem is yet to be addressed
#5516 opened Sep 23, 2024 by lzy728
1 task done
默认的optimizer是什么?如何添加自己的optimizer如SGD? pending This problem is yet to be addressed
#5514 opened Sep 23, 2024 by DSW2001
1 task done
How to train the mm_proj and the LLM part with lora of Qwen2-VL pending This problem is yet to be addressed
#5512 opened Sep 22, 2024 by leoozy
1 task done
Deepseek v2.5的 template 变了,与 v2不同 pending This problem is yet to be addressed
#5506 opened Sep 21, 2024 by piamo
1 task done
pretrain from scratch 输出都是数字 pending This problem is yet to be addressed
#5504 opened Sep 21, 2024 by UbeCc
1 task done
在checkpoint上继续训练,没有保存训练后的checkpint pending This problem is yet to be addressed
#5499 opened Sep 20, 2024 by cuisws
1 task done
save_only_model后无法续训 pending This problem is yet to be addressed
#5497 opened Sep 20, 2024 by yuepengs
1 task done
Can you support Jamba 1.5 model and Mamba family models, mamba2-hybrid, ssm model, etc pls? pending This problem is yet to be addressed
#5496 opened Sep 20, 2024 by badrabbitt
1 task done
用视频数据微调qwen2-vl-7b的算力要求是什么? pending This problem is yet to be addressed
#5493 opened Sep 20, 2024 by J0eky
1 task done
昇腾910B npu8卡训练显存不足 npu This problem is related to NPU devices pending This problem is yet to be addressed
#5491 opened Sep 20, 2024 by LtroiNGU
1 task done
Is LLAVA chat template correct? pending This problem is yet to be addressed
#5489 opened Sep 20, 2024 by mibejjh
1 task done
Running on machines with limited number of online programs pending This problem is yet to be addressed
#5488 opened Sep 19, 2024 by moshushi007ow
1 task done
启动 webui失败 pending This problem is yet to be addressed
#5485 opened Sep 19, 2024 by ClementeGao
1 task done
请问DPO训练的时候有什么注意事项吗?我训练出来效果很差。 pending This problem is yet to be addressed
#5484 opened Sep 19, 2024 by zlh-source
1 task done
sft do_predict, 生成的json 文件 的 label 都是空 pending This problem is yet to be addressed
#5465 opened Sep 18, 2024 by dayuyang1999
1 task done
qwen2_vl模型训练异常 pending This problem is yet to be addressed
#5462 opened Sep 18, 2024 by will-wiki
AttributeError: 'Qwen2Attention' object has no attribute 'max_position_embeddings' pending This problem is yet to be addressed
#5461 opened Sep 17, 2024 by chengchengpei
1 task done
Tips for implementing LlaMa-Factory for new Hardwares pending This problem is yet to be addressed
#5460 opened Sep 17, 2024 by EtashGuha
no such a file or directory of data pending This problem is yet to be addressed
#5457 opened Sep 17, 2024 by Esmail-ibraheem
1 task done
ProTip! Type g i on any issue or pull request to go back to the issue listing page.