Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ValueError: Unknown model class ModelArguments(model_name_or_path='lmms-lab/LongVA-7B' #20

Open
Wiselnn570 opened this issue Aug 24, 2024 · 3 comments

Comments

@Wiselnn570
Copy link

I got an error when running dpo.sh. I noticed that the script's entry point is llava/train/train_dpo.py. However, when I changed it to long/train/train_dpo.py, the bug occurred.

@kcz358
Copy link
Contributor

kcz358 commented Aug 24, 2024

Hi, I have fixed this issue. Can you try again? We forgot to update the get_model with our current base_model name in the dpo training script

@Wiselnn570
Copy link
Author

Hi, I have fixed this issue. Can you try again? We forgot to update the get_model with our current base_model name in the dpo training script

Thank you, it’s working fine now. But if I need to add some video data during fine-tuning maybe to give the model new capabilities and train it together with the existing image data, how should I modify the fine-tuning script parameters? Does the current model support this type of training, or have you previously tried adding video data during fine-tuning? Thank you for your further explanation. I have already tried it, and I’m impressed by the model’s capabilities.

@kcz358
Copy link
Contributor

kcz358 commented Sep 4, 2024

I think you can prepare the json file just like the image, just change the image to video and add video_path in the args

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants