Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finetuning on specific datasets #10

Open
HashmatShadab opened this issue Sep 6, 2024 · 2 comments
Open

Finetuning on specific datasets #10

HashmatShadab opened this issue Sep 6, 2024 · 2 comments

Comments

@HashmatShadab
Copy link

Is there an option in the codebase to do the finetuning on only selected datasets mentioned in the readme?

@flyinglynx
Copy link
Collaborator

Yes, you can convert your dataset into LLaVA's format and update the data path in the script accordingly.

Essentially, you'll need to transform your annotations into a list of conversation data. For more details, please refer to the example annotation JSON file. If you run into any issues, feel free to reach out to us.

If you dataset is small, please consider using efficient tuning techniques like LoRA.

@HashmatShadab
Copy link
Author

Thank you for explaining. I was specifically talking about using specific datasets for finetuning that are mentioned in the readme. So for that i can just update the json file

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants