Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What validation dataset did you use during the training process? #215

Closed
jpan72 opened this issue Jul 31, 2024 · 2 comments
Closed

What validation dataset did you use during the training process? #215

jpan72 opened this issue Jul 31, 2024 · 2 comments

Comments

@jpan72
Copy link

jpan72 commented Jul 31, 2024

Hello authors,

What validation dataset did you use to during the training epochs of stage 1, 2, and 3, respectively? I believe validation accuracy is important to monitor model convergence and avoid issues like over-fitting.

Thank you!

@Andy1621
Copy link
Collaborator

Andy1621 commented Aug 1, 2024

Good question! It's hard to evaluate the performance directly.

For stage1, it works like BLIP2 stage1. You can use retrieval tasks to verify it.
For stage2, it only use video cpation or image caption, and it's hard to follow instructions. Thus we just verify it by some selected examples and check whether the output video/image captions are reasonable.
For stage3, we use MVBench.

@yinanhe
Copy link
Member

yinanhe commented Oct 11, 2024

Hi, we will close this issue.

Feel free to contact us if you have other questions.

@yinanhe yinanhe closed this as completed Oct 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants