Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Which mmlu validation setting is recommend? #714

Open
mathfinder opened this issue Aug 27, 2024 · 1 comment
Open

Which mmlu validation setting is recommend? #714

mathfinder opened this issue Aug 27, 2024 · 1 comment
Labels
type/question An issue that's a question

Comments

@mathfinder
Copy link

❓ The question

I found that you provide many mmlu test methods.
Take mmlu_stem as an example, including mmlu_stem_test, mmlu_stem, mmlu_stem_var, mmlu_stem_mc_5shot, mmlu_humanities_mc_5shot, mmlu_humanities_mc_5shot_test.
Which one is more recommended?

@mathfinder mathfinder added the type/question An issue that's a question label Aug 27, 2024
@aman-17
Copy link
Member

aman-17 commented Oct 22, 2024

For initial testing, it’s recommended to start with easier tasks like the 5-shot methods (e.g., mmlu_stem_mc_5shot or mmlu_humanities_mc_5shot_test). These are useful for evaluating the model’s ability to generalize with a few examples. However, for less capable models, it is not recommended to rely on multiple-choice (MC) tasks right away, as they may not perform well. The focus should be on simpler tasks to gauge the model’s baseline performance before moving to more complex evaluations like MC.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/question An issue that's a question
Projects
None yet
Development

No branches or pull requests

2 participants