Adding WildBench #3150

Open
wants to merge 9 commits into
base: main

@liamjxu (Contributor) commented Nov 12, 2024

Added WildBench scenario, adapter, run specs, annotator, and metrics.

TODO:

  • Add a customized adapter that applies the chat template for model inference
  • Align the GPT-as-a-judge prompt format with the original repo

Comment:

  • I created a new adapter, ChatAdapter, that passes chat messages when initializing the Request. It can most likely be optimized, so suggestions are very welcome.
  • Right now only the WB score is included in the schema; WB reward could be added as well.
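
To make the ChatAdapter discussion concrete, here is a minimal sketch of the idea of building a request from chat messages instead of a flat prompt. `SketchRequest` and `build_chat_request` are hypothetical stand-ins written for illustration, not the classes in this PR, and the sketch assumes the conversation starts with a user turn and that roles strictly alternate.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class SketchRequest:
    """Hypothetical stand-in for a model request; not the real Request class."""

    model: str
    prompt: str = ""
    # Chat-style input: a list of {"role": ..., "content": ...} dicts.
    messages: Optional[List[Dict[str, str]]] = None


def build_chat_request(model: str, turns: List[str]) -> SketchRequest:
    """Convert a list of conversation turns into a chat-message request.

    Assumption: turns[0] is from the user and roles alternate after that.
    """
    roles = ("user", "assistant")
    messages = [
        {"role": roles[i % 2], "content": text} for i, text in enumerate(turns)
    ]
    # The prompt is left empty; the client is expected to apply the
    # model's chat template to `messages` at inference time.
    return SketchRequest(model=model, messages=messages)


request = build_chat_request(
    "example/model", ["Hi", "Hello!", "Summarize WildBench."]
)
```

Keeping the messages structured until the client call means the chat template only has to be applied in one place, which may be one way to simplify the adapter.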
