On the AlpacaEval leaderboard, models such as OpenChat-13b-V3.2, WizardLM-13B-V1.2, and Vicuna-13b-V1.5-16k have shown outstanding performance. Does OpenCompass plan to support evaluation of these models?
Answered by
tonysy
Sep 1, 2023
Thanks for the suggestions. We have added these models to our evaluation plan, and the results will be updated in the next one or two weeks.
Answer selected by
tonysy