Missing prompts “single-v1-multi-turn” and “single-math-v1-multi-turn” in Japanese MT-Bench #16

Kosuke-Yamada · 2024-05-20T21:24:04Z

Thank you for maintaining the benchmarks. I am currently evaluating models with Japanese MT-Bench. To run the function 'make_judge_single' in gen_judgement.py, I need the prompts “single-v1-multi-turn” and “single-math-v1-multi-turn”, but I can't find them. fastchat/llm_judge/data/judge_ja_prompts.jsonl seems to contain only the prompts “single-v1” and “single-math-v1”. I would appreciate it if you could tell me where to find the multi-turn prompts or how to run the multi-turn evaluation.
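For anyone who wants to confirm which prompts a judge prompt file actually defines, here is a minimal sketch. It assumes each line of the JSONL file is a JSON object with a "name" field, as in the upstream FastChat judge prompt file; the helper `missing_prompt_names` is hypothetical and not part of the repository.

```python
import json

# Prompt names mentioned in this issue.
REQUIRED_PROMPTS = {
    "single-v1",
    "single-math-v1",
    "single-v1-multi-turn",
    "single-math-v1-multi-turn",
}


def missing_prompt_names(jsonl_path, required=REQUIRED_PROMPTS):
    """Return the required prompt names that the JSONL file does not define.

    Assumption: every non-empty line is a JSON object with a "name" key.
    """
    found = set()
    with open(jsonl_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            name = json.loads(line).get("name")
            if name is not None:
                found.add(name)
    return sorted(set(required) - found)


if __name__ == "__main__":
    # Path taken from the issue text; adjust to your checkout.
    path = "fastchat/llm_judge/data/judge_ja_prompts.jsonl"
    print(missing_prompt_names(path))
```

An empty list means all four prompts are present. Any names printed are the ones the judging step would fail to look up.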

shyram · 2024-06-18T08:14:17Z

The model_judge file also contains no evaluation results for the second turn.

Where can I find the evaluation results for the second turn?
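One way to check this is to count the judgment records per turn. The sketch below assumes the judgment file is JSONL and that each record carries a "turn" field, as upstream FastChat's judgment output does. The function name `count_judgments_by_turn` and the example path are hypothetical.

```python
import json
from collections import Counter


def count_judgments_by_turn(jsonl_path):
    """Count judgment records per value of the "turn" field.

    Assumption: every non-empty line is a JSON object; records without
    a "turn" key are counted under None.
    """
    counts = Counter()
    with open(jsonl_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                counts[json.loads(line).get("turn")] += 1
    return dict(counts)


if __name__ == "__main__":
    # Hypothetical path; point this at your own judgment file.
    print(count_judgments_by_turn("model_judgment/gpt-4_single.jsonl"))
```

If the result has no entry for turn 2, the second turn was never judged, which matches what you would expect when the multi-turn prompts are missing.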

Kosuke-Yamada changed the title ~~Missing “single-v1-multi-turn” and “single-math-v1-multi-turn” prompts in Japanese MT-Bench~~ Missing prompts “single-v1-multi-turn” and “single-math-v1-multi-turn” in Japanese MT-Bench May 20, 2024
