Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about ARC datasets #71

Open
Zoeyyao27 opened this issue Dec 7, 2023 · 0 comments
Open

Questions about ARC datasets #71

Zoeyyao27 opened this issue Dec 7, 2023 · 0 comments

Comments

@Zoeyyao27
Copy link

  • When reproducting, I found Llama2-7b can not output the answer in the desired format and hence failed when using exact match. How do you generate the answer in the desired format? Do you add any prompt or just use the past_key_values to do that?
  • In section 4.3, you said "We first concatenate all question-answer pairs from the ARC-[Challenge, Easy] datasets, feed the
    continuous stream to Llama-2-[7,13,70]B-Chat models, and assess model completions at each answer position using an exact match criterion" Do you do it by input:[q1] ->output: [a1] -> past_key_value[q1+a1] +input:[q2] -> output:[a2]->... ?
  • Can you provide the evaluation script for ARC dataset?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant