Pull requests: bigcode-project/bigcode-evaluation-harness
Add a new benchmark ENAMEL for evaluating the efficiency of LLM-generated code
#260 opened Jul 22, 2024 by q-rz
fix: Multiple-E dataset fix go_test.go path for test execution
#225 opened Apr 20, 2024 by hitesh-1997
remove pad tokens added by the accelerator.pad_across_processes
#216 opened Apr 13, 2024 by IQ17
fix apps evaluate error: local variable 'level' referenced before assignment
#206 opened Mar 10, 2024 by koking0
Add support for Ollama, Palm, Claude-2, Cohere, Replicate, Llama2 CodeLlama (100+LLMs) [LiteLLM]
#160 opened Nov 9, 2023 by ishaan-jaff
Dockerfile-multiple no longer fetches pip dependencies needlessly
#157 opened Nov 1, 2023 by RemcoSchrijver
Adding additional optional args for decoding flags and AutoModel kwargs to support models like ReplitLM
#115 opened Jul 12, 2023 by madhavatreplit
Support Seq2SeqLM model class (to facilitate the CodeT5+ models)
#104 opened Jun 26, 2023 by keyboardAnt
Attempt to make MultiPl-E's evaluation parallelization over all completions at once rather than just over each problem.
#86 opened Jun 7, 2023 by esslushy