bigcode-project / bigcode-evaluation-harness Public

Notifications You must be signed in to change notification settings
Fork 244
Star 948

Code
Issues 59
Pull requests 35
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: bigcode-project/bigcode-evaluation-harness

Labels 9 Milestones 6

New pull request New

35 Open 119 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix UnboundLocalError in APPS task evaluation

#310 opened Apr 20, 2025 by sad-mathematician

Loading…

Updated bigcode-evaluation-harness/leaderboard/README.md

#305 opened Mar 13, 2025 by zoya-hammad

Loading…

Support for remote inference

#302 opened Feb 17, 2025 by pawelknes

Loading…

use tokenizer.chat_template by default for instruction type tasks

#301 opened Jan 25, 2025 by TK-21st

Loading…

Missing comma in MultiPL-E languages

#299 opened Jan 13, 2025 by hrshtv

Loading…

[Pytest] Fix bad import to use relative instead @ module_test

#298 opened Jan 11, 2025 by ggcr

Loading…

Fix the bugs in the ds1000 sample bash script; Fix typos

#295 opened Dec 10, 2024 by gameofby

Loading…

Update multiple.py

#292 opened Dec 8, 2024 by ahmedashrafy

Loading…

Support multiple datasets from MBPP; Fix missing commas in python list; Fix doc typos;

#291 opened Dec 4, 2024 by gameofby

Loading…

add support for hpu devices

#281 opened Oct 25, 2024 by envsp

Loading…

"," missing in LANGUAGES list

#280 opened Oct 21, 2024 by ArtemisDicoTiar

Loading…

Speedup execute.py: Reuse same manager and dict in

#277 opened Oct 1, 2024 by michaelfeil

Loading…

Basecodes

#263 opened Aug 14, 2024 by Abhineetsoccer

Loading…

Add a new benchmark ENAMEL for evaluating the efficiency of LLM-generated code

#260 opened Jul 22, 2024 by q-rz

Loading…

Fix Max New Tokens in HF's Generation Config

#257 opened Jul 18, 2024 by mostafaelhoushi

Loading…

Fix unnecessary repeated overwrite

#249 opened Jun 29, 2024 by nielstron

Loading…

fix: Multiple-E dataset fix go_test.go path for test execution

#225 opened Apr 20, 2024 by hitesh-1997

Loading…

Add llama3 instruction prompts

#222 opened Apr 19, 2024 by TechxGenus

Loading…

Leaderboard README improvements

#217 opened Apr 14, 2024 by nikita1503

Loading…

remove pad tokens added by the accelerator.pad_across_processes

#216 opened Apr 13, 2024 by IQ17

Loading…

Ensure generations get saved in generation_only mode

#212 opened Mar 31, 2024 by Vipitis

Loading…

fix apps evaluate error: local variable 'level' referenced before assignment

#206 opened Mar 10, 2024 by koking0

Loading…

Update README.md

#204 opened Mar 2, 2024 by AnitaLiu98

Loading…

Fix loading PAL-GSM few-shot examples

#196 opened Feb 8, 2024 by sxjscience

Loading…

Fix typo in README.md

#177 opened Jan 2, 2024 by ab-10

Loading…

Previous 1 2 Next

Previous Next

ProTip! Filter pull requests by the default branch with base:main.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!