Fix incorrect annotations for Omni-MATH and WildBench for empty outputs #7929
Set up job
1s
1s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run actions/checkout@v4
1s
1s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Set up Python 3.12
14s
14s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run python3 -m pip install --upgrade build
6s
6s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run python3 -m build
7s
7s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run python3 -m pip install dist/crfm_helm-*.whl
6m 58s
6m 58s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run helm-run --run-entries simple1:model=simple/model1 --max-eval-instances 10 --suite test
5s
5s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run helm-summarize --suite test
5s
5s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Run helm-server --help
0s
0s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Post Set up Python 3.12
0s
0s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Post Run actions/checkout@v4
0s
0s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Complete job
0s
0s
Error:
This step has been truncated due to its large size. Download the full logs from the menu
once the workflow run has completed.
Loading