Skip to content

Commit

Permalink
Merge branch 'main' into add-llm-medical-eval
Browse files Browse the repository at this point in the history
  • Loading branch information
yan-gao-GY authored Sep 9, 2024
2 parents 822cf4d + 9afe0f8 commit 2b72996
Show file tree
Hide file tree
Showing 3 changed files with 2 additions and 4 deletions.
2 changes: 0 additions & 2 deletions .github/workflows/e2e.yml
Original file line number Diff line number Diff line change
Expand Up @@ -146,8 +146,6 @@ jobs:
if: ${{ github.repository == 'adap/flower' && !github.event.pull_request.head.repo.fork && github.actor != 'dependabot[bot]' }}
run: |
python -m pip install https://${{ env.ARTIFACT_BUCKET }}/py/${{ needs.wheel.outputs.dir }}/${{ needs.wheel.outputs.short_sha }}/${{ needs.wheel.outputs.whl_path }}
- name: Install e2e components
run: pip install .
- name: Download dataset
if: ${{ matrix.dataset }}
run: python -c "${{ matrix.dataset }}"
Expand Down
2 changes: 1 addition & 1 deletion benchmarks/flowertune-llm/evaluation/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ The default template generated by `flwr new` (see the [Project Creation Instruct

| | MBPP | HumanEval | MultiPL-E (JS) | MultiPL-E (C++) | Avg |
|:----------:|:-----:|:---------:|:--------------:|:---------------:|:-----:|
| Pass@1 (%) | 32.60 | 26.83 | 29.81 | 24.22 | 28.37 |
| Pass@1 (%) | 31.60 | 23.78 | 28.57 | 25.47 | 27.36 |


## Make submission on FlowerTune LLM Leaderboard
Expand Down
2 changes: 1 addition & 1 deletion benchmarks/flowertune-llm/evaluation/general-nlp/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ huggingface-cli login
Download data from [FastChat](https://github.com/lm-sys/FastChat):

```shell
git clone --depth=1 https://github.com/lm-sys/FastChat.git && cd FastChat && git checkout d561f87b24de197e25e3ddf7e09af93ced8dfe36 && mv fastchat/llm_judge/data ../data && cd .. && rm -rf FastChat
git clone https://github.com/lm-sys/FastChat.git && cd FastChat && git checkout d561f87b24de197e25e3ddf7e09af93ced8dfe36 && mv fastchat/llm_judge/data ../data && cd .. && rm -rf FastChat
```


Expand Down

0 comments on commit 2b72996

Please sign in to comment.