[Refactor]: Refactor the evaluation directory #5222
Labels: fix-me
Comments
Sorry, to clarify, this was me accidentally logged in as the openhands-agent account.
A potential fix has been generated and a draft PR #5223 has been created. Please review the changes.
enyst added a commit that referenced this issue on Nov 23, 2024: "…ation directory" (this reverts commit 4136c53).
neubig pushed a commit that referenced this issue on Nov 25, 2024 (Co-authored-by: Engel Nyst <[email protected]>).
What problem or use case are you trying to solve?

Right now the evaluation directory structure is very flat, and it is hard to tell which subdirectories are utilities related to implementing benchmarks or doing basic tests for OpenHands (`utils`, `integration_tests`, `regression`, `static`), and which are actual benchmarks from the ML literature (everything else). To make this clearer, we can move all benchmarks to live under the `evaluation/benchmarks/` directory. In addition, all other files related to evaluation (including documentation, GitHub workflows, etc.) will need to be checked and updated to maintain consistency. While we do this, we can also add some of the benchmarks that are missing from the `evaluation/README.md` documentation.
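The proposed reorganization can be sketched with a small shell script: keep the infrastructure directories (`utils`, `integration_tests`, `regression`, `static`) in place and move everything else under `evaluation/benchmarks/`. This is a minimal sketch run against a scratch copy, not the real repo; the benchmark names `swe_bench` and `gaia` below are placeholder examples, and in the actual repository `git mv` would be preferable so history is preserved.

```shell
set -e
tmp=$(mktemp -d)

# Recreate a flat layout like the current one (benchmark names are examples).
for d in utils integration_tests regression static swe_bench gaia; do
  mkdir -p "$tmp/evaluation/$d"
done

# New home for the benchmarks.
mkdir -p "$tmp/evaluation/benchmarks"

# Move everything that is not infrastructure under benchmarks/.
for d in "$tmp"/evaluation/*/; do
  name=$(basename "$d")
  case "$name" in
    utils|integration_tests|regression|static|benchmarks) ;;  # infrastructure stays put
    *) mv "$d" "$tmp/evaluation/benchmarks/$name" ;;          # benchmarks move
  esac
done

ls "$tmp/evaluation"
ls "$tmp/evaluation/benchmarks"
```

After a pass like this, a follow-up search (e.g. `grep -r "evaluation/swe_bench"` over docs and workflow files) would surface the references that still need updating for consistency.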