Skip to content

Beating the GAIA benchmark with Transformers Agents. πŸš€

License

Notifications You must be signed in to change notification settings

aymeric-roucher/GAIA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

682987c Β· Feb 19, 2025

History

17 Commits
Jun 27, 2024
Jun 27, 2024
Jun 27, 2024
Jun 27, 2024
Sep 4, 2024
Oct 28, 2024
Jun 27, 2024
Feb 19, 2025
Sep 18, 2024
Oct 28, 2024
Sep 4, 2024
Sep 5, 2024
Oct 28, 2024
Oct 28, 2024

Repository files navigation

Beating GAIA with Transformers Agents πŸš€

This is the exact code used for our submission that scores #2 on the test set, #1 on the validation set.

GAIA leaderboard screenshot

Check out the current leaderboard here.

How to run tests?

First, install requirements:

pip install -r requirements.txt

Setup your secrets in a .envfile:

HUGGINGFACEHUB_API_TOKEN
SERPAPI_API_KEY
OPENAI_API_KEY
ANTHROPIC_API_KEY

And optionally if you want to use Anthropic models via AWS bedrock:

AWS_BEDROCK_ID
AWS_BEDROCK_KEY

Then run gaia_multiagent.py to launch tests!

Head to smolagents for a better version!

We've mnaged to increase score to 55% using the new framework smolagents! Head to this folder in smolagents to find a more recent run on GAIA, with better scores than this one!

About

Beating the GAIA benchmark with Transformers Agents. πŸš€

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published