AutoBencher

To get started, install the dependencies:

pip install -r requirements.txt

Then run the following commands to start the benchmark and experiment with the knowledge-intensive tasks:

```bash
python run_scripts.py wiki
python run_scripts.py multilingual
python run_scripts.py math
```

Specifically, the scripts above run commands like the following:

python wiki_autobencher.py --exp_mode autobencher --test_taker_modelname gpt-4-turbo-preview  --use_helm no --agent_modelname gpt-4-turbo-preview --theme history --outfile_prefix1 KI/history.
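
To vary the theme without retyping every flag, the command above can be assembled programmatically. The following is a minimal sketch, not part of the repository: the helper name `build_wiki_command` is hypothetical, the flag names and values are copied from the command above, and the `KI/<theme>.` output prefix pattern is an assumption generalized from the single `history` example. Which theme values other than `history` the script accepts is not stated here.

```python
import shlex


def build_wiki_command(theme, model="gpt-4-turbo-preview"):
    """Return the argument list for one wiki_autobencher.py run.

    Hypothetical helper: flags mirror the README command; the
    "KI/<theme>." prefix pattern is an assumption for illustration.
    """
    return [
        "python", "wiki_autobencher.py",
        "--exp_mode", "autobencher",
        "--test_taker_modelname", model,
        "--use_helm", "no",
        "--agent_modelname", model,
        "--theme", theme,
        "--outfile_prefix1", f"KI/{theme}.",
    ]


if __name__ == "__main__":
    # Print the command rather than executing it, so nothing is launched by accident.
    print(shlex.join(build_wiki_command("history")))
```

Passing the returned list to `subprocess.run(cmd, check=True)` would launch the run; printing it first makes it easy to confirm that the flags match the command above.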
