XiangLi1999/AutoBencher

AutoBencher

To get started quickly, install the dependencies:

pip install -r requirements.txt

Then run the following commands to start the benchmark and experiment with the knowledge-intensive tasks:

```bash
python run_script.py wiki
python run_script.py multilingual
python run_script.py math
```

Specifically, the above scripts run the following command:

python wiki_autobencher.py --exp_mode autobencher --test_taker_modelname gpt-4-turbo-preview  --use_helm no --agent_modelname gpt-4-turbo-preview --theme history --outfile_prefix1 KI/history. 
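
As an illustration only, the command above can be assembled programmatically. The sketch below is not taken from `run_script.py`; the helper name `build_wiki_command` is hypothetical, and the assumption that the output prefix follows a `KI/<theme>.` pattern is inferred from the single example above. Only flags that appear in that command are used.

```python
import shlex


def build_wiki_command(theme="history", model="gpt-4-turbo-preview"):
    """Return the wiki_autobencher.py invocation as an argv list.

    Hypothetical helper: the flag names are copied from the README command;
    the KI/<theme>. prefix pattern is an assumption based on that one example.
    """
    return [
        "python", "wiki_autobencher.py",
        "--exp_mode", "autobencher",
        "--test_taker_modelname", model,
        "--use_helm", "no",
        "--agent_modelname", model,
        "--theme", theme,
        "--outfile_prefix1", f"KI/{theme}.",  # assumed naming pattern
    ]


if __name__ == "__main__":
    # Print the command as a single shell-safe string.
    print(shlex.join(build_wiki_command()))
```

The list form can be passed directly to `subprocess.run` from the repository root if you prefer to launch the run from Python rather than the shell.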
