|
1 |
| -# DS-Agent |
2 |
| - |
3 |
| -This is the official implementation of our work "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning". [[arXiv Version]](https://arxiv.org/abs/2402.17453) [[Download Benchmark (Google Drive)]](https://drive.google.com/file/d/1zfgZFQplmTmwS6L8016Tda73zExW_93D/view?usp=sharing) |
4 |
| - |
5 |
| -## Cite |
6 |
| - |
7 |
| -Please consider citing our paper if you find this work useful: |
8 |
| - |
9 |
| -``` |
10 |
| -@article{DS-Agent, |
11 |
| - title={DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning}, |
12 |
| - author={Guo, Siyuan and Deng, Cheng and Wen, Ying and Chen, Hechang and Chang, Yi and Wang, Jun}, |
13 |
| - journal={arXiv preprint arXiv:2402.17453}, |
14 |
| - year={2024} |
15 |
| -} |
16 |
| -``` |
| 1 | +# DS-Agent |
| 2 | + |
| 3 | +This is the official implementation of our work "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning". [[arXiv Version]](https://arxiv.org/abs/2402.17453) [[Download Benchmark(Google Drive)]](https://drive.google.com/file/d/1xUd1nvCsMLfe-mv9NBBHOAtuYnSMgBGx/view?usp=sharing) |
| 4 | + |
| 5 | + |
| 6 | + |
| 7 | +## Benchmark |
| 8 | + |
| 9 | +We select 30 representative data science tasks covering three data modalities and two fundamental ML task types. Please download the datasets and corresponding configuration files via [[Google Drive]](https://drive.google.com/file/d/1xUd1nvCsMLfe-mv9NBBHOAtuYnSMgBGx/view?usp=sharing) here and unzip them to the directory of "development/benchamarks". |
| 10 | + |
| 11 | + |
| 12 | + |
| 13 | +## Setup |
| 14 | + |
| 15 | +This project is built on top of the framework of MLAgentBench. First, install MLAgentBench package with: |
| 16 | + |
| 17 | +```shell |
| 18 | +cd development |
| 19 | +pip install -e. |
| 20 | +``` |
| 21 | + |
| 22 | +Then, please install neccessary libraries in the requirements. |
| 23 | + |
| 24 | +```shell |
| 25 | +pip install -r requirements.txt |
| 26 | +``` |
| 27 | + |
| 28 | +Since DS-Agent mainly utilizes GPT-3.5 and GPT-4 for all the experiments, please fill in the openai key in development/MLAgentBench/LLM.py and deployment/generate.py |
| 29 | + |
| 30 | +## Development Stage |
| 31 | + |
| 32 | +Run DS-Agent for development tasks with the following command: |
| 33 | + |
| 34 | +```shell |
| 35 | +cd development/MLAgentBench |
| 36 | +python runner.py --task feedbackv2 --llm-name gpt-3.5-turbo-16k --edit-script-llm-name gpt-3.5-turbo-16k |
| 37 | +``` |
| 38 | + |
| 39 | +During execution, logs and intermediate solution files will be saved in logs/ and workspace/. |
| 40 | + |
| 41 | +## Deployment Stage |
| 42 | + |
| 43 | +Run DS-Agent for deployment tasks with the provided command: |
| 44 | + |
| 45 | +```shell |
| 46 | +cd deployment |
| 47 | +bash code_generation.sh |
| 48 | +bash code_evaluation.sh |
| 49 | +``` |
| 50 | + |
| 51 | +For open-sourced LLM, i.e., mixtral-8x7b-Instruct-v0.1 in this paper, we utilize the vllm framework. First, enable the LLMs serverd with |
| 52 | + |
| 53 | +```shell |
| 54 | +cd deployment |
| 55 | +bash start_api.sh |
| 56 | +``` |
| 57 | + |
| 58 | +Then, run the script shell and replace the configuration --llm by mixtral. |
| 59 | + |
| 60 | +## Cite |
| 61 | + |
| 62 | +Please consider citing our paper if you find this work useful: |
| 63 | + |
| 64 | +``` |
| 65 | +@article{DS-Agent, |
| 66 | + title={DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning}, |
| 67 | + author={Guo, Siyuan and Deng, Cheng and Wen, Ying and Chen, Hechang and Chang, Yi and Wang, Jun}, |
| 68 | + journal={arXiv preprint arXiv:2402.17453}, |
| 69 | + year={2024} |
| 70 | +} |
| 71 | +``` |
0 commit comments