Skip to content

Commit 23628e5

Browse files
authored
Add files via upload
1 parent 51ad2c1 commit 23628e5

File tree

3 files changed

+71
-16
lines changed

3 files changed

+71
-16
lines changed

README.md

+71-16
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,71 @@
1-
# DS-Agent
2-
3-
This is the official implementation of our work "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning". [[arXiv Version]](https://arxiv.org/abs/2402.17453) [[Download Benchmark (Google Drive)]](https://drive.google.com/file/d/1zfgZFQplmTmwS6L8016Tda73zExW_93D/view?usp=sharing)
4-
5-
## Cite
6-
7-
Please consider citing our paper if you find this work useful:
8-
9-
```
10-
@article{DS-Agent,
11-
title={DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning},
12-
author={Guo, Siyuan and Deng, Cheng and Wen, Ying and Chen, Hechang and Chang, Yi and Wang, Jun},
13-
journal={arXiv preprint arXiv:2402.17453},
14-
year={2024}
15-
}
16-
```
1+
# DS-Agent
2+
3+
This is the official implementation of our work "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning". [[arXiv Version]](https://arxiv.org/abs/2402.17453) [[Download Benchmark(Google Drive)]](https://drive.google.com/file/d/1xUd1nvCsMLfe-mv9NBBHOAtuYnSMgBGx/view?usp=sharing)
4+
5+
![overview.png](.\figures\overview.png)
6+
7+
## Benchmark
8+
9+
We select 30 representative data science tasks covering three data modalities and two fundamental ML task types. Please download the datasets and corresponding configuration files via [[Google Drive]](https://drive.google.com/file/d/1xUd1nvCsMLfe-mv9NBBHOAtuYnSMgBGx/view?usp=sharing) here and unzip them to the directory of "development/benchamarks".
10+
11+
![overview.png](.\figures\task.png)
12+
13+
## Setup
14+
15+
This project is built on top of the framework of MLAgentBench. First, install MLAgentBench package with:
16+
17+
```shell
18+
cd development
19+
pip install -e.
20+
```
21+
22+
Then, please install neccessary libraries in the requirements.
23+
24+
```shell
25+
pip install -r requirements.txt
26+
```
27+
28+
Since DS-Agent mainly utilizes GPT-3.5 and GPT-4 for all the experiments, please fill in the openai key in development/MLAgentBench/LLM.py and deployment/generate.py
29+
30+
## Development Stage
31+
32+
Run DS-Agent for development tasks with the following command:
33+
34+
```shell
35+
cd development/MLAgentBench
36+
python runner.py --task feedbackv2 --llm-name gpt-3.5-turbo-16k --edit-script-llm-name gpt-3.5-turbo-16k
37+
```
38+
39+
During execution, logs and intermediate solution files will be saved in logs/ and workspace/.
40+
41+
## Deployment Stage
42+
43+
Run DS-Agent for deployment tasks with the provided command:
44+
45+
```shell
46+
cd deployment
47+
bash code_generation.sh
48+
bash code_evaluation.sh
49+
```
50+
51+
For open-sourced LLM, i.e., mixtral-8x7b-Instruct-v0.1 in this paper, we utilize the vllm framework. First, enable the LLMs serverd with
52+
53+
```shell
54+
cd deployment
55+
bash start_api.sh
56+
```
57+
58+
Then, run the script shell and replace the configuration --llm by mixtral.
59+
60+
## Cite
61+
62+
Please consider citing our paper if you find this work useful:
63+
64+
```
65+
@article{DS-Agent,
66+
title={DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning},
67+
author={Guo, Siyuan and Deng, Cheng and Wen, Ying and Chen, Hechang and Chang, Yi and Wang, Jun},
68+
journal={arXiv preprint arXiv:2402.17453},
69+
year={2024}
70+
}
71+
```

figures/overview.png

438 KB
Loading

figures/task.png

438 KB
Loading

0 commit comments

Comments
 (0)