Skip to content

Commit

Permalink
Remove experiment instruction doc
Browse files Browse the repository at this point in the history
  • Loading branch information
dandansamax committed Jul 31, 2024
1 parent 562f604 commit 8cbea7a
Show file tree
Hide file tree
Showing 2 changed files with 0 additions and 3 deletions.
2 changes: 0 additions & 2 deletions crab-benchmark-v0/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,5 +29,3 @@ After setting up the environment, you can start the experiment. A brief overview
2. Start the CRAB server in the Ubuntu environment and get its IP address and port. Let's say they are `192.168.122.72` and `8000`.
3. Choose a task. As an example, we take the task with ID `a3476778-e512-40ca-b1c0-d7aab0c7f18b` from [handmade_tasks](./dataset/handmade_tasks.py). The task is: "Open the 'Tasks' app on Android, check the first incomplete task, then perform the task according to its description."
4. Run [main.py](./main.py) with the command `poetry run python -m crab-benchmark-v0.main --model gpt4o --policy single --remote-url http://192.168.122.72:8000 --task-id a3476778-e512-40ca-b1c0-d7aab0c7f18b`. In this command, `--model gpt4o` and `--policy single` determine the agent system, `--remote-url` specifies the Ubuntu environment interface, and `--task-id` indicates the task to be performed.

[Environment instructions](./docs/experiment_instruction.md) contains more details of the experiment.
1 change: 0 additions & 1 deletion crab-benchmark-v0/docs/experiment_instruction.md

This file was deleted.

0 comments on commit 8cbea7a

Please sign in to comment.