On the Efficacy of Self-reflection for Improving LLM Agent Planning

This repository contains the code for our paper: "On the Efficacy of Self-reflection for Improving LLM Agent Planning".

Navigation

The root of the repository contains the following sub-repositories:

SEA/: Contains the code for our self-relection framework, SEA (Sample-Evaluate-Aggregate).
ToolTalk/: Contains our version of the ToolTalk benchmark repository.
ToolSandbox/: Contains our version of the ToolSandbox benchmark repository.

Experiments

The ToolTalk/ and ToolSandbox/ sub-repositories contain the code necessary to run our experiments on each benchmark, with instructions provided in the top the respective READMEs. Note that the different environments will likely be required to run each benchmark - after creating a new python environment, you will have to: 1) follow the benchmark installation instructions described in its README, and 2) locally install the SEA repository as a package by running pip install . within the SEA/ subrepository. The respective experimental results for each benchmark can be found in the results folder in the root of each sub-repository.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
SEA		SEA
ToolSandbox		ToolSandbox
ToolTalk		ToolTalk
CODEOWNERS.md		CODEOWNERS.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

On the Efficacy of Self-reflection for Improving LLM Agent Planning

Navigation

Experiments

Citation

About

Releases

Packages

Contributors 3

Languages

cohere-ai/self-reflection-planning-paper

Folders and files

Latest commit

History

Repository files navigation

On the Efficacy of Self-reflection for Improving LLM Agent Planning

Navigation

Experiments

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages