feat: prompting o1 #1844

kl2806 · 2024-10-08T17:25:31Z

Please describe the purpose of this pull request.
Is it to add a new feature? Is it to fix a bug?
Create a new agent type O1Agent, that uses prompting to get the agent to "think" more at inference time before responding by calling a think function repeatedly before the final answer function.

How to test
How can we test your PR during review? What commands should we run? What outcomes should we expect?
Added a simple test in test_o1_agent that creates an O1 agent and asks it to compare which two numbers are larger. With got-4o-mini, it improves accuracy from 0->80%. The test just checks that 3 steps to respond.

…ent.inner_step() ie the single step version

…StepResponse

kl2806 · 2024-10-15T20:27:25Z

Closing as it is subsumed by #1891

Kevin Lin added 2 commits October 7, 2024 16:43

prompting o1

f2ac415

add test

e45c6b2

sarahwooders self-requested a review October 8, 2024 18:29

Kevin Lin and others added 18 commits October 8, 2024 11:38

fix formatting

576496d

Merge branch 'main' into o1

fb738b4

add o1 agent type

256ee15

update test with agent type

a44d4bd

pass in llm and embedding configs

c6875bb

clean up checks in server.py

cd7b7fd

clean up saving

dcd8190

update server.py to check agent types

a0b9110

format

a921071

decrease max think steps and add reasoning persona

dbf3d0d

add o1 persona

67e49e0

fix: fix issue with agent state (letta-ai#1859)

fbb7fce

fix: patch interface bug

f122bf2

pass in user message only on first turn

f76a149

check if request agent type is None

8c900b7

clean up

046f53c

tune prompt

489fff7

test

62299d1

kl2806 requested a review from cpacker October 10, 2024 22:16

Kevin Lin and others added 8 commits October 11, 2024 09:20

remove unnecesary print

a10e784

check if response choice is None

29e7aef

use gpt4 for test

a1fd6ea

refactor: make Agent.step() multi-step, and rename Agent.step() to Ag…

e49039a

…ent.inner_step() ie the single step version

drop saving

04e6aa4

add back save with check

eeb1460

fix unpacking problem

ae9b0e9

fix: patched tests

bdabf33

cpacker and others added 28 commits October 14, 2024 17:11

chore: comments

0178348

fix: patch kwarg bug

1492b08

fix: patch test

11f8dcf

refactor: remove the ability to have types list[dict] inside of Agent…

e71e816

…StepResponse

merge

342e32b

add test

ab9c3b5

fix formatting

a10d43d

add o1 agent type

05448f8

update test with agent type

05ca69f

pass in llm and embedding configs

a6a7a90

clean up checks in server.py

942828a

clean up saving

b7f1069

update server.py to check agent types

8b28e02

format

f3fb538

decrease max think steps and add reasoning persona

9f4681c

add o1 persona

b6b4283

fix: fix issue with agent state (letta-ai#1859)

5d7d623

fix: patch interface bug

e08f435

pass in user message only on first turn

fdadc23

check if request agent type is None

2976f0a

clean up

f21a4fc

tune prompt

094affe

test

b62dd83

remove unnecesary print

bd31042

check if response choice is None

9801cf2

use gpt4 for test

9eff4a9

merge

73632f3

merge

e4e69cc

kl2806 closed this Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: prompting o1 #1844

feat: prompting o1 #1844

kl2806 commented Oct 8, 2024 •

edited

Loading

kl2806 commented Oct 15, 2024

feat: prompting o1 #1844

feat: prompting o1 #1844

Conversation

kl2806 commented Oct 8, 2024 • edited Loading

kl2806 commented Oct 15, 2024

kl2806 commented Oct 8, 2024 •

edited

Loading