Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: prompting o1 #1844

Closed
wants to merge 56 commits into from
Closed

feat: prompting o1 #1844

wants to merge 56 commits into from

Conversation

kl2806
Copy link
Collaborator

@kl2806 kl2806 commented Oct 8, 2024

Please describe the purpose of this pull request.
Is it to add a new feature? Is it to fix a bug?
Create a new agent type O1Agent, that uses prompting to get the agent to "think" more at inference time before responding by calling a think function repeatedly before the final answer function.

How to test
How can we test your PR during review? What commands should we run? What outcomes should we expect?
Added a simple test in test_o1_agent that creates an O1 agent and asks it to compare which two numbers are larger. With got-4o-mini, it improves accuracy from 0->80%. The test just checks that 3 steps to respond.

@sarahwooders sarahwooders self-requested a review October 8, 2024 18:29
@kl2806 kl2806 requested a review from cpacker October 10, 2024 22:16
@kl2806
Copy link
Collaborator Author

kl2806 commented Oct 15, 2024

Closing as it is subsumed by #1891

@kl2806 kl2806 closed this Oct 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants