
feat: DIA-1402: V1-Submit Prompt auto-refinement job #214

Open · matt-bernstein wants to merge 8 commits into master
Conversation

matt-bernstein (Contributor) commented Sep 23, 2024

Add a prompt improvement skill and an endpoint to call it through the server

TODO
[x] validation
[x] test coverage
[x] clean up our data models - started in this new codepath, can extend to existing codepaths later

codecov-commenter commented Sep 23, 2024

Codecov Report

Attention: Patch coverage is 56.09756% with 36 lines in your changes missing coverage. Please review.

Project coverage is 66.94%. Comparing base (abffd45) to head (330d036).
Report is 3 commits behind head on master.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| adala/agents/base.py | 31.57% | 13 Missing ⚠️ |
| server/app.py | 55.55% | 12 Missing ⚠️ |
| adala/skills/collection/prompt_improvement.py | 69.44% | 11 Missing ⚠️ |
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##           master     #214      +/-   ##
==========================================
- Coverage   67.09%   66.94%   -0.15%
==========================================
  Files          44       45       +1
  Lines        2103     2272     +169
==========================================
+ Hits         1411     1521     +110
- Misses        692      751      +59
```

☔ View full report in Codecov by Sentry.

@robot-ci-heartex robot-ci-heartex temporarily deployed to fb-dia-1402 September 23, 2024 21:22 Destroyed
@pakelley pakelley left a comment


overall lgtm

server/prompt_improvement_skill.py Outdated Show resolved Hide resolved
@robot-ci-heartex robot-ci-heartex marked this pull request as draft September 24, 2024 09:05
```python
prompt_improvement_skill = TextGenerationSkill(
    name="prompt_improvement",
    instructions="Improve the user prompt for the provided LLM model to complete the task using the provided input variables, with the provided user prompt as a starting point. Variables can be accessed in the user prompt using the format {variable_name} (only the variable values are used, not their names). Make sure your prompt produces output that will continue to conform to the provided json schema. Provide your reasoning for the changes you made to the prompt.",
    input_template='''
```
A reviewer (Contributor) commented:

Let's move it to a separate file, so we can keep refining it easily.
A couple of prompt engineering best practices we could incorporate right away:

  • split each variable into a well-defined block, for example: Model: {model} --> ## Model\n{model}\n\n
  • add few-shot examples
  • add a # Instructions: or # Follow the steps: section with a numbered list of instructions
  • add a # Things to avoid: section where we patch the mistakes

Prompt engineering for the refinement itself is out of scope for sure, but moving input_template into a separate file and leaving instructions empty would let us iterate on it easily in future updates.
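The restructuring suggested above could look something like this; a minimal sketch where the section names, variables, and example values are illustrative, not adala's actual template:

```python
# Hypothetical sketch of the reviewer's suggestions: each input variable in its
# own markdown block, plus a numbered-steps section and a "things to avoid"
# section. Variable names here are made up for illustration.
input_template = """\
## Model
{model}

## Current prompt
{prompt}

# Follow the steps:
1. Read the current prompt and the input variables.
2. Rewrite the prompt so the output still conforms to the JSON schema.
3. Explain your reasoning for each change.

# Things to avoid:
- Do not rename or drop the {{variable_name}} placeholders.
- Do not change the output schema.
"""

rendered = input_template.format(model="gpt-4o-mini", prompt="Classify the text.")
print(rendered.splitlines()[0])  # "## Model"
```

The doubled braces (`{{variable_name}}`) survive `str.format` as literal `{variable_name}`, so the instruction text can mention the placeholder syntax without being interpolated.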

matt-bernstein (Author) replied:

I want to keep all the deps for the PromptImprovementSkill together; I'm not sure putting just the input template in a separate file would be any cleaner.

But one thing I want your opinion on re prompt engineering is the trick of factoring all the prompt content that doesn't contain variables out into the system prompt, and leaving the variables in the user prompt. That way we can easily use prompt caching on the system prompt. It would also change performance in some way - I'm not sure whether positively or negatively, but I suspect positively - and I wanted to trial it here first, and if it works, bring it to LSE. wdyt?
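The system/user split described above can be sketched as follows; a hypothetical illustration using the common OpenAI-style chat message schema, with made-up instruction text and a made-up helper name:

```python
# Hypothetical sketch: static instruction text lives in the system prompt,
# per-request variables live in the user prompt. Because the system message is
# byte-identical across requests, providers that support prompt caching can
# reuse it between calls.
STATIC_INSTRUCTIONS = (
    "Improve the user prompt for the provided LLM model, keeping the output "
    "conformant to the provided JSON schema. Explain your changes."
)

def build_messages(model: str, current_prompt: str) -> list[dict]:
    return [
        # No variables here -> cacheable.
        {"role": "system", "content": STATIC_INSTRUCTIONS},
        # Only the variable parts change per request.
        {"role": "user", "content": f"Model: {model}\nCurrent prompt: {current_prompt}"},
    ]

msgs = build_messages("gpt-4o", "Classify the sentiment of {text}.")
assert msgs[0]["content"] == STATIC_INSTRUCTIONS  # identical across calls
```

The design trade-off discussed in the thread is that moving content between system and user roles can shift model behavior, so the split is worth A/B-testing rather than assuming it is neutral.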

matt-bernstein (Author) added:

Updated the prompt. Keeping separate system and user prompts for now; lmk if you have a strong opinion about that.

server/app.py Outdated Show resolved Hide resolved
@robot-ci-heartex robot-ci-heartex temporarily deployed to fb-dia-1402 October 1, 2024 14:49 Destroyed

@robot-ci-heartex robot-ci-heartex temporarily deployed to fb-dia-1402 October 1, 2024 14:57 Destroyed

@matt-bernstein matt-bernstein marked this pull request as ready for review October 1, 2024 15:17
@robot-ci-heartex robot-ci-heartex temporarily deployed to fb-dia-1402 October 1, 2024 15:19 Destroyed

@robot-ci-heartex robot-ci-heartex marked this pull request as draft October 2, 2024 01:32

Labels: none yet · Projects: none yet
5 participants