-
Hi all, I've been using textgrad in my experiments, and it is fun to see how it updates prompts, but I have a practical problem. Even though I do my best to describe what I want in the text loss, I see drift in the objective. Specifically, I'm trying to optimize a large system prompt over three examples. The objective is for the LLM's final answer to end in a particular format, e.g., 'Code: $IMPLEMENTATION'.
However, I see that the LLM provides feedback on other aspects of the answer as well, causing a bit of drift and growing the system_prompt. If you have any general suggestions on how to describe a text loss, I think that would be useful to the community. Thanks!
Replies: 3 comments
-
We should absolutely add more guidance about how to do this in our tutorials! Thank you for this question @kgourgou.
In general, there are a bunch of places to modulate the behavior of the optimizers, which is a blessing because it is expressive, but a curse because it grows the search space. Here are a couple of things I would try:
`TGD(..., constraints=["your constraint"])`: We can try to impose any 'constraint' through the `constraints` argument. In this case, we could try constraints like `Your answer should end in the following format: 'Code: $IMPLEMENTATION' where IMPLEMENTATION is your implementation`, or `Do not grow the system prompt too much`, etc. In general, this is one major place to inject user guidance into the optimization process.
I'd recommend starting with these -- and we'll make sure to add more tutorials on this! Thanks again @kgourgou -- great to have you in the community.
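For concreteness, here is a minimal sketch of how this might be wired up, assuming the standard textgrad API; the engine name, prompt strings, and example question are placeholders for illustration, not part of the original discussion:

```python
import textgrad as tg

# Placeholder engine name -- swap in whichever backend you use.
tg.set_backward_engine("gpt-4o")

# The (large) system prompt we want to optimize.
system_prompt = tg.Variable(
    "You are a careful coding assistant.",  # placeholder starting prompt
    requires_grad=True,
    role_description="system prompt for the answering LLM",
)
model = tg.BlackboxLLM("gpt-4o", system_prompt=system_prompt)

# Constraints are one way to keep the optimizer from drifting
# to other aspects of the answer.
optimizer = tg.TGD(
    parameters=[system_prompt],
    constraints=[
        "Your answer should end in the following format: "
        "'Code: $IMPLEMENTATION' where IMPLEMENTATION is your implementation.",
        "Do not grow the system prompt too much.",
    ],
)

# A text loss narrowly scoped to the formatting objective.
loss_fn = tg.TextLoss(
    "Evaluate ONLY whether the answer ends in the required "
    "'Code: $IMPLEMENTATION' format. Ignore every other aspect."
)

# One optimization step over a single (hypothetical) example.
question = tg.Variable(
    "Write a function that reverses a string.",
    requires_grad=False,
    role_description="query to the LLM",
)
loss = loss_fn(model(question))
loss.backward()
optimizer.step()  # rewrites system_prompt, steered by the constraints
```

Scoping the `TextLoss` instruction to the single formatting objective, in addition to passing constraints, should help cut down the off-objective feedback you're seeing.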
-
Very interesting! Thanks for sharing all the tips.
-
Moved this to discussions in case it's more accessible!