First readme.md example fails #465
Replies: 4 comments 3 replies
-
Doesn't seem to be an issue with Outlines. What's the output without guided generation? |
Beta Was this translation helpful? Give feedback.
-
TLDR from long answer below - outlines choice doesn't seem to always match logprobs? The unguided output’s starts with “The“ but maybe more relevant is the logprobs of “Positive” vs “Negative” with the example prompt - they were inverted with the positive-seeming review getting a higher probability for “negative” being the next token. I fixed that with prompt engineering (just had to include some few-shots) but given the misalignment between logprobs and outline output, I’m still wondering what’s going on… My assumption would be the logprobs should consistently reflect Outline’s choice but maybe there’s a temperature setting in Outlines not set to 0 or just my whole mental model is incorrect? Also, is there a way to have the logprobs print in this situation for debugging purposes? As for the prompt itself, I imagine others will use the first example as jumping off point like me, so I would suggest updating to something with choices that have a disparity in probability and align with a reasonable person’s expectations, e.g. “My dog is named [Fido,Steve]”. |
Beta Was this translation helpful? Give feedback.
-
(Accidentally closed when responding above) |
Beta Was this translation helpful? Give feedback.
-
That's probably an artifact of multinomial sampling, and if you took enough samples "Negative" would be more represented. I assume in this case greedy sampling would give the right answer. |
Beta Was this translation helpful? Give feedback.
-
Describe the issue as clearly as possible:
If you change the one-line review to "This restaurant stinks" - you still get a "Positive" answer back.
Steps/code to reproduce the bug:
Expected result:
Error message:
No response
Outlines/Python version information:
Context for the issue:
No response
Beta Was this translation helpful? Give feedback.
All reactions