We might need to be more specific about which probability is covered up to p.
The key method in `TokenProposal` is `traverse_trie(context, p_llm)`, which returns an iterator over the possible next tokens `token` and their raw scores `p_llm(token | context) * p_guide(token | context)`. The challenge in defining the set of top-p tokens is that we cannot compute the normalization constant `Z(context) = sum_{token} p_llm(token | context) * p_guide(token | context)` without materializing the complete distribution.
Thus, for efficiency, the top-p set would need to be defined on the raw scores `p_llm(token | context) * p_guide(token | context)`, which could be really small if the LLM and the guide disagree. However, if we rescaled them by `Z(context)` they would not be small, because we'd be rescaling by the total agreement.
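A toy numeric sketch of the issue (all numbers made up): when the LLM and the guide mostly disagree, every raw score is tiny, so a fixed top-p threshold on raw scores behaves very differently from the same threshold on the normalized distribution.

```python
# Hypothetical per-token probabilities; the guide mostly disagrees with the LLM.
p_llm = {"a": 0.6, "b": 0.3, "c": 0.1}
p_guide = {"a": 0.05, "b": 0.04, "c": 0.0}

# Raw scores p_llm(token) * p_guide(token): all well below 0.05.
raw = {t: p_llm[t] * p_guide[t] for t in p_llm}   # {"a": 0.03, "b": 0.012, "c": 0.0}

# Z(context) is the total agreement between LLM and guide.
Z = sum(raw.values())                             # 0.042

# Rescaled by Z, the scores are no longer small.
normalized = {t: s / Z for t, s in raw.items()}   # {"a": ~0.714, "b": ~0.286, "c": 0.0}

# A threshold of 0.5 on raw scores selects nothing; on the normalized
# distribution it selects {"a"}.
```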
Modify the top-K `TokenProposal` to lazily materialize the set of most probable tokens with cumulative probability less than probability `p`.
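One possible shape for the lazy materialization, sketched under two simplifying assumptions that sidestep the normalization problem above: the iterator yields tokens in non-increasing raw-score order (a best-first trie traversal could provide this), and an estimate of `Z(context)` is passed in, since the real difficulty is precisely that `Z` is unknown without a full traversal. The function name and `z` parameter are illustrative, not part of the existing API.

```python
def lazy_top_p(scored_tokens, p, z):
    """Yield (token, raw_score) pairs until the cumulative normalized
    probability reaches p.

    Assumptions (for illustration only):
      - `scored_tokens` yields pairs in non-increasing score order,
        as a best-first traverse_trie-style traversal might;
      - `z` is an estimate of Z(context), which in practice would have
        to be bounded or estimated rather than computed exactly.
    """
    cumulative = 0.0
    for token, raw in scored_tokens:
        if cumulative >= p:
            break  # the yielded set already covers probability mass >= p
        yield token, raw
        cumulative += raw / z
```

With the toy raw scores from before (Z = 0.042), `lazy_top_p(..., p=0.7, z=0.042)` stops after `"a"` alone, since its normalized mass (~0.714) already exceeds `p`, while `p=0.9` also pulls in `"b"`.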