Compare the `CharacterProposal` to `TokenProposal(K=None)` #42

timvieira · 2024-07-09T21:00:22Z

The CharacterProposal is designed to be fast while still hopefully being a good proxy for TokenProposal(K=None). How good is it in practice?

The text was updated successfully, but these errors were encountered:

benlebrun · 2024-07-10T20:26:04Z

Below is a minimal example where the distribution over tokens defined by the CharacterProposal differs from the TokenProposal with K=None (i.e., the local product of experts). In this example, the character proposal places too much probability on a since the frequency of paths with a in them is too high.

Our weights will correct for this issue, but this nonetheless means that we are obtaining sub-optimal token samples from the character proposal.

timvieira · 2024-07-10T22:03:03Z

Excellent work, @benlebrun

timvieira · 2024-07-20T15:52:17Z

The example in tests/test_system.py if we vary the proposal has an interesting qualitative difference that we should dig into:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compare the `CharacterProposal` to `TokenProposal(K=None)` #42

Compare the `CharacterProposal` to `TokenProposal(K=None)` #42

timvieira commented Jul 9, 2024

benlebrun commented Jul 10, 2024

timvieira commented Jul 10, 2024

timvieira commented Jul 20, 2024 •

edited

Loading

Compare the CharacterProposal to TokenProposal(K=None) #42

Compare the CharacterProposal to TokenProposal(K=None) #42

Comments

timvieira commented Jul 9, 2024

benlebrun commented Jul 10, 2024

timvieira commented Jul 10, 2024

timvieira commented Jul 20, 2024 • edited Loading

Token:

Character:

Compare the `CharacterProposal` to `TokenProposal(K=None)` #42

Compare the `CharacterProposal` to `TokenProposal(K=None)` #42

timvieira commented Jul 20, 2024 •

edited

Loading