Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compare the CharacterProposal to TokenProposal(K=None) #42

Open
timvieira opened this issue Jul 9, 2024 · 3 comments
Open

Compare the CharacterProposal to TokenProposal(K=None) #42

timvieira opened this issue Jul 9, 2024 · 3 comments

Comments

@timvieira
Copy link
Collaborator

The CharacterProposal is designed to be fast while still hopefully being a good proxy for TokenProposal(K=None). How good is it in practice?

@benlebrun
Copy link
Collaborator

Below is a minimal example where the distribution over tokens defined by the CharacterProposal differs from the TokenProposal with K=None (i.e., the local product of experts). In this example, the character proposal places too much probability on a since the frequency of paths with a in them is too high.

Screenshot 2024-07-10 at 4 09 28 PM

Our weights will correct for this issue, but this nonetheless means that we are obtaining sub-optimal token samples from the character proposal.

@timvieira
Copy link
Collaborator Author

Excellent work, @benlebrun

@timvieira
Copy link
Collaborator Author

timvieira commented Jul 20, 2024

The example in tests/test_system.py if we vary the proposal has an interesting qualitative difference that we should dig into:

Token:

image-4

Character:

image-3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants