🤗 [REQUEST] - multi-token-prediction #16

Open
2 tasks done
ethanc8 opened this issue Jul 4, 2024 · 0 comments
ethanc8 commented Jul 4, 2024

Model introduction

This is a family of four models, two of which were trained to generate 4 tokens per forward pass instead of the single token that most current LLMs generate. Multi-token prediction shows significant gains over single-token prediction on older benchmarks, so it would be good to see how large the gains are on newer benchmarks like BigCodeBench. These models are not particularly strong, having been trained on 1T tokens or fewer.
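
To illustrate what "4 tokens per forward pass" means for decoding, here is a minimal toy sketch. It is not Meta's code and loads no real model: `toy_forward`, `generate`, `VOCAB`, and the "next token is previous token plus one" rule are all hypothetical stand-ins. It also assumes every head's prediction is accepted as-is, whereas a real model's extra heads may be less reliable and might need verification.

```python
from typing import List, Tuple

VOCAB = 10  # toy vocabulary size (assumption, illustration only)


def toy_forward(context: List[int], n_heads: int) -> List[int]:
    """Stand-in for one forward pass with n_heads output heads.

    Head k predicts the token at offset k+1 after the context. The toy rule
    is "next token = previous token + 1 (mod VOCAB)", so head k simply
    extrapolates k+1 steps ahead. A real model would compute logits instead.
    """
    last = context[-1]
    return [(last + k + 1) % VOCAB for k in range(n_heads)]


def generate(prompt: List[int], n_new: int, n_heads: int) -> Tuple[List[int], int]:
    """Greedy decoding that accepts up to n_heads tokens per forward pass.

    Returns the newly generated tokens and the number of forward passes used.
    """
    tokens = list(prompt)
    passes = 0
    while len(tokens) - len(prompt) < n_new:
        preds = toy_forward(tokens, n_heads)
        passes += 1
        remaining = n_new - (len(tokens) - len(prompt))
        tokens.extend(preds[:remaining])  # never overshoot the requested length
    return tokens[len(prompt):], passes


if __name__ == "__main__":
    single_out, single_passes = generate([0], n_new=8, n_heads=1)
    multi_out, multi_passes = generate([0], n_new=8, n_heads=4)
    print(single_out, single_passes)
    print(multi_out, multi_passes)
```

In this toy setting both configurations produce the same 8 tokens, but the 4-head variant needs a quarter of the forward passes. Whether that translates into benchmark gains is exactly what this request asks to measure.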

Model URL

https://huggingface.co/facebook/multi-token-prediction

Additional instructions (Optional)

Inference currently seems to require Meta's example code.

Author

No

Security

  • I confirm that the model is safe to run and does not contain any malicious code or content.

Integrity

  • I confirm that the model comes from unique and original work and does not contain any plagiarism.

terryyz self-assigned this Jul 4, 2024