Revert "[LLVMGPU] Deprecate the matmul simt pipeline (#19335)" #19508

Merged

archana-ramalingam
Contributor

@archana-ramalingam archana-ramalingam commented Dec 18, 2024

This reverts commit 6ff00a8.
The above commit causes the Llama3.1 8B fp16 model to generate NaN logits during prefill/decode.
Issue: #19506

@MaheshRavishankar
Contributor

Please link the issue here

@MaheshRavishankar
Contributor

Cc @pashu123

@pashu123
Contributor

> Cc @pashu123

Taking a look.

@ScottTodd
Member

Re-triggered all the workflows that failed. Errors looked like transient network issues on GitHub's side.
