Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update ck_a8w8 #1636

Open
wants to merge 46 commits into
base: develop
Choose a base branch
from
Open

Update ck_a8w8 #1636

wants to merge 46 commits into from

Conversation

aska-0096
Copy link
Contributor

Update a8w8 kernel library
Update flush cache timing api

@aska-0096 aska-0096 requested review from poyenc, geyyer, bartekxk and a team as code owners November 5, 2024 07:05
zjing14 added a commit to zjing14/FBGEMM that referenced this pull request Dec 19, 2024
Summary:
- Cherry-pick from CK PR ROCm/composable_kernel#1636
- Improve fp8 GEMM rowwise for 70B Prefill with seqlen = 1k/2k

Reviewed By: jwfromm

Differential Revision: D67418190
zjing14 added a commit to zjing14/FBGEMM that referenced this pull request Jan 2, 2025
…ytorch#3517)

Summary:

X-link: facebookresearch/FBGEMM#598

- Cherry-pick from CK PR ROCm/composable_kernel#1636
- Improve fp8 GEMM rowwise for 70B Prefill with seqlen = 1k/2k

Reviewed By: xw285cornell, jwfromm

Differential Revision: D67418190
zjing14 added a commit to zjing14/FBGEMM that referenced this pull request Jan 3, 2025
…ytorch#3517)

Summary:

X-link: facebookresearch/FBGEMM#598

- Cherry-pick from CK PR ROCm/composable_kernel#1636
- Improve fp8 GEMM rowwise for 70B Prefill with seqlen = 1k/2k

Reviewed By: xw285cornell, jwfromm

Differential Revision: D67418190
facebook-github-bot pushed a commit to pytorch/FBGEMM that referenced this pull request Jan 4, 2025
Summary:
Pull Request resolved: #3517

X-link: facebookresearch/FBGEMM#598

- Cherry-pick from CK PR ROCm/composable_kernel#1636
- Improve fp8 GEMM rowwise for 70B Prefill with seqlen = 1k/2k

Reviewed By: xw285cornell, jwfromm

Differential Revision: D67418190

fbshipit-source-id: b6d38715b26d91d6047d03941610fa7e20e54cb7
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants