Skip to content

add int8/tf32 transpose A copy traits #319

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: sycl-develop
Choose a base branch
from

Conversation

taozha2
Copy link
Collaborator

@taozha2 taozha2 commented Apr 21, 2025

No description provided.

@taozha2
Copy link
Collaborator Author

taozha2 commented Apr 21, 2025

@aacostadiaz @joeatodd This PR add int8 and tf32 transpose copy traits support including both A and B. all transpose case for bf16/fp16/int8/uint8/tf32 here https://github.com/taozha2/cutlass-fork/blob/zt/gemm_layout_data_type/examples/sycl/pvc/pvc_gemm.cpp#L432 passed(MUST based on latest driver https://ubit-gfx.intel.com/build/21406574 or later)..
But as i have talked with @aacostadiaz before, latest code base have regression for int8/tf32 transpose gemm support, i think all transpose case should be passed if you fixed that regression.
please review this PR and merge it.

@taozha2 taozha2 changed the title add int8/tf32 transpose A intrinsic add int8/tf32 transpose A copy traits Apr 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants