[BACKEND][AMD] Enable swizzling SMEM for transposed operand (#3666)
A transposed operand is accessed in the opposite order from the original operand. Enabling shared-memory (SMEM) swizzling for it seems to help performance: I'm seeing a 10% improvement on our internal model. This is a backport of ROCm#474.
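
To illustrate why swizzling can matter for an operand read in transposed order, here is a minimal sketch of a generic XOR swizzle on a toy shared-memory model. It is not the layout the Triton AMD backend actually uses: the 32-bank count, the 32x32 tile of 4-byte elements, and the helper names (`linear_addr`, `swizzled_addr`, `max_bank_conflict`) are all assumptions made for illustration.

```python
from collections import Counter

NUM_BANKS = 32  # assumed number of shared-memory banks (one 4-byte word each)
TILE = 32       # assumed square tile size, in elements


def linear_addr(row, col):
    """Plain row-major word address of element (row, col)."""
    return row * TILE + col


def swizzled_addr(row, col):
    """Row-major address with the column XOR-ed by the row index.

    XOR with a fixed row value permutes the columns within that row,
    so every element still gets a unique slot in the tile.
    """
    return row * TILE + (col ^ (row % TILE))


def max_bank_conflict(addr_fn, coords):
    """Largest number of simultaneous accesses that land in one bank."""
    banks = Counter(addr_fn(r, c) % NUM_BANKS for r, c in coords)
    return max(banks.values())


# One access per thread: along a row (original order) and
# along a column (the order a transposed operand would be read in).
row_access = [(0, t) for t in range(TILE)]
col_access = [(t, 0) for t in range(TILE)]

for name, fn in (("linear", linear_addr), ("swizzled", swizzled_addr)):
    print(name,
          "row:", max_bank_conflict(fn, row_access),
          "col:", max_bank_conflict(fn, col_access))
```

In this toy model the unswizzled layout sends every column access to the same bank, while the XOR swizzle spreads both row and column accesses across all banks. Real hardware and the real layout parameters differ, so this only shows the general mechanism, not the measured speedup.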