Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AMD] Refactor decompose-unsupported-amd-conversions pass #4262

Merged
merged 3 commits into from
Sep 11, 2024

Conversation

binarman
Copy link
Contributor

@binarman binarman commented Jul 5, 2024

This PR introduces:

  • Use common code, simplify pass code
  • Support 3d tensors in mfma -> dot conversion(supported in common code from item above)
  • More tests for decompose-unsupported-amd-conversions pass

@binarman
Copy link
Contributor Author

binarman commented Jul 5, 2024

This PR depends on #3730

@@ -1,8 +1,9 @@
// RUN: triton-opt %s --split-input-file --decompose-unsupported-amd-conversions=arch=gfx942 | FileCheck %s
Copy link
Contributor Author

@binarman binarman Jul 5, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wanted to rename this file to test/Conversion/amd/decompose-unsupported-conversions-gfx11.mlir, but git marks it as new file and looses history. So I've left it as-is for now.

@binarman binarman force-pushed the refactor_decompose_conversions branch 2 times, most recently from bfaa932 to b23e154 Compare July 30, 2024 11:32
@binarman binarman changed the title [WIP] [AMD] Refactor decompose-unsupported-amd-conversions pass [AMD] Refactor decompose-unsupported-amd-conversions pass Jul 30, 2024
// RUN: triton-opt %s --split-input-file --decompose-unsupported-amd-conversions=arch=gfx942 | FileCheck %s

// CHECK: #[[DST_ENC:.+]] = #triton_gpu.blocked<{{.*}}>
// CHECK: #[[SRC_ENC:.+]] = #triton_gpu.amd_mfma<{{.*}}>
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Wanna specify out some different pieces between SRC_ENC and TMP_ENC and change the CHECK here to CHECK-DAG? Right now it relies on solely the ordering to not break..

@antiagainst
Copy link
Collaborator

@binarman ping on this. Can you merge in main and address comments?

@binarman
Copy link
Contributor Author

@antiagainst sure, this will be first thing after I finish with small-m dimension FMA optimization.

This PR:
- Simplifying pass code, reusing common code
- Introduces supports 3d tensors in mfma -> dot conversion(supported in common code from item above)
- Adds more lit tests for decompose-unsupported-amd-conversions pass
@binarman binarman force-pushed the refactor_decompose_conversions branch from b23e154 to cebb7e9 Compare September 10, 2024 20:37
@antiagainst antiagainst marked this pull request as ready for review September 11, 2024 18:40
@antiagainst antiagainst merged commit ec4ca60 into triton-lang:main Sep 11, 2024
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants