[LoweringStrategy] Use a more general method to fetch input dims and sizes #1090
Conversation
Nice! A few comments to address.
```cpp
SmallVector<int64_t> shapes = linalgOp.getStaticLoopRanges();
if (mDims.size() + nDims.size() + kDims.size() > shapes.size()) {
  return linalgOp.emitOpError(
      "the total of m/n/k dims is larger than the number of loops.");
}
```
Do we really need this check? This seems like something the upstream utility should handle, if it doesn't already. If we do need the check, I think we can simply `assert` on it.
```cpp
auto getSizesAt = [&shapes](const SmallVector<unsigned, 2> &idx) {
  SmallVector<int64_t, 2> sizes;
  for (auto i : idx) sizes.push_back(shapes[i]);
```
Suggested change:

```diff
-  for (auto i : idx) sizes.push_back(shapes[i]);
+  for (unsigned i : idx) sizes.push_back(shapes[i]);
```
```cpp
AMDAIEDevice targetDevice, uint32_t numRows, uint32_t numCols,
uint32_t numLoops) {
```
Can we use `linalgOp.getNumLoops()` within each function? In that case the `numLoops` function argument can be dropped here and elsewhere.
There is a bug in the existing code for getting the M/N/K sizes from a matmul-like op: K shouldn't be `lhsShape[1]` if the op is a matmul-transpose-a. It is hard to infer the sizes if the input matmul-like op is transposed and in `linalg.generic` form, or if the input has a higher number of dimensions, as with mmt4d ops.

In addition, the indexing maps of matmul-like `linalg.generic` ops can be transposed during dispatch generation by default, because of `TransposeGenericOpsPass`. For example, the iterator types

```
parallel, parallel, reduction, parallel, parallel, reduction
```

will become

```
parallel, parallel, parallel, parallel, reduction, reduction
```

after dispatch generation. So if we still look up the pack sizes and tile sizes at the dims of the former indexing map, we will generate wrong sizes for tiling.
This PR uses the upstream method `linalg::inferContractionDims` to infer the dim indices of M/N/K for all contraction ops.