Commit 041d78b

Setting StageCount to auto on PVC for instance emitting (#294)
This is a small change needed to emit a PVC-compatible Gemm3xInstance given the currently supported and generated configurations.

Co-authored-by: Alejandro Acosta <[email protected]>
1 parent 675914c commit 041d78b

1 file changed

+2
-0
lines changed

python/cutlass_library/gemm_operation.py

Lines changed: 2 additions & 0 deletions
@@ -913,6 +913,8 @@ def emit(self, operation):
       stage_count_string = f"cutlass::gemm::collective::StageCount<{str(operation.tile_description.stages)}>"
     elif opcode_class_main == OpcodeClass.SparseTensorOp and operation.arch == 100:
       stage_count_string = f"cutlass::gemm::collective::StageCountAutoCarveoutEpi<{str(operation.procedural_name())}_epilogue>"
+    elif operation.is_xe:
+      stage_count_string = "cutlass::gemm::collective::StageCountAuto"
     else:
       stage_count_string = f"cutlass::gemm::collective::StageCountAutoCarveout<static_cast<int>(sizeof(typename {str(operation.procedural_name())}_epilogue::SharedStorage))>"
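The branch order in this hunk can be sketched as a standalone function, which makes it easier to see when the new `is_xe` branch is reached. This is a minimal sketch, not the code in `gemm_operation.py`: `FakeOperation`, `select_stage_count_string`, and the local `OpcodeClass` enum are hypothetical stand-ins. The condition guarding the first branch is not visible in the hunk, so the sketch assumes that branch is taken when an explicit positive stage count is set.

```python
from dataclasses import dataclass
from enum import Enum, auto


class OpcodeClass(Enum):
    # Hypothetical stand-in for cutlass_library's OpcodeClass (two members only).
    TensorOp = auto()
    SparseTensorOp = auto()


@dataclass
class FakeOperation:
    # Hypothetical stand-in for the operation object passed to emit().
    stages: int
    arch: int
    is_xe: bool
    name: str
    opcode_class: OpcodeClass = OpcodeClass.TensorOp


def select_stage_count_string(op: FakeOperation) -> str:
    ns = "cutlass::gemm::collective"
    # ASSUMPTION: the first branch's condition is outside the hunk; an explicit
    # positive stage count is assumed to select a fixed StageCount<N>.
    if op.stages > 0:
        return f"{ns}::StageCount<{op.stages}>"
    elif op.opcode_class == OpcodeClass.SparseTensorOp and op.arch == 100:
        return f"{ns}::StageCountAutoCarveoutEpi<{op.name}_epilogue>"
    elif op.is_xe:
        # Branch added by this commit: Xe (PVC) instances use plain StageCountAuto.
        return f"{ns}::StageCountAuto"
    else:
        return (
            f"{ns}::StageCountAutoCarveout<static_cast<int>("
            f"sizeof(typename {op.name}_epilogue::SharedStorage))>"
        )


# The arch value 0 is a placeholder, not a real architecture number.
xe_op = FakeOperation(stages=0, arch=0, is_xe=True, name="gemm_xe")
print(select_stage_count_string(xe_op))
```

Because the `is_xe` check sits before the final `else`, an Xe operation without an explicit stage count never reaches the `StageCountAutoCarveout` fallback, which refers to the epilogue's `SharedStorage` type.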
