Ukernel lowering for data-tiled multi_mma
with mfma_i32_16x16x32_i8
#3206
Job | Run time |
---|---|
6s | |
19m 53s | |
19m 59s |
multi_mma
with mfma_i32_16x16x32_i8
#3206
Job | Run time |
---|---|
6s | |
19m 53s | |
19m 59s |