[GPU] Improve sdpa_opt kernel performance with flashattn2 softmax tricks. #40987
Job | Run time |
---|---|
22s | |
47s | |
5m 59s | |
0s | |
0s | |
0s | |
0s | |
0s | |
0s | |
0s | |
40s | |
2m 18s | |
2m 34s | |
1m 20s | |
0s | |
0s | |
0s | |
1s | |
14m 1s |