forked from NVIDIA/cutlass
-
Notifications
You must be signed in to change notification settings - Fork 28
Pull requests: codeplaysoftware/cutlass-sycl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix the performance regression for flash attention for 2025.1 release compiler
#327
opened Apr 24, 2025 by
mehdi-goli
Loading…
Correct Tflops, bandwidth calculation with causal mask for FlashAttention
#322
opened Apr 22, 2025 by
min-jean-cho
Loading…
Fix benchmarks (& examples) where sizes exceed max int32
#313
opened Apr 16, 2025 by
joeatodd
Loading…
RFC: test out new syntax for launch with type deduction
#305
opened Apr 12, 2025 by
rolandschulz
Loading…
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.