xbcReal changed the title from "when would you support USE_FLASH_ATTENTION compile?" to "When would you support USE_FLASH_ATTENTION compile?" on Jul 7, 2023
🚀 The feature, motivation and pitch
Hi, I'm looking for a faster transformer implementation in PyTorch. I found one in the PyTorch source tree under pytorch/aten/src/ATen/native/transformers/cuda/, but it requires building with USE_FLASH_ATTENTION. Looking further, I noticed some inline PTX assembly in utils.h, which AMD/ROCm PyTorch does not support yet. Do you have any plan to support this feature?
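
For anyone checking whether their build has this feature, one way (not from this issue, just a sketch assuming PyTorch >= 2.0) is to query the SDPA backend flags and then try to run scaled_dot_product_attention with only the flash backend enabled. On a build compiled without USE_FLASH_ATTENTION, the call raises a RuntimeError instead of dispatching to the flash kernel:

```python
# Minimal sketch: probe whether this PyTorch build exposes the flash attention
# SDPA backend. Assumes PyTorch >= 2.0 with a CUDA/ROCm device available.
import torch
import torch.nn.functional as F

print("flash SDPA enabled in this build:", torch.backends.cuda.flash_sdp_enabled())

# (batch, heads, seq_len, head_dim) half-precision tensors, as the flash kernel expects
q = k = v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

# Allow only the flash backend; if it is unavailable (e.g. a ROCm build without
# USE_FLASH_ATTENTION), SDPA raises a RuntimeError rather than silently falling
# back to the math or mem-efficient kernels.
try:
    with torch.backends.cuda.sdp_kernel(
        enable_flash=True, enable_math=False, enable_mem_efficient=False
    ):
        out = F.scaled_dot_product_attention(q, k, v)
    print("flash attention kernel ran, output shape:", out.shape)
except RuntimeError as err:
    print("flash attention unavailable on this build:", err)
```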