
When would you support USE_FLASH_ATTENTION compile? #1252

Closed
xbcReal opened this issue Jul 7, 2023 · 1 comment

Comments


xbcReal commented Jul 7, 2023

🚀 The feature, motivation and pitch

Hi, I want a faster transformer implementation in PyTorch, and I found one in the PyTorch source under pytorch/aten/src/ATen/native/transformers/cuda/, which requires building with USE_FLASH_ATTENTION. Looking further, I found some inline PTX assembly in utils.h, and amd-pytorch doesn't support it yet. Do you have any plan to support this feature?
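
As a sanity check, a minimal sketch (assuming a PyTorch 2.x CUDA/ROCm build; the exact API has shifted between versions) that queries the scaled-dot-product-attention backend toggles at runtime. Note these report the runtime toggles only; a build compiled without USE_FLASH_ATTENTION simply never dispatches the flash kernel:

```python
# Minimal sketch, assuming a PyTorch 2.x build; these report the runtime
# toggles for the SDPA backends, which only matter when the build was
# compiled with the corresponding kernels (e.g. USE_FLASH_ATTENTION=1).
import torch

print("torch:", torch.__version__)
print("flash SDP toggled on:        ", torch.backends.cuda.flash_sdp_enabled())
print("mem-efficient SDP toggled on:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math SDP toggled on:         ", torch.backends.cuda.math_sdp_enabled())
```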

@xbcReal changed the title from "when would you support USE_FLASH_ATTENTION compile?" to "When would you support USE_FLASH_ATTENTION compile?" on Jul 7, 2023
@darren-amd

Hi @xbcReal,

This is now supported; support was added via the upstream issue pytorch#112997. Thanks for reporting it!
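
For anyone verifying on a ROCm build, here is a minimal sketch (assuming PyTorch 2.x, where `torch.backends.cuda.sdp_kernel` is a context manager; later releases supersede it with `torch.nn.attention.sdpa_kernel`) that forces the flash backend and fails loudly if the flash kernel cannot be dispatched:

```python
# Minimal sketch, assuming a PyTorch 2.x GPU build (CUDA or ROCm).
# Forcing the flash backend makes scaled_dot_product_attention raise a
# RuntimeError if flash attention was not compiled in or cannot run for
# these inputs, which is a direct check of USE_FLASH_ATTENTION support.
import torch
import torch.nn.functional as F

q = torch.randn(2, 8, 1024, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 8, 1024, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 8, 1024, 64, device="cuda", dtype=torch.float16)

with torch.backends.cuda.sdp_kernel(
    enable_flash=True, enable_math=False, enable_mem_efficient=False
):
    out = F.scaled_dot_product_attention(q, k, v)

print("flash attention kernel ran, output shape:", tuple(out.shape))
```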
