Add support for memory efficient attention for AMD/ROCm #1390

Closed

Looong01 opened this issue Apr 15, 2024 · 3 comments

@Looong01
🚀 The feature, motivation and pitch

Enable support for the Flash Attention and memory-efficient SDPA kernels on AMD GPUs.

At present, using these produces the warning below with the latest nightlies (torch==2.4.0.dev20240413+rocm6.0, pytorch-triton-rocm 3.0.0+0a22a91d04):

/site-packages/diffusers/models/attention_processor.py:1117: UserWarning: 1Torch was not compiled with memory efficient attention. (Triggered internally at ../aten/src/ATen/native/transformers/hip/sdp_utils.cpp:505.)
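
For reference, a minimal sketch of the kind of call that triggers this warning, assuming a recent PyTorch (>= 2.3) where torch.nn.attention.sdpa_kernel is available; the tensor shapes are purely illustrative:

import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

# Illustrative shapes: (batch, heads, seq_len, head_dim). On ROCm the HIP
# device is still addressed as "cuda" in PyTorch.
q = torch.randn(1, 8, 1024, 64, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Restrict SDPA to the memory-efficient backend; on a ROCm build without the
# kernels this is where the UserWarning above appears and the call can fail.
with sdpa_kernel(SDPBackend.EFFICIENT_ATTENTION):
    out = F.scaled_dot_product_attention(q, k, v)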

Alternatives

Without this support, users cannot use the native PyTorch SDPA APIs with memory-efficient attention on AMD GPUs.

Additional context

No response

@Epliz

Epliz commented May 3, 2024

Hi,

Not sure what the status is, but it looks like AMD has been working on it: pytorch#114309

@taylding-amd

Hi @Looong01, the support should already be included. Please refer to this issue for more details: pytorch#112997
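
As a quick way to check what a given build enables, something like the following should work (a minimal sketch; these helpers report whether a backend is enabled in the build/runtime, not whether every input shape is supported):

import torch

# torch.version.hip is a version string on ROCm builds and None otherwise.
print("ROCm/HIP build:", torch.version.hip is not None)
print("flash SDP enabled:", torch.backends.cuda.flash_sdp_enabled())
print("mem-efficient SDP enabled:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math SDP enabled:", torch.backends.cuda.math_sdp_enabled())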

@Looong01
Author

Looong01 commented Dec 3, 2024

Thanks!

Looong01 closed this as completed Dec 3, 2024