
Mi50 Support #29

Open

YehowshuaScaled opened this issue Dec 31, 2023 · 3 comments

@YehowshuaScaled
I was able to build flash-attention ROCm for both my Mi100 and Mi50 cards, but only got flash attention working on the Mi100 (very impressive performance, I might add).

Trying to run flash attention on the Mi50 delivered the following error:
RuntimeError: DeviceGroupedMultiheadAttentionForward_Xdl_CShuffle_V2<256, 128, 128, 32, 8, 8, 128, 128, 32, 2, Default, ASpecDefault, B0SpecDefault, B1SpecDefault, CSpecDefault, MaskUpperTriangleFromTopLeft> does not support this problem

How hard would it be to port FA to the Mi50? I'm happy to pay/hire for support on this, as I have a rather large stockpile of Mi50s.
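In the meantime, here's a rough workaround I've been considering (a sketch only, assuming flash-attn's public flash_attn_func API; the try/except fallback to PyTorch's built-in attention is my own idea, not anything the maintainers recommend):

```python
# Sketch only: fall back to plain PyTorch attention when the CK-based
# flash-attn kernel rejects the problem (as it does here on the Mi50 / gfx906).
# flash_attn_func is flash-attn's public API; the try/except fallback is my
# own workaround, not an officially supported path.
import torch.nn.functional as F
from flash_attn import flash_attn_func

def attention_with_fallback(q, k, v, causal=True):
    # q, k, v: (batch, seqlen, nheads, headdim) fp16/bf16 tensors on the GPU
    try:
        return flash_attn_func(q, k, v, causal=causal)
    except RuntimeError:
        # CK raises "... does not support this problem" on unsupported archs;
        # use the standard (memory-hungry) attention path instead.
        q_, k_, v_ = (t.transpose(1, 2) for t in (q, k, v))  # -> (b, nheads, seqlen, headdim)
        out = F.scaled_dot_product_attention(q_, k_, v_, is_causal=causal)
        return out.transpose(1, 2)  # back to (batch, seqlen, nheads, headdim)
```

Obviously that loses the memory and speed benefits of FA on the Mi50s, so a real port is still what I'm after.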

@jayz0123 commented Jan 23, 2024

Hi @YehowshuaScaled. I think it would be better to ask the CK team whether they plan to support the MI50. It won't be an issue if they have FA kernels running on the MI50.

RuntimeError: DeviceGroupedMultiheadAttentionForward_Xdl_CShuffle_V2<256, 128, 128, 32, 8, 8, 128, 128, 32, 2, Default, ASpecDefault, B0SpecDefault, B1SpecDefault, CSpecDefault, MaskUpperTriangleFromTopLeft> does not support this problem

This error is actually raised from the CK backend.

@differentprogramming

I noticed this line in setup.py:
allowed_archs = ["native", "gfx90a", "gfx908", "gfx940", "gfx941", "gfx942"]
I'm sad that gfx906 isn't there, since I have an MI50 as well.
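For anyone else checking their card, a quick sketch of how you can verify the architecture name (gcnArchName is what ROCm builds of PyTorch expose, as far as I know; the list below just mirrors the setup.py line quoted above):

```python
# Sketch only: check whether the local GPU's architecture matches the list
# that flash-attention's setup.py builds for. gcnArchName is exposed by ROCm
# builds of PyTorch; an Mi50 reports gfx906, which is not in the list, so no
# kernels get compiled for it even though the build itself succeeds.
import torch

allowed_archs = ["native", "gfx90a", "gfx908", "gfx940", "gfx941", "gfx942"]

if torch.cuda.is_available():
    arch = torch.cuda.get_device_properties(0).gcnArchName  # e.g. "gfx906:sramecc+:xnack-"
    base = arch.split(":")[0]  # drop feature flags, keep "gfx906"
    print(f"{base} supported by this build: {base in allowed_archs}")
```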

@linchen111

I was able to build flash-attention ROCm for both my Mi100 and Mi50 cards, but only got flash attention working on the Mi100 (very impressive performance, I might add).

Trying to run flash attention on the Mi50 delivered the following error:
RuntimeError: DeviceGroupedMultiheadAttentionForward_Xdl_CShuffle_V2<256, 128, 128, 32, 8, 8, 128, 128, 32, 2, Default, ASpecDefault, B0SpecDefault, B1SpecDefault, CSpecDefault, MaskUpperTriangleFromTopLeft> does not support this problem

How hard would it be to port FA to the Mi50? I'm happy to pay/hire for support on this, as I have a rather large stockpile of Mi50s.

Did you solve this?
