AVX-512 vpmovd2m: _mm256_movepi32_mask(), _mm512_movepi32_mask() #96204
-
Hi, I'm updating some code from AVX/AVX2/FMA to AVX10/256 and AVX10/512. There's cases where So I have a couple questions.
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 4 replies
-
|
Beta Was this translation helpful? Give feedback.
-
Thank you once again, Tanner! A follow on question, if I may: is there documentation I should consult that would be helpful to predicting which instructions are provided by As context, the optimized codegen from methods on
when writing System.Runtime.Intrinsics.X86 code. Not entirely sure all of these cases are possible—would have to check the hardware manuals and disassembly in places—and there's probably some I haven't encountered yet. |
Beta Was this translation helpful? Give feedback.
vector.ExtractMostSignificantBits()
will do the equivalent ofmovmsk*
for 128/256-bit scenarios andvpmov*2m
+kmov*
for 512-bit (only emittingkmov
for scenarios where the vector is determined to already be in a mask register, such as after a comparison operation)