[PATCH] D21021: [Clang][AVX512][BuiltIn]Adding intrinsics move_{sd|ss} to clang

Simon Pilgrim via cfe-commits cfe-commits at lists.llvm.org
Fri Sep 16 12:32:52 PDT 2016


RKSimon added a subscriber: RKSimon.

================
Comment at: lib/Headers/avx512fintrin.h:9124
@@ +9123,3 @@
+{
+  return (__m128) __builtin_ia32_movss_mask ((__v4sf) __A, (__v4sf) __B,
+               (__v4sf) __W,
----------------
delena wrote:
> please try the following:
> if (__U)
>   return __builtin_shuffle(A, B, (0, 5, 6, 7)); // may be you need to swap A and B 
>  return W;
> 
> I know that the immediate code will be less optimal, but we can optimize it later.
Any update on this? I currently have a patch (D24653) looking at removing the movss/movsd mask intrinsics as we should be able to do this with purely generic shuffles. I can help with the optimization if necessary.


https://reviews.llvm.org/D21021





More information about the cfe-commits mailing list