[PATCH] D21021: [Clang][AVX512][BuiltIn]Adding intrinsics move_{sd|ss} to clang
Simon Pilgrim via cfe-commits
cfe-commits at lists.llvm.org
Fri Sep 16 12:32:52 PDT 2016
RKSimon added a subscriber: RKSimon.
================
Comment at: lib/Headers/avx512fintrin.h:9124
@@ +9123,3 @@
+{
+ return (__m128) __builtin_ia32_movss_mask ((__v4sf) __A, (__v4sf) __B,
+ (__v4sf) __W,
----------------
delena wrote:
> please try the following:
> if (__U)
> return __builtin_shuffle(A, B, (0, 5, 6, 7)); // may be you need to swap A and B
> return W;
>
> I know that the immediate code will be less optimal, but we can optimize it later.
Any update on this? I currently have a patch (D24653) looking at removing the movss/movsd mask intrinsics as we should be able to do this with purely generic shuffles. I can help with the optimization if necessary.
https://reviews.llvm.org/D21021
More information about the cfe-commits
mailing list