[PATCH] D60512: [X86][AVX] Combine shuffles extracted from a common vector

Sanjay Patel via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Apr 25 12:55:48 PDT 2019


spatel accepted this revision.
spatel added a comment.
This revision is now accepted and ready to land.

LGTM



================
Comment at: test/CodeGen/X86/avx512-shuffles/partial_permute.ll:4411
+; CHECK-NEXT:    vmovapd {{.*#+}} zmm2 = <1,4,u,u,u,u,u,u>
+; CHECK-NEXT:    vpermpd (%rdi), %zmm2, %zmm2
 ; CHECK-NEXT:    vxorpd %xmm3, %xmm3, %xmm3
----------------
RKSimon wrote:
> This is annoying - I'd be happier if the mask in zmm2 was at least reduced to a <1,4> xmm load (or broadcasted) but that's a common issue in a lot of the masks generated in this patch.
Mark this with a TODO and/or file a bug report, so we don't lose track?


Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D60512/new/

https://reviews.llvm.org/D60512





More information about the llvm-commits mailing list