[PATCH] D40215: [X86][AVX512] Use PACKSS/PACKUS for vXi16->vXi8 truncations without BWI.

Elena Demikhovsky via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sat Nov 18 22:44:27 PST 2017


delena added inline comments.


================
Comment at: test/CodeGen/X86/vector-compare-results.ll:320
 ; AVX512F-NEXT:    vpcmpgtw %ymm1, %ymm0, %ymm0
-; AVX512F-NEXT:    vpmovsxwd %ymm0, %zmm0
-; AVX512F-NEXT:    vpmovdb %zmm0, %xmm0
+; AVX512F-NEXT:    vextracti128 $1, %ymm0, %xmm1
+; AVX512F-NEXT:    vpacksswb %xmm1, %xmm0, %xmm0
----------------
You are inserting AVX2 instructions instead of AVX-512, right? If yes, the prev code is better, since we have more registers in AVX-512.


Repository:
  rL LLVM

https://reviews.llvm.org/D40215





More information about the llvm-commits mailing list