[PATCH] D40215: [X86][AVX512] Use PACKSS/PACKUS for vXi16->vXi8 truncations without BWI.
Elena Demikhovsky via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sat Nov 18 22:44:27 PST 2017
delena added inline comments.
================
Comment at: test/CodeGen/X86/vector-compare-results.ll:320
; AVX512F-NEXT: vpcmpgtw %ymm1, %ymm0, %ymm0
-; AVX512F-NEXT: vpmovsxwd %ymm0, %zmm0
-; AVX512F-NEXT: vpmovdb %zmm0, %xmm0
+; AVX512F-NEXT: vextracti128 $1, %ymm0, %xmm1
+; AVX512F-NEXT: vpacksswb %xmm1, %xmm0, %xmm0
----------------
You are inserting AVX2 instructions instead of AVX-512, right? If yes, the prev code is better, since we have more registers in AVX-512.
Repository:
rL LLVM
https://reviews.llvm.org/D40215
More information about the llvm-commits
mailing list