[PATCH] D68428: [X86] Add custom type legalization for v16i64->v16i8 truncate and v8i64->v8i8 truncate when v8i64 isn't legal
Simon Pilgrim via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sat Oct 5 09:41:08 PDT 2019
RKSimon added inline comments.
================
Comment at: llvm/test/CodeGen/X86/min-legal-vector-width.ll:836
+; CHECK-NEXT: vpmovqb %ymm0, %xmm0
+; CHECK-NEXT: vpunpckldq {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1]
+; CHECK-NEXT: vzeroupper
----------------
craig.topper wrote:
> The loss of the VPERMI2B here is a regression, but the VTRUNC form should allow us to create saturating VTRUNCs for the cases in vector-trunc-ssat.ll and vector-trunc-usat.ll. So maybe we need a late shuffle combine to VPERMI2B?
We don't currently support VTRUNC in shuffle combining as we're still weak at handling conflicting vector sizes - is that limitation ok for now?
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D68428/new/
https://reviews.llvm.org/D68428
More information about the llvm-commits
mailing list