[PATCH] D48043: [x86] eliminate more sign-bit tests with vector select
Sanjay Patel via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 12 07:56:11 PDT 2018
spatel added inline comments.
================
Comment at: test/CodeGen/X86/vsel-cmp-load.ll:260
; AVX1-NEXT: vinsertf128 $1, %xmm2, %ymm3, %ymm2
; AVX1-NEXT: vblendvps %ymm2, %ymm0, %ymm1, %ymm0
; AVX1-NEXT: retq
----------------
spatel wrote:
> spatel wrote:
> > RKSimon wrote:
> > > How come this folds but the AVX1 case in slt_zero above doesn't?
> > AVX1 is more complicated due to ISA limitations, so I was planning to catch that one next. There, we've split the PCMPGT into halves, so I'll need to match a pattern with a concat:
> >
> > t41: v4i32 = X86ISD::PCMPGT t37, t32
> > t31: v8i16 = vector_shuffle<4,5,6,7,u,u,u,u> t28, undef:v8i16
> > t33: v4i32 = sign_extend_vector_inreg t31
> > t42: v4i32 = X86ISD::PCMPGT t37, t33
> > t40: v8i32 = concat_vectors t41, t42
> > t4: v8i32,ch = CopyFromReg t0, Register:v8i32 %1
> > t6: v8i32,ch = CopyFromReg t0, Register:v8i32 %2
> > t23: v8i32 = vselect t40, t4, t6
> >
> Or probably easier - we match the pattern after type legalization, but before vector op legalization:
> t21: v8i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
> t22: v8i32 = setcc t24, t21, setlt:ch
> t4: v8i32,ch = CopyFromReg t0, Register:v8i32 %1
> t6: v8i32,ch = CopyFromReg t0, Register:v8i32 %2
> t23: v8i32 = vselect t22, t4, t6
>
See D48078 for an implementation of that suggestion.
https://reviews.llvm.org/D48043
More information about the llvm-commits
mailing list