[cfe-dev] Inefficient code generation for _mm_test{z, c, nzc} (SSE4.1)

Florian Pflug fgp at phlo.org
Thu Apr 12 03:23:01 PDT 2012


On Apr12, 2012, at 08:38 , Craig Topper wrote:
> Interestingly, the AVX ptest intrinsics are correctly taking 4 x i64 arguments. I'll fix the 128-bit versions to take 2 x i64.

Wouldn't it also make more sense for the PTEST intrinsics to return an i1, not an i32? One could then use the results of
llvm.x86.sse41.ptest{z,c,nc} directly as a condition for br, without the intermediate icmp step. Or so I imagine, at least.

best regards,
Florian Pflug





More information about the cfe-dev mailing list