zvi added a comment.

In https://reviews.llvm.org/D33311#758420, @RKSimon wrote:

> Why don't you use MOVMSKPD/MOVMSKPS for the 32/64 bit cases and avoid the vector truncation?

That's a good point. Thanks!

https://reviews.llvm.org/D33311
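
For readers following the quoted suggestion, here is a rough scalar model of what MOVMSKPS and MOVMSKPD compute: each collects the sign bit of every 32-bit or 64-bit lane into the low bits of an integer, so a compare result with all-ones/all-zeros lanes can be turned into a bitmask directly, without first narrowing the lanes. This is only an illustrative sketch and is not code from the patch; the names `movmskps_model`, `movmskpd_model` and `cmpgt_model` are hypothetical.

```cpp
#include <array>
#include <cstdint>

// Hypothetical scalar model of MOVMSKPS: bit i of the result is the
// sign bit (bit 31) of 32-bit lane i.
inline unsigned movmskps_model(const std::array<std::uint32_t, 4>& lanes) {
    unsigned mask = 0;
    for (unsigned i = 0; i < 4; ++i)
        mask |= ((lanes[i] >> 31) & 1u) << i;
    return mask;
}

// Hypothetical scalar model of MOVMSKPD: bit i of the result is the
// sign bit (bit 63) of 64-bit lane i.
inline unsigned movmskpd_model(const std::array<std::uint64_t, 2>& lanes) {
    unsigned mask = 0;
    for (unsigned i = 0; i < 2; ++i)
        mask |= static_cast<unsigned>((lanes[i] >> 63) & 1u) << i;
    return mask;
}

// Hypothetical model of a signed greater-than vector compare: each lane
// becomes all ones when a[i] > b[i] and all zeros otherwise.
inline std::array<std::uint32_t, 4> cmpgt_model(
        const std::array<std::int32_t, 4>& a,
        const std::array<std::int32_t, 4>& b) {
    std::array<std::uint32_t, 4> out{};
    for (unsigned i = 0; i < 4; ++i)
        out[i] = (a[i] > b[i]) ? 0xFFFFFFFFu : 0u;
    return out;
}
```

Since every lane of a compare result is either all ones or all zeros, its sign bit alone carries the lane's boolean value, which is why a sign-bit extraction is enough for the 32-bit and 64-bit element cases.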