[PATCH] D46008: [X86][AArch64][NFC] Add tests for vector masked merge unfolding

Sanjay Patel via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun May 20 08:56:34 PDT 2018


spatel accepted this revision.
spatel added a comment.
This revision is now accepted and ready to land.

In https://reviews.llvm.org/D46008#1101250, @lebedev.ri wrote:

> One observation i can make - under `SSE1`, the `v8i32` are scalarized, not split into 2x `v4i32`.
>  It looks like a bug?


I think that's the same thing I mentioned in https://reviews.llvm.org/D46528. We're not transforming to x86-specific opcodes early, so legalization does the safe thing for the illegal types - scalarization. Nobody cares about the perf of SSE1-only at this point, so it's not worth fixing IMO. That also means it's not worth testing so thoroughly here (and I think even x86 fans' eyeballs would fall out before they get anywhere close to confirming those entire asm sequences are correct!).

So I'd second Simon's opinion that we just remove the SSE1 run, but if you insist...LGTM. :)


Repository:
  rL LLVM

https://reviews.llvm.org/D46008





More information about the llvm-commits mailing list