[PATCH] D18307: [X86][SSE] Add MULHS/MULHU custom lowering for i8 vectors

Thu Mar 24 14:08:02 PDT 2016

qcolombet accepted this revision.
qcolombet added a comment.
This revision is now accepted and ready to land.

Hi Simon,

LGTM. Couple of nits inlined.

Cheers,
-Quentin

================
Comment at: lib/Target/X86/X86ISelLowering.cpp:19094
@@ +19093,3 @@
+      // Before using ymm PACKUS we need to permute inputs to lower/upper xmm.
+      const int LoMask[] = {0,  1,  2,  3,  4,  5,  6,  7,
+                            16, 17, 18, 19, 20, 21, 22, 23};
----------------
Add that unlike the smaller PACKUS, the ymm variant interleaves the 128 bits of the both sources.
Without that in mind, the shuffle does not make sense, whereas it is definitely required :).

================
Comment at: lib/Target/X86/X86ISelLowering.cpp:19117
@@ +19116,3 @@
+
+  // Extract the lo parts and zero/sign extend to i16
+  SDValue ALo, BLo;
----------------
Period.

================
Comment at: lib/Target/X86/X86ISelLowering.cpp:19133
@@ +19132,3 @@
+
+  // Extract the hi parts and zero/sign extend to i16
+  SDValue AHi, BHi;
----------------
Period.

Repository:
  rL LLVM

http://reviews.llvm.org/D18307