[PATCH] D44267: Remove SRAs from v16i8 multiply lowering on sse2 targets

Simon Pilgrim via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Mar 8 16:59:31 PST 2018


RKSimon accepted this revision.
RKSimon added a comment.
This revision is now accepted and ready to land.

LGTM



================
Comment at: lib/Target/X86/X86ISelLowering.cpp:22322
     } else {
-      const int ShufMask[] = {-1, 0, -1, 1, -1, 2, -1, 3,
-                              -1, 4, -1, 5, -1, 6, -1, 7};
+      // We're going mask off the low byte of each result element of the mullw,
+      // so it doesn't matter what's in the high byte of each 16-bit element.
----------------
Typo: "going mask" should be "going to mask".


================
Comment at: lib/Target/X86/X86ISelLowering.cpp:22342
     } else {
-      const int ShufMask[] = {-1, 8,  -1, 9,  -1, 10, -1, 11,
-                              -1, 12, -1, 13, -1, 14, -1, 15};
+      // We're going mask off the low byte of each result element of the mullw,
+      // so it doesn't matter what's in the high byte of each 16-bit element.
----------------
Same typo here: "going mask" should be "going to mask".
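
The reasoning in the new code comment (the high byte of each 16-bit input element does not affect the low byte of the mullw result) can be checked outside of LLVM with plain integer arithmetic. The following is a minimal sketch, not code from the patch: the function names lowByteOfMul16 and highByteIsIrrelevant are hypothetical, and the scalar multiply only models a single 16-bit lane of the vector instruction.

```cpp
#include <cstdint>

// Models one 16-bit lane: multiply, then keep only the low byte
// (hypothetical helper, not part of X86ISelLowering.cpp).
static uint8_t lowByteOfMul16(uint16_t a, uint16_t b) {
  return static_cast<uint8_t>((static_cast<uint32_t>(a) * b) & 0xFF);
}

// Returns true if, for every pair of low bytes, the sampled high-byte
// patterns leave the low byte of the product unchanged.
static bool highByteIsIrrelevant() {
  // Sampled high-byte contents: zero, all ones (as sign extension of a
  // negative byte would give), and a few arbitrary patterns.
  const uint16_t highBytes[] = {0x0000, 0xFF00, 0x8000, 0x5A00, 0xA500};
  for (unsigned a = 0; a < 256; ++a) {
    for (unsigned b = 0; b < 256; ++b) {
      // Reference result: the 8-bit product of the low bytes alone.
      const uint8_t expected = static_cast<uint8_t>(a * b);
      for (uint16_t ha : highBytes)
        for (uint16_t hb : highBytes)
          if (lowByteOfMul16(ha | a, hb | b) != expected)
            return false;
    }
  }
  return true;
}
```

The check passes because (a * b) mod 256 depends only on a mod 256 and b mod 256, which is why the inputs do not need to be sign extended before the multiply.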


https://reviews.llvm.org/D44267




