[llvm] [X86] SimplifyDemandedVectorEltsForTargetNode - reduce the size of VPERMV/VPERMV3 nodes if the upper elements are not demanded (PR #133923)

Phoebe Wang via llvm-commits llvm-commits at lists.llvm.org
Wed Apr 2 02:28:02 PDT 2025


================
@@ -43814,6 +43815,66 @@ bool X86TargetLowering::SimplifyDemandedVectorEltsForTargetNode(
       }
       break;
     }
+    case X86ISD::VPERMV: {
+      SmallVector<int, 16> Mask;
+      SmallVector<SDValue, 2> Ops;
+      // TODO: Handle 128-bit PERMD/Q -> PSHUFD
+      if (Subtarget.hasVLX() &&
+          (VT.is512BitVector() || VT.getScalarSizeInBits() <= 16) &&
+          getTargetShuffleMask(Op, /*AllowSentinelZero=*/false, Ops, Mask)) {
+        // For lane-crossing shuffles, only split in half in case we're still
+        // referencing higher elements.
+        unsigned HalfElts = NumElts / 2;
+        unsigned HalfSize = SizeInBits / 2;
+        Mask.resize(HalfElts);
+        if (all_of(Mask,
+                   [&](int M) { return isUndefOrInRange(M, 0, HalfElts); })) {
----------------
phoebewang wrote:

Where did we check the upper elements are not demanded?

https://github.com/llvm/llvm-project/pull/133923


More information about the llvm-commits mailing list