[PATCH] D137639: [SLP]Fix PR58863: Mask index beyond mask size for non-power-2 insertelement analysis.

Alexey Bataev via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Nov 8 07:06:47 PST 2022


ABataev updated this revision to Diff 473994.
ABataev added a comment.

Address comment


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D137639/new/

https://reviews.llvm.org/D137639

Files:
  llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
  llvm/test/Transforms/SLPVectorizer/slp-non-pow-2-insertelement.ll


Index: llvm/test/Transforms/SLPVectorizer/slp-non-pow-2-insertelement.ll
===================================================================
--- /dev/null
+++ llvm/test/Transforms/SLPVectorizer/slp-non-pow-2-insertelement.ll
@@ -0,0 +1,19 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
+; RUN: opt -S -passes=slp-vectorizer < %s | FileCheck %s
+
+define void @PR58863() {
+; CHECK-LABEL: @PR58863(
+; CHECK-NEXT:  entry:
+; CHECK-NEXT:    [[MUL_I:%.*]] = fmul float poison, poison
+; CHECK-NEXT:    [[MUL11_I:%.*]] = fmul float poison, poison
+; CHECK-NEXT:    [[I:%.*]] = insertelement <3 x float> <float poison, float 0.000000e+00, float poison>, float [[MUL_I]], i64 0
+; CHECK-NEXT:    [[I1:%.*]] = insertelement <3 x float> [[I]], float [[MUL11_I]], i64 2
+; CHECK-NEXT:    ret void
+;
+entry:
+  %mul.i = fmul float poison, poison
+  %mul11.i = fmul float poison, poison
+  %i = insertelement <3 x float> <float poison, float 0.000000e+00, float poison>, float %mul.i, i64 0
+  %i1 = insertelement <3 x float> %i, float %mul11.i, i64 2
+  ret void
+}
Index: llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
===================================================================
--- llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
+++ llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
@@ -6593,7 +6593,8 @@
           if (Mask[I] != UndefMaskElem)
             Mask[I] = I + VecSz;
         for (unsigned I = OffsetEnd + 1 - Offset; I < VecSz; ++I)
-          Mask[I] = InMask.test(I) ? UndefMaskElem : I;
+          Mask[I] =
+              ((I >= InMask.size()) || InMask.test(I)) ? UndefMaskElem : I;
         Cost += TTI->getShuffleCost(TTI::SK_PermuteTwoSrc, InsertVecTy, Mask);
       }
     }


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D137639.473994.patch
Type: text/x-patch
Size: 1730 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20221108/82a8eea7/attachment.bin>


More information about the llvm-commits mailing list