[PATCH] D46302: [LV] Fix for PR37248, Broadcast codegen incorrectly assumed vector loop body is single basic block
Hideki Saito via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue May 1 00:10:14 PDT 2018
hsaito created this revision.
hsaito added reviewers: rengolin, fhahn, hfinkel.
Herald added a subscriber: llvm-commits.
Broadcast code generation emitted instructions in pre-header, while the instruction they are dependent on in the vector loop body.
This resulted in an IL verification error ---- value used before defined.
Repository:
rL LLVM
https://reviews.llvm.org/D46302
Files:
lib/Transforms/Vectorize/LoopVectorize.cpp
test/Transforms/LoopVectorize/pr37248.ll
Index: lib/Transforms/Vectorize/LoopVectorize.cpp
===================================================================
--- lib/Transforms/Vectorize/LoopVectorize.cpp
+++ lib/Transforms/Vectorize/LoopVectorize.cpp
@@ -1739,7 +1739,8 @@
Value *InnerLoopVectorizer::getBroadcastInstrs(Value *V) {
// We need to place the broadcast of invariant variables outside the loop.
Instruction *Instr = dyn_cast<Instruction>(V);
- bool NewInstr = (Instr && Instr->getParent() == LoopVectorBody);
+ bool NewInstr = Instr &&
+ !DT->dominates(Instr->getParent(), LoopVectorPreHeader);
bool Invariant = OrigLoop->isLoopInvariant(V) && !NewInstr;
// Place the code for broadcasting invariant variables in the new preheader.
Index: test/Transforms/LoopVectorize/pr37248.ll
===================================================================
--- test/Transforms/LoopVectorize/pr37248.ll
+++ test/Transforms/LoopVectorize/pr37248.ll
@@ -0,0 +1,37 @@
+; RUN: opt -passes='loop-vectorize' -force-vector-width=2 -S < %s
+;
+; Forcing VF=2 to trigger vector code gen
+;
+; This is a test case that let's vectorizer's code gen to generate
+; more than one BasicBlocks in the loop body (emulated masked scatter)
+; for those targets that do not support masked scatter. Broadcast
+; code generation was previously dependent on loop body being
+; a single basic block and this test case exposed incorrect code gen
+; resulting in an assert in IL verification. Test passes if IL verification
+; does not fail.
+
+ at a = external global [2 x i16], align 1
+
+define void @f1() {
+entry:
+ br label %for.body
+
+for.body: ; preds = %land.end, %entry
+ %0 = phi i32 [ undef, %entry ], [ %dec, %land.end ]
+ br i1 undef, label %land.end, label %land.rhs
+
+land.rhs: ; preds = %for.body
+ %1 = load i32, i32* undef, align 1
+ br label %land.end
+
+land.end: ; preds = %land.rhs, %for.body
+ %2 = trunc i32 %0 to i16
+ %arrayidx = getelementptr inbounds [2 x i16], [2 x i16]* @a, i16 0, i16 %2
+ store i16 undef, i16* %arrayidx, align 1
+ %dec = add nsw i32 %0, -1
+ %cmp = icmp sgt i32 %0, 1
+ br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge
+
+for.cond.for.end_crit_edge: ; preds = %land.end
+ unreachable
+}
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D46302.144674.patch
Type: text/x-patch
Size: 2376 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180501/601b539e/attachment.bin>
More information about the llvm-commits
mailing list