[llvm] r364342 - Update phis in AMDGPUUnifyDivergentExitNodes
Diego Novillo via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 25 11:55:16 PDT 2019
Author: dnovillo
Date: Tue Jun 25 11:55:16 2019
New Revision: 364342
URL: http://llvm.org/viewvc/llvm-project?rev=364342&view=rev
Log:
Update phis in AMDGPUUnifyDivergentExitNodes
Original patch https://reviews.llvm.org/D63659 from
Steven Perron <stevenperron at google.com>
The pass AMDGPUUnifyDivergentExitNodes does not update the phi nodes in
the successors of blocks that is splits. This is fixed by calling
BasicBlock::splitBasicBlock to split the block instead of doing it
manually. This does extra work because a new conditional branch is
created in BB which is immediately replaced, but I think the simplicity
is worth it. It also helps make the code more future proof in case other
things need to be updated.
Added:
llvm/trunk/test/CodeGen/AMDGPU/update-phi.ll
Modified:
llvm/trunk/lib/Target/AMDGPU/AMDGPUUnifyDivergentExitNodes.cpp
Modified: llvm/trunk/lib/Target/AMDGPU/AMDGPUUnifyDivergentExitNodes.cpp
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/AMDGPU/AMDGPUUnifyDivergentExitNodes.cpp?rev=364342&r1=364341&r2=364342&view=diff
==============================================================================
--- llvm/trunk/lib/Target/AMDGPU/AMDGPUUnifyDivergentExitNodes.cpp (original)
+++ llvm/trunk/lib/Target/AMDGPU/AMDGPUUnifyDivergentExitNodes.cpp Tue Jun 25 11:55:16 2019
@@ -198,14 +198,11 @@ bool AMDGPUUnifyDivergentExitNodes::runO
BranchInst::Create(LoopHeaderBB, DummyReturnBB, BoolTrue, BB);
} else { // Conditional branch.
// Create a new transition block to hold the conditional branch.
- BasicBlock *TransitionBB = BasicBlock::Create(F.getContext(),
- "TransitionBlock", &F);
+ BasicBlock *TransitionBB = BB->splitBasicBlock(BI, "TransitionBlock");
- // Move BI from BB to the new transition block.
- BI->removeFromParent();
- TransitionBB->getInstList().push_back(BI);
-
- // Create a branch that will always branch to the transition block.
+ // Create a branch that will always branch to the transition block and
+ // references DummyReturnBB.
+ BB->getTerminator()->eraseFromParent();
BranchInst::Create(TransitionBB, DummyReturnBB, BoolTrue, BB);
}
}
Added: llvm/trunk/test/CodeGen/AMDGPU/update-phi.ll
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/AMDGPU/update-phi.ll?rev=364342&view=auto
==============================================================================
--- llvm/trunk/test/CodeGen/AMDGPU/update-phi.ll (added)
+++ llvm/trunk/test/CodeGen/AMDGPU/update-phi.ll Tue Jun 25 11:55:16 2019
@@ -0,0 +1,39 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
+; RUN: opt -mtriple=amdgcn-- -S -amdgpu-unify-divergent-exit-nodes -verify %s | FileCheck -check-prefix=IR %s
+
+; Make sure that the phi in n28 is updated when the block is split by unify
+; divergent exit nodes.
+
+define amdgpu_ps void @_amdgpu_ps_main() local_unnamed_addr #3 {
+; IR-LABEL: @_amdgpu_ps_main(
+; IR-NEXT: .entry:
+; IR-NEXT: br label [[DOTLOOPEXIT:%.*]]
+; IR: .loopexit:
+; IR-NEXT: br label [[N28:%.*]]
+; IR: n28:
+; IR-NEXT: [[DOT01:%.*]] = phi float [ 0.000000e+00, [[DOTLOOPEXIT]] ], [ [[N29:%.*]], [[TRANSITIONBLOCK:%.*]] ]
+; IR-NEXT: [[N29]] = fadd float [[DOT01]], 1.000000e+00
+; IR-NEXT: [[N30:%.*]] = fcmp ogt float [[N29]], 4.000000e+00
+; IR-NEXT: br i1 true, label [[TRANSITIONBLOCK]], label [[DUMMYRETURNBLOCK:%.*]]
+; IR: TransitionBlock:
+; IR-NEXT: br i1 [[N30]], label [[DOTLOOPEXIT]], label [[N28]]
+; IR: n31:
+; IR-NEXT: ret void
+; IR: DummyReturnBlock:
+; IR-NEXT: ret void
+;
+.entry:
+ br label %.loopexit
+
+.loopexit: ; preds = %n28, %.entry
+ br label %n28
+
+n28: ; preds = %.loopexit, %n28
+ %.01 = phi float [ 0.000000e+00, %.loopexit ], [ %n29, %n28 ]
+ %n29 = fadd float %.01, 1.0
+ %n30 = fcmp ogt float %n29, 4.000000e+00
+ br i1 %n30, label %.loopexit, label %n28
+
+n31: ; preds =
+ ret void
+}
More information about the llvm-commits
mailing list