[PATCH] D55602: AMDGPU/InsertWaitcnts: Update VGPR/SGPR bounds when brackets are merged
Carl Ritson via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 12 07:46:52 PST 2018
critson created this revision.
critson added a reviewer: nhaehnle.
Herald added subscribers: llvm-commits, jfb, t-tye, tpr, dstuttard, yaxunl, wdng, jvesely, kzhuravl, arsenm.
Fix an issue where VGPR/SGPR bounds are not properly extended when brackets are merged.
This manifests as missing waitcnt insertions when multiple brackets are forwarded to a successor block and the first bracket forwarded has lower VGPR/SGPR bounds than a later one.
The irreducible loop test has been extended based on a CTS failure detected on GFX9.
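To illustrate the failure mode, here is a minimal toy model of the merge. It is not the code in SIInsertWaitcnts.cpp: the `Bracket` struct, its members, and the `ExtendBounds` flag are all hypothetical names, and the assumption is only that queries ignore registers above the tracked upper bound.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical, simplified stand-in for a per-block score bracket.
// Only VGPRs are modelled; SGPRs would be tracked the same way.
struct Bracket {
  int VgprUB = -1;              // highest VGPR with a pending score, -1 if none
  std::vector<unsigned> Scores; // pending score per VGPR, 0 means nothing pending

  // Record a pending score and grow the upper bound to include Reg.
  void setScore(int Reg, unsigned Score) {
    if (Reg >= static_cast<int>(Scores.size()))
      Scores.resize(Reg + 1, 0);
    Scores[Reg] = Score;
    VgprUB = std::max(VgprUB, Reg);
  }

  // Assumption: queries only consider registers within the tracked bound.
  bool needsWait(int Reg) const {
    return Reg <= VgprUB && Reg < static_cast<int>(Scores.size()) &&
           Scores[Reg] != 0;
  }

  // Merge state forwarded from another predecessor block.
  // ExtendBounds = false models the bug; true models the intended behaviour.
  void merge(const Bracket &Other, bool ExtendBounds) {
    if (ExtendBounds)
      VgprUB = std::max(VgprUB, Other.VgprUB);
    if (Other.Scores.size() > Scores.size())
      Scores.resize(Other.Scores.size(), 0);
    for (std::size_t I = 0; I < Other.Scores.size(); ++I)
      Scores[I] = std::max(Scores[I], Other.Scores[I]);
  }
};
```

In this sketch, if the first forwarded bracket only touches v2 and a second one has a pending score on v7, merging without extending the bound leaves `VgprUB` at 2, so a later query on v7 reports that no wait is needed. Extending the bound during the merge makes the pending score on v7 visible again.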
Repository:
rL LLVM
https://reviews.llvm.org/D55602
Files:
lib/Target/AMDGPU/SIInsertWaitcnts.cpp
test/CodeGen/AMDGPU/waitcnt-loop-irreducible.mir
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D55602.177855.patch
Type: text/x-patch
Size: 3335 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20181212/e2acec78/attachment.bin>