[PATCH] D40091: [AMDGPU] Add options for waitcnt pass debugging; add instr count in debug output.
Mark Searles via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Nov 15 10:40:45 PST 2017
msearles created this revision.
Herald added subscribers: JDevlieghere, t-tye, tpr, dstuttard, yaxunl, nhaehnle, wdng, kzhuravl.
-amdgpu-waitcnt-forcezero={1|0} Force all waitcnt instrs to be emitted as s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
-amdgpu-waitcnt-forceexp=<n> Force emit a s_waitcnt expcnt(0) before the first <n> instrs
-amdgpu-waitcnt-forcelgkm=<n> Force emit a s_waitcnt lgkmcnt(0) before the first <n> instrs
-amdgpu-waitcnt-forcevm=<n> Force emit a s_waitcnt vmcnt(0) before the first <n> instrs
https://reviews.llvm.org/D40091
Files:
lib/Target/AMDGPU/SIInsertWaitcnts.cpp
test/CodeGen/AMDGPU/waitcnt-debug.mir
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D40091.123055.patch
Type: text/x-patch
Size: 6283 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20171115/2c4d38da/attachment.bin>
More information about the llvm-commits
mailing list