[PATCH] D31161: [AMDGPU] New Waitcnt Insertion Pass
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Mar 23 18:22:00 PDT 2017
rampitec added a comment.
Test with barrier surrounded with fences is needed. All relevant combinations of fences needs to be checked and pattern shall check that only needed counters are used, wait is produced, and that is only one wait. Please refer to OpenCL barrier() implementation for the code to check.
================
Comment at: lib/Target/AMDGPU/SIInsertWaitcnts.cpp:1005
+ // TODO: Tie force zero to a compiler triage option.
+ bool ForceZero = false;
+
----------------
Not needed anymore.
https://reviews.llvm.org/D31161
More information about the llvm-commits
mailing list