[PATCH] D35523: [AMDGPU] refactor WQM pass in preparation for WWM (NFCI)

Nicolai Hähnle via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jul 20 14:39:54 PDT 2017


nhaehnle accepted this revision.
nhaehnle added a comment.
This revision is now accepted and ready to land.

One minor comment. Either way, LGTM.



================
Comment at: lib/Target/AMDGPU/SIWholeQuadMode.cpp:228-236
+  // Ignore if the flag is already encompassed by the existing needs.
+  if ((II.Needs & Flag) == Flag)
     return;
 
-  II.Needs = Flag;
+  // Remove any disabled states from the flag. The user that required it gets
+  // an undefined value in the helper lanes. For example, this can happen if
+  // the result of an atomic is used by instruction that requires WQM, where
----------------
I think you can mask out the Flag first.


https://reviews.llvm.org/D35523





More information about the llvm-commits mailing list