[PATCH] D22092: AMDGPU: Reduce the duration of whole-quad-mode

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Thu Sep 1 10:46:22 PDT 2016


arsenm accepted this revision.
arsenm added a comment.
This revision is now accepted and ready to land.

LGTM


================
Comment at: lib/Target/AMDGPU/SIWholeQuadMode.cpp:232-235
@@ +231,6 @@
+
+        // Since we're in machine SSA, we do not need to track physical
+        // registers across basic blocks.
+        if (Value->isPHIDef())
+          continue;
+
----------------
I thought I had an example similar to this a long time ago. Is duplicating the s_cmp an optimization that started working relatively recently? I guess you can ignore this for now, maybe you can do something with inline asm but it seems to be pretty broken with scc right now

================
Comment at: lib/Target/AMDGPU/SIWholeQuadMode.cpp:475
@@ -424,1 +474,3 @@
 
+    DEBUG(dbgs() << "  " << MI);
+
----------------
This looks like a contextless instruction printing which probably isn't helpful


https://reviews.llvm.org/D22092





More information about the llvm-commits mailing list