[PATCH] D38949: AMDGPU: Merge S_BUFFER_LOAD_DWORD_IMM into x2, x4

Nicolai Hähnle via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Oct 23 08:32:01 PDT 2017


nhaehnle accepted this revision.
nhaehnle added a comment.
This revision is now accepted and ready to land.

One comment inline, apart from that LGTM. I imagine some of those additional spills could be improved with better register allocation, and that's a bullet someone has to bite eventually, but not now.



================
Comment at: lib/Target/AMDGPU/AMDGPUSubtarget.h:326-328
+    // Only use the "x1" variants on GFX9 or don't use the buffer variants.
+    // TODO: There is one future GFX9 chip that doesn't have this bug.
+    return getGeneration() != GFX9;
----------------
Could you add an explanation of what the bug is in the comment? (I think it may be the page-crossing bug that I vaguely remember but don't find the reference to right now, in which case we could still consider merging stores that are properly aligned in a future patch.)


https://reviews.llvm.org/D38949





More information about the llvm-commits mailing list