[PATCH] D38949: AMDGPU: Merge S_BUFFER_LOAD_DWORD_IMM into x2, x4

Marek Olšák via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Oct 31 12:05:58 PDT 2017


mareko added inline comments.


================
Comment at: lib/Target/AMDGPU/AMDGPUSubtarget.h:326-328
+    // Only use the "x1" variants on GFX9 or don't use the buffer variants.
+    // TODO: There is one future GFX9 chip that doesn't have this bug.
+    return getGeneration() != GFX9;
----------------
nhaehnle wrote:
> Could you add an explanation of what the bug is in the comment? (I think it may be the page-crossing bug that I vaguely remember but don't find the reference to right now, in which case we could still consider merging stores that are properly aligned in a future patch.)
The bug is very well documented internally at the usual place.


https://reviews.llvm.org/D38949





More information about the llvm-commits mailing list