[PATCH] D53160: AMDGPU: Avoid selecting ds_{read,write}2_b32 on SI

Nicolai Hähnle via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Oct 16 11:34:39 PDT 2018


nhaehnle added inline comments.


================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:6711
+    if (Subtarget->getGeneration() == AMDGPUSubtarget::SOUTHERN_ISLANDS &&
+        NumElements == 2 && VT.getStoreSize() == 8 &&
+        Store->getAlignment() < 8) {
----------------
arsenm wrote:
> NumElements == 2 is redundant and possibly wrong?
I don't know. We shouldn't have unaligned i64 stores at this point, I guess, but the check does ensure that we're really dealing with a vector store. And NumElements > 2 is dealt with above.
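
To make the role of the NumElements check concrete, here is a minimal standalone sketch of the quoted condition. It does not use the real LLVM API: Generation, StoreInfo and hitsSIBranch are hypothetical stand-ins, and modelling a scalar i64 as NumElements == 1 is an assumption made only for illustration.

```cpp
// Hypothetical stand-ins for the LLVM types in the quoted hunk; not the real API.
enum class Generation { SouthernIslands, SeaIslands, VolcanicIslands };

struct StoreInfo {
  unsigned NumElements;    // element count; 1 models a scalar such as i64 (assumption)
  unsigned StoreSizeBytes; // models VT.getStoreSize()
  unsigned AlignmentBytes; // models Store->getAlignment()
};

// Mirrors the quoted condition: SI only, exactly two elements,
// 8 bytes in total, and alignment below 8 bytes.
constexpr bool hitsSIBranch(Generation Gen, const StoreInfo &S) {
  return Gen == Generation::SouthernIslands && S.NumElements == 2 &&
         S.StoreSizeBytes == 8 && S.AlignmentBytes < 8;
}
```

With these stand-ins, a 4-byte-aligned two-element 8-byte store on SI satisfies the condition, while an 8-byte scalar with the same alignment does not. That is the only case where NumElements == 2 adds anything beyond the size and alignment checks.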


Repository:
  rL LLVM

https://reviews.llvm.org/D53160
