[PATCH] D71929: AMDGPU/GlobalISel: Refine SMRD selection rules

Thu Jan 2 12:59:30 PST 2020

arsenm marked an inline comment as done.
arsenm added inline comments.

================
Comment at: llvm/lib/Target/AMDGPU/AMDGPURegisterBankInfo.cpp:414
+  // There are no extending SMRD/SMEM loads, and they require 4-byte alignment.
   return MMO->getSize() >= 4 && MMO->getAlignment() >= 4 &&
+    // Can't do a scalar atomic load.
----------------
rampitec wrote:
> arsenm wrote:
> > rampitec wrote:
> > > '(MMO->getSize() % 4) == 0'? What if we need 6 bytes, like v3i16?
> > If it was legal, the alignment is high enough
> Not necessarily, a vload() may result in underaligned loads. Also we cannot really load a subdword even if aligned, we would need to zero/sign extend it manually.
The legalizer has already run by this point and would have broken down totally illegal loads. If it is sufficiently aligned, extra bits can be loaded. The high bits do not need to be considered here. The G_ZEXTLOAD/G_SEXTLOAD are not legal, so cases where the high bits matter will not reach here

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D71929/new/

https://reviews.llvm.org/D71929