[PATCH] D28782: [AMDGPU] Do not allow register coalescer to create big superregs

Mon Jan 16 20:36:51 PST 2017

arsenm added inline comments.

================
Comment at: lib/Target/AMDGPU/SIRegisterInfo.cpp:1484-1486
+  unsigned SrcSize = SrcRC->getSize();
+  unsigned DstSize = DstRC->getSize();
+  unsigned NewSize = NewRC->getSize();
----------------
rampitec wrote:
> arsenm wrote:
> > This isn't being used for the spill size, so this is supposed to use getRegBitWidth
> What do you mean?
> 
> 
> ```
> /// getSize - Return the size of the register in bytes, which is also the size
> /// of a stack slot allocated to hold a spilled copy of this register.
> ```
Since https://reviews.llvm.org/D24631 the TargetRegisterClass is supposed to be considered only the spill size, which may be different from the register bit width

================
Comment at: lib/Target/AMDGPU/SIRegisterInfo.cpp:1491-1493
+  // Always allow dword and sub-dword coalescing.
+  if (SrcSize <= 4 || DstSize <= 4)
+    return true;
----------------
rampitec wrote:
> arsenm wrote:
> > We don't have sub-dword registers, so the < and comment are misleading
> This is for packed f16, we do not want to revisit this.
Even with packed f16 we don't have smaller sub registers than 4

Repository:
  rL LLVM

https://reviews.llvm.org/D28782