[PATCH] D143762: [AMDGPU] Enable whole wave register copy

Christudasan Devadasan via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jul 4 09:58:06 PDT 2023


cdevadas added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp:1356
+
+  addPass(&SILowerWWMCopiesID);
   return true;
----------------
I'm still not convinced why this is needed in the -O0 flow?
By now, the VGPR allocation is done in the -O0 flow, and we no longer have any virtual registers. This pass act on virtual registers to see if wwm copies needed exec manipulation.


================
Comment at: llvm/lib/Target/AMDGPU/SILowerWWMCopies.cpp:90
+void SILowerWWMCopies::addToWWMSpills(MachineFunction &MF, Register Reg) {
+  if (Reg.isPhysical())
+    return;
----------------
If this pass is enabled for for -O0 as well, put back the !VRM check to ensure an early return from here for -O0 flow.
And may be a comment too?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D143762/new/

https://reviews.llvm.org/D143762



More information about the llvm-commits mailing list