[PATCH] D143762: [AMDGPU] Enable whole wave register copy

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jun 23 06:14:50 PDT 2023


arsenm added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/SILowerPredicatedCopies.cpp:159
+        LLVM_DEBUG(dbgs() << "Updated " << MI << " to use COPY opcode");
+        MI.setDesc(TII->get(AMDGPU::COPY));
+        Changed |= true;
----------------
arsenm wrote:
> arsenm wrote:
> > yassingh wrote:
> > > arsenm wrote:
> > > > Do these need to gain an implicit exec use?
> > > Do you mean we should add the implicit exec? In that case, SIFixVGPRCopies will take care?
> > This isn't something that can be taken care of later. SIFixVGPRCopies is a horribly broken hack, the less we depend on it the better
> Actually these split copies might have been the original problem which caused it to be added. Maybe we have a way to drop it now?
That was the reasoning given in D28874 when it was added. How about as a next step we make sure all the VGPR splits end up with exec reads


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D143762/new/

https://reviews.llvm.org/D143762



More information about the llvm-commits mailing list