[PATCH] D143762: [AMDGPU] Enable whole wave register copy
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jun 23 06:14:50 PDT 2023
arsenm added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SILowerPredicatedCopies.cpp:159
+ LLVM_DEBUG(dbgs() << "Updated " << MI << " to use COPY opcode");
+ MI.setDesc(TII->get(AMDGPU::COPY));
+ Changed |= true;
----------------
arsenm wrote:
> arsenm wrote:
> > yassingh wrote:
> > > arsenm wrote:
> > > > Do these need to gain an implicit exec use?
> > > Do you mean we should add the implicit exec? In that case, SIFixVGPRCopies will take care?
> > This isn't something that can be taken care of later. SIFixVGPRCopies is a horribly broken hack, the less we depend on it the better
> Actually these split copies might have been the original problem which caused it to be added. Maybe we have a way to drop it now?
That was the reasoning given in D28874 when it was added. How about as a next step we make sure all the VGPR splits end up with exec reads
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D143762/new/
https://reviews.llvm.org/D143762
More information about the llvm-commits
mailing list