[PATCH] D115807: AMDGPU: Use v_accvgpr_mov_b32 when copying AGPR tuples on gfx90a
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 15 09:01:40 PST 2021
arsenm created this revision.
arsenm added a reviewer: rampitec.
Herald added subscribers: foad, kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, nhaehnle, jvesely, kzhuravl.
arsenm requested review of this revision.
Herald added a subscriber: wdng.
Herald added a project: LLVM.
This is an optimization, but also fixes a compile failure when no free
VGPRs are available. The problem still exists for gfx908 where a
scratch register is still required. This also still exists for the
SGPR to AGPR case.
https://reviews.llvm.org/D115807
Files:
llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
llvm/test/CodeGen/AMDGPU/accvgpr-copy.mir
llvm/test/CodeGen/AMDGPU/agpr-copy-no-vgprs.mir
llvm/test/CodeGen/AMDGPU/agpr-copy-sgpr-no-vgprs.mir
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D115807.394587.patch
Type: text/x-patch
Size: 42445 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20211215/6062c2d1/attachment.bin>
More information about the llvm-commits
mailing list