[PATCH] D98549: [AMDGPU] Fix copyPhysReg to not produce unalined vgpr access
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 12 13:26:19 PST 2021
rampitec created this revision.
rampitec added a reviewer: arsenm.
Herald added subscribers: kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, nhaehnle, jvesely, kzhuravl.
rampitec requested review of this revision.
Herald added a subscriber: wdng.
Herald added a project: LLVM.
RA can insert something like a sub1_sub2 COPY of a wide VGPR
tuple which results in the unaligned acces with v_pk_mov_b32
after the copy is expanded. This is regression after D97316 <https://reviews.llvm.org/D97316>.
https://reviews.llvm.org/D98549
Files:
llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
llvm/test/CodeGen/AMDGPU/copy_phys_vgpr64.mir
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D98549.330356.patch
Type: text/x-patch
Size: 12956 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20210312/0fd182a0/attachment.bin>
More information about the llvm-commits
mailing list