[all-commits] [llvm/llvm-project] bc27a3: [AMDGPU] Fix copyPhysReg to not produce unalined v...
Stanislav Mekhanoshin via All-commits
all-commits at lists.llvm.org
Mon Mar 15 14:14:52 PDT 2021
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: bc27a31801a3ec79e860aba9ac5df77a4d73e330
https://github.com/llvm/llvm-project/commit/bc27a31801a3ec79e860aba9ac5df77a4d73e330
Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
Date: 2021-03-15 (Mon, 15 Mar 2021)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
M llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
M llvm/lib/Target/AMDGPU/SIRegisterInfo.h
M llvm/test/CodeGen/AMDGPU/copy_phys_vgpr64.mir
Log Message:
-----------
[AMDGPU] Fix copyPhysReg to not produce unalined vgpr access
RA can insert something like a sub1_sub2 COPY of a wide VGPR
tuple which results in the unaligned acces with v_pk_mov_b32
after the copy is expanded. This is regression after D97316.
Differential Revision: https://reviews.llvm.org/D98549
More information about the All-commits
mailing list