[all-commits] [llvm/llvm-project] bc27a3: [AMDGPU] Fix copyPhysReg to not produce unalined v...

Stanislav Mekhanoshin via All-commits all-commits at lists.llvm.org
Mon Mar 15 14:14:52 PDT 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: bc27a31801a3ec79e860aba9ac5df77a4d73e330
      https://github.com/llvm/llvm-project/commit/bc27a31801a3ec79e860aba9ac5df77a4d73e330
  Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin at amd.com>
  Date:   2021-03-15 (Mon, 15 Mar 2021)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIInstrInfo.cpp
    M llvm/lib/Target/AMDGPU/SIRegisterInfo.cpp
    M llvm/lib/Target/AMDGPU/SIRegisterInfo.h
    M llvm/test/CodeGen/AMDGPU/copy_phys_vgpr64.mir

  Log Message:
  -----------
  [AMDGPU] Fix copyPhysReg to not produce unalined vgpr access

RA can insert something like a sub1_sub2 COPY of a wide VGPR
tuple which results in the unaligned acces with v_pk_mov_b32
after the copy is expanded. This is regression after D97316.

Differential Revision: https://reviews.llvm.org/D98549




More information about the All-commits mailing list