[all-commits] [llvm/llvm-project] bfcfd5: [AMDGPU] Add GFX11 llvm.amdgcn.permlane64 intrinsic

Mon Jun 13 13:12:23 PDT 2022

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: bfcfd53b9244874b9807409a01407fd9e1d5d3e3
      https://github.com/llvm/llvm-project/commit/bfcfd53b9244874b9807409a01407fd9e1d5d3e3
  Author: Jay Foad <jay.foad at amd.com>
  Date:   2022-06-13 (Mon, 13 Jun 2022)

  Changed paths:
    M llvm/include/llvm/IR/IntrinsicsAMDGPU.td
    M llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp
    M llvm/lib/Target/AMDGPU/AMDGPURegisterBankInfo.cpp
    M llvm/lib/Target/AMDGPU/VOP1Instructions.td
    M llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
    A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.permlane64.ll
    A llvm/test/Transforms/InstCombine/AMDGPU/permlane64.ll

  Log Message:
  -----------
  [AMDGPU] Add GFX11 llvm.amdgcn.permlane64 intrinsic

Compared to permlane16, permlane64 has no BC input because it has no
boundary conditions, no FI input because the instruction acts as if FI
were always enabled, and no OLD input because it always writes to every
active lane.
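
As a rough illustration of the paragraph above, here is a lane-level
model in Python. It is a sketch under assumptions, not the backend's
implementation: the lane mapping (lane i reads lane i XOR 32, i.e. the
two 32-lane halves of a wave64 are swapped) is a reading of the GFX11
V_PERMLANE64_B32 behaviour and is not stated in this message, the names
permlane64_model and exec_mask are hypothetical, and using None for
lanes that are not written is purely a modelling choice.

```python
WAVE64 = 64  # assumed wave size; the model only covers wave64


def permlane64_model(src, exec_mask=None):
    """Hypothetical lane-level model of permlane64 on a wave64.

    src       -- list of 64 per-lane values
    exec_mask -- list of 64 booleans (True = lane is active);
                 defaults to all lanes active
    """
    if len(src) != WAVE64:
        raise ValueError("expected one value per lane of a wave64")
    if exec_mask is None:
        exec_mask = [True] * WAVE64
    if len(exec_mask) != WAVE64:
        raise ValueError("expected one mask bit per lane of a wave64")

    result = []
    for lane in range(WAVE64):
        if exec_mask[lane]:
            # Every active lane is written, so no OLD fallback is needed.
            # The source lane is read even if it is inactive, which is
            # the "FI always enabled" behaviour. Every lane has a partner
            # in the other half, so there is no boundary case for BC.
            result.append(src[lane ^ 32])
        else:
            # Inactive lanes are not written; modelled here as None.
            result.append(None)
    return result
```

With src = list(range(64)) and all lanes active, lane 0 receives 32 and
lane 32 receives 0. If only lane 32 is masked off, lane 0 still
receives 32, while lane 32 itself is left unwritten.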

Also use the new intrinsic in the atomic optimizer pass.
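
One plausible way a wave-wide reduction can use the intrinsic is
sketched below in Python. The message does not describe the sequence
the pass actually emits, so everything here is an assumption made for
illustration: the helper names swap_halves, combine_halves and
wave_reduce_add are hypothetical, the half-swap mapping (lane i reads
lane i XOR 32) is assumed, and the per-half reduction that would
precede this step is replaced by plain Python sums.

```python
def swap_halves(per_lane):
    """Assumed permlane64 behaviour: lane i reads lane i ^ 32."""
    return [per_lane[lane ^ 32] for lane in range(64)]


def combine_halves(per_lane):
    """Add each lane's value to the value held by its partner lane."""
    swapped = swap_halves(per_lane)
    return [own + other for own, other in zip(per_lane, swapped)]


def wave_reduce_add(values):
    """Sketch of an add reduction over a wave64.

    Step 1 stands in for a reduction within each 32-lane half, after
    which every lane holds the total of its own half.
    Step 2 combines the two halves with a single half-swap, so every
    lane ends up holding the total of all 64 values.
    """
    if len(values) != 64:
        raise ValueError("expected one value per lane of a wave64")
    low_total = sum(values[:32])
    high_total = sum(values[32:])
    half_totals = [low_total] * 32 + [high_total] * 32
    return combine_halves(half_totals)
```

The point of the sketch is that one half-swap plus one binary operation
is enough to merge the two halves, because each lane's partner already
holds the other half's partial result.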

Differential Revision: https://reviews.llvm.org/D127662