[all-commits] [llvm/llvm-project] 53aad3: [AMDGPU] Introduce "amdgpu-uniform-intrinsic-combi...

Thu Oct 9 00:15:18 PDT 2025

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 53aad35208d00c8382b62b1d23005938aea77469
      https://github.com/llvm/llvm-project/commit/53aad35208d00c8382b62b1d23005938aea77469
  Author: Pankaj Dwivedi <pankajkumar.divedi at amd.com>
  Date:   2025-10-09 (Thu, 09 Oct 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPU.h
    M llvm/lib/Target/AMDGPU/AMDGPUPassRegistry.def
    M llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
    A llvm/lib/Target/AMDGPU/AMDGPUUniformIntrinsicCombine.cpp
    M llvm/lib/Target/AMDGPU/CMakeLists.txt
    A llvm/test/CodeGen/AMDGPU/amdgpu-simplify-uniform-waterfall.ll
    A llvm/test/CodeGen/AMDGPU/amdgpu-uniform-intrinsic-combine.ll
    A llvm/test/CodeGen/AMDGPU/amdgpu-uniform-temporal-divergence.ll

  Log Message:
  -----------
  [AMDGPU] Introduce "amdgpu-uniform-intrinsic-combine" pass to combine uniform AMDGPU lane Intrinsics. (#116953)

This pass introduces optimizations for AMDGPU intrinsics by leveraging
the uniformity of their arguments. When an intrinsic's arguments are
detected as uniform, redundant computations are eliminated, and the
intrinsic calls are simplified accordingly.

By utilizing the UniformityInfo analysis, this pass identifies cases
where intrinsic calls are uniform across all lanes, allowing
transformations that reduce unnecessary operations and improve the IR's
efficiency.

These changes enhance performance by streamlining intrinsic usage in
uniform scenarios without altering the program's semantics.
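The idea behind this kind of fold can be sketched with a toy model in plain C++. This is only an illustration, not the code added in AMDGPUUniformIntrinsicCombine.cpp: it uses no LLVM API, and the 8-lane wave, the names isUniform, readFirstLane, equalOnActiveLanes and foldReadFirstLane, and the choice of a readfirstlane-style operation as the example are all assumptions made here. The log message does not say which intrinsics the pass handles. The sketch shows that when every active lane already holds the same value, broadcasting the first active lane's value changes nothing, so the operation can be replaced by its operand.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Toy wave of 8 lanes (hypothetical size, chosen only for the example).
constexpr std::size_t kLanes = 8;
using LaneValues = std::array<std::uint32_t, kLanes>;
using ExecMask = std::array<bool, kLanes>;

// Hypothetical stand-in for a uniformity query: true when all active
// lanes hold the same value (trivially true with no active lanes).
bool isUniform(const LaneValues &v, const ExecMask &exec) {
  bool seen = false;
  std::uint32_t first = 0;
  for (std::size_t i = 0; i < kLanes; ++i) {
    if (!exec[i])
      continue;
    if (!seen) {
      first = v[i];
      seen = true;
    } else if (v[i] != first) {
      return false;
    }
  }
  return true;
}

// Model of a readfirstlane-style operation: every active lane receives the
// value held by the first active lane. Inactive lanes are left unchanged,
// which is a simplification made by this model.
LaneValues readFirstLane(const LaneValues &v, const ExecMask &exec) {
  LaneValues out = v;
  for (std::size_t i = 0; i < kLanes; ++i) {
    if (exec[i]) {
      const std::uint32_t first = v[i];
      for (std::size_t j = 0; j < kLanes; ++j)
        if (exec[j])
          out[j] = first;
      break;
    }
  }
  return out;
}

// Compare two per-lane results on active lanes only.
bool equalOnActiveLanes(const LaneValues &a, const LaneValues &b,
                        const ExecMask &exec) {
  for (std::size_t i = 0; i < kLanes; ++i)
    if (exec[i] && a[i] != b[i])
      return false;
  return true;
}

// The fold: if the operand is uniform, skip the cross-lane operation and
// return the operand itself; otherwise perform the operation as usual.
LaneValues foldReadFirstLane(const LaneValues &v, const ExecMask &exec) {
  if (isUniform(v, exec)) {
    // Sanity check: the fold must agree with the unfolded operation.
    assert(equalOnActiveLanes(v, readFirstLane(v, exec), exec));
    return v;
  }
  return readFirstLane(v, exec);
}
```

In this model the folded and unfolded results agree on every active lane whenever the operand is uniform, which is the sense in which such a rewrite leaves the program's semantics unchanged. A divergent operand takes the ordinary path and is not folded.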

For background, see PR #99878.
