[all-commits] [llvm/llvm-project] 53aad3: [AMDGPU] Introduce "amdgpu-uniform-intrinsic-combi...
Pankaj Dwivedi via All-commits
all-commits at lists.llvm.org
Thu Oct 9 00:15:18 PDT 2025
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 53aad35208d00c8382b62b1d23005938aea77469
https://github.com/llvm/llvm-project/commit/53aad35208d00c8382b62b1d23005938aea77469
Author: Pankaj Dwivedi <pankajkumar.divedi at amd.com>
Date: 2025-10-09 (Thu, 09 Oct 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPU.h
M llvm/lib/Target/AMDGPU/AMDGPUPassRegistry.def
M llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
A llvm/lib/Target/AMDGPU/AMDGPUUniformIntrinsicCombine.cpp
M llvm/lib/Target/AMDGPU/CMakeLists.txt
A llvm/test/CodeGen/AMDGPU/amdgpu-simplify-uniform-waterfall.ll
A llvm/test/CodeGen/AMDGPU/amdgpu-uniform-intrinsic-combine.ll
A llvm/test/CodeGen/AMDGPU/amdgpu-uniform-temporal-divergence.ll
Log Message:
-----------
[AMDGPU] Introduce "amdgpu-uniform-intrinsic-combine" pass to combine uniform AMDGPU lane Intrinsics. (#116953)
This pass introduces optimizations for AMDGPU intrinsics by leveraging
the uniformity of their arguments. When an intrinsic's arguments are
detected as uniform, redundant computations are eliminated, and the
intrinsic calls are simplified accordingly.
By utilizing the UniformityInfo analysis, this pass identifies cases
where intrinsic calls are uniform across all lanes, allowing
transformations that reduce unnecessary operations and improve the IR's
efficiency.
These changes enhance performance by streamlining intrinsic usage in
uniform scenarios without altering the program's semantics.
For background, see PR #99878.
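
The log message's core idea, that a lane intrinsic applied to a uniform argument is redundant, can be illustrated with a toy model. The sketch below is not code from the commit and does not use any LLVM API. The names `Wave`, `kLanes`, `isUniform`, `readFirstLane`, and `combinedReadFirstLane` are hypothetical, the wave width of 8 is arbitrary, and a "read first lane" broadcast is assumed here only as a representative lane operation. In the real pass the uniformity answer would come from the UniformityInfo analysis at compile time rather than from inspecting values at run time.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Toy model of one wavefront: one 32-bit value per lane.
// The width is arbitrary and chosen only for illustration.
constexpr std::size_t kLanes = 8;
using Wave = std::array<std::int32_t, kLanes>;

// A value is uniform when every lane holds the same value.
bool isUniform(const Wave &v) {
  for (std::size_t i = 1; i < kLanes; ++i)
    if (v[i] != v[0])
      return false;
  return true;
}

// Models a "read first lane" style operation: broadcast lane 0's value
// to every lane. Assumes all lanes are active.
Wave readFirstLane(const Wave &v) {
  Wave out;
  out.fill(v[0]);
  return out;
}

// Hypothetical combine: when the operand is uniform, the broadcast cannot
// change anything, so the operand itself is returned and the operation is
// skipped. `folded` reports whether the shortcut was taken.
Wave combinedReadFirstLane(const Wave &v, bool &folded) {
  folded = isUniform(v);
  return folded ? v : readFirstLane(v);
}
```

In this model the combined form always returns the same result as the plain broadcast, which mirrors the log message's claim that the simplification does not alter program semantics. Only the uniform case skips the operation.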