[llvm] [AMDGPU] Extend wave reduce intrinsics for i32 type (PR #126469)
via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 11 07:23:09 PST 2025
================
@@ -4940,6 +4940,28 @@ static MachineBasicBlock *emitIndirectDst(MachineInstr &MI,
return LoopBB;
}
+static uint32_t getInitialValueForWaveReduction(unsigned Opc) {
+ switch (Opc) {
+ case AMDGPU::S_MIN_U32:
+ return std::numeric_limits<uint32_t>::max();
+ case AMDGPU::S_MIN_I32:
+ return std::numeric_limits<int32_t>::max();
+ case AMDGPU::S_MAX_U32:
+ return std::numeric_limits<uint32_t>::min();
+ case AMDGPU::S_MAX_I32:
+ return std::numeric_limits<int32_t>::min();
+ case AMDGPU::S_ADD_I32:
+ case AMDGPU::S_SUB_I32:
+ case AMDGPU::S_OR_B32:
+ case AMDGPU::S_XOR_B32:
+ return std::numeric_limits<uint32_t>::min();
----------------
easyonaadit wrote:
The identity element for XOR is 0.
The same value is also used in the atomic optimizer pass.
https://github.com/llvm/llvm-project/pull/126469
More information about the llvm-commits
mailing list