[llvm] [AMDGPU] Add wave reduce intrinsics for float types - 2 (PR #161815)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Thu Nov 20 11:26:21 PST 2025
================
@@ -5480,11 +5480,15 @@ static uint32_t getIdentityValueFor32BitWaveReduction(unsigned Opc) {
return std::numeric_limits<uint32_t>::min();
case AMDGPU::S_MAX_I32:
return std::numeric_limits<int32_t>::min();
+ case AMDGPU::V_SUB_F32_e64: // +0.0
+ return __builtin_bit_cast(uint32_t, +0.0f);
----------------
arsenm wrote:
This should not use a raw builtin like this. It's easier to just write the raw bit value.
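For illustration, a minimal standalone sketch of what "the raw bit value" amounts to here, assuming IEEE-754 binary32 floats. The helper `floatBits` and the constant names are hypothetical and are not taken from the patch or from LLVM.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not an LLVM API): returns the bit pattern of a float.
// Uses std::memcpy so it does not depend on a compiler-specific builtin.
static uint32_t floatBits(float F) {
  static_assert(sizeof(uint32_t) == sizeof(float), "expects a 32-bit float");
  uint32_t Bits;
  std::memcpy(&Bits, &F, sizeof(Bits));
  return Bits;
}

// The raw bit values written out directly, assuming IEEE-754 binary32.
constexpr uint32_t PosZeroF32Bits = 0x00000000u; // +0.0f
constexpr uint32_t NegZeroF32Bits = 0x80000000u; // -0.0f, differs only in the sign bit

// Hypothetical sanity check that the literals match the computed patterns.
static bool rawConstantsMatch() {
  return floatBits(+0.0f) == PosZeroF32Bits &&
         floatBits(-0.0f) == NegZeroF32Bits;
}
```

Under that assumption, the `+0.0` case in the hunk above could return the literal `0` (or `0x00000000`) with the existing `// +0.0` comment left in place to document intent.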
https://github.com/llvm/llvm-project/pull/161815