[llvm] [AMDGPU] Extend wave reduce intrinsics for i32 type (PR #126469)

Mon Jun 2 01:29:19 PDT 2025

================
@@ -2327,8 +2327,14 @@ class AMDGPUWaveReduce<LLVMType data_ty = llvm_anyint_ty> : Intrinsic<
     ],
     [IntrNoMem, IntrConvergent, IntrWillReturn, IntrNoCallback, IntrNoFree, ImmArg<ArgIndex<1>>]>;
 
-def int_amdgcn_wave_reduce_umin : AMDGPUWaveReduce;
-def int_amdgcn_wave_reduce_umax : AMDGPUWaveReduce;
+multiclass AMDGPUWaveReduceOps<list<string> Operations> {
+  foreach Op = Operations in { def Op : AMDGPUWaveReduce; }
+}
+
+defvar Operations = [
+  "umin", "min", "umax", "max", "uadd", "add", "usub", "sub", "and", "or", "xor"
----------------
jmmartinez wrote:

Why do we have to distinguish between sub/usub and add/uadd. The generated code seems to be the same.

Am I missing something?

https://github.com/llvm/llvm-project/pull/126469