[Openmp-commits] [clang] [llvm] [mlir] [openmp] [OpenMP][offload] Cross-team reductions with variable number of teams (PR #195102)

Johannes Doerfert via Openmp-commits openmp-commits at lists.llvm.org
Thu May 7 17:46:55 PDT 2026


================
@@ -61,6 +65,78 @@ static uint32_t gpu_irregular_simd_reduce(void *reduce_data,
   return (logical_lane_id == 0);
 }
 
+// Reduction within a block on the GPU.
+//
+// Template parameters:
+// - checkLiveness: Whether to check the liveness of the lanes. This is only
+//                  useful if gpu_block_reduce is called in a context where
+//                  L2 parallel regions are possible.
----------------
jdoerfert wrote:

L2 parallel regions are sequentialized, no? That should be the trivial case of num_threads == 1 handled in nvptx_parallel_reduce_nowait. Am I missing something?

https://github.com/llvm/llvm-project/pull/195102


More information about the Openmp-commits mailing list