[clang] [libc] [Clang] Fix cross-lane scan when given divergent lanes (PR #127703)
Shilei Tian via cfe-commits
cfe-commits at lists.llvm.org
Wed Feb 19 14:32:51 PST 2025
================
@@ -188,6 +186,32 @@ __DO_LANE_SCAN(float, uint32_t, f32); // float __gpu_lane_scan_f32(m, x)
__DO_LANE_SCAN(double, uint64_t, f64); // double __gpu_lane_scan_f64(m, x)
#undef __DO_LANE_SCAN
+// Gets the sum of all lanes inside the warp or wavefront.
----------------
shiltian wrote:
nit: why do you need to sink this macro?
https://github.com/llvm/llvm-project/pull/127703
More information about the cfe-commits
mailing list