[libc-commits] [PATCH] D159276: [libc][gpu] Thread divergence fix on volta
Joseph Huber via Phabricator via libc-commits
libc-commits at lists.llvm.org
Thu Aug 31 06:20:25 PDT 2023
jhuber6 accepted this revision.
jhuber6 added a comment.
This revision is now accepted and ready to land.
LG With nits.
================
Comment at: libc/src/__support/GPU/amdgpu/utils.h:128-130
+[[clang::convergent]] LIBC_INLINE uint32_t broadcast_value(uint64_t lane_mask,
+ uint32_t x) {
+ (void)lane_mask;
----------------
You can just leave out the argument, same below.
================
Comment at: libc/src/__support/GPU/nvptx/utils.h:127
[[clang::convergent]] LIBC_INLINE uint64_t ballot(uint64_t lane_mask, bool x) {
+ uint32_t mask = static_cast<uint32_t>(lane_mask);
#if __CUDA_ARCH__ >= 600
----------------
Unrelated?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D159276/new/
https://reviews.llvm.org/D159276
More information about the libc-commits
mailing list