[PATCH] D100124: [Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX redux.sync instructions
Jon Chesterfield via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 12 15:03:20 PDT 2021
JonChesterfield added a comment.
Interesting. Reduction across lanes in warp? If so, this is probably a way to handle the last step reduction for openmp reductions
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D100124/new/
https://reviews.llvm.org/D100124
More information about the llvm-commits
mailing list