[Openmp-commits] [PATCH] D45326: [OpenMP] [CUDA plugin] Add support for teams reduction via scratchpad
George Rokos via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Thu Apr 5 08:21:13 PDT 2018
grokos created this revision.
grokos added reviewers: Hahnfeld, ABataev.
grokos added a project: OpenMP.
Herald added a subscriber: guansong.
This patch adds support for teams reduction to the CUDA plugin. The number of variables to be reduced and their size are passed from the compiler to the plugin via a struct of kernel computation properties (which also includes the execution mode). Before launching a kernel, the plugin allocates the scratchpad used for the reduction and passes a pointer to it as the last kernel parameter.
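For readers without the diff at hand, the flow described above can be sketched roughly as follows. This is a host-only illustration, not the code from the patch: the struct name and fields (KernelComputationProps, ReductionVarSize, ...), the helper names, and the "one slot per team" sizing rule are all assumptions made for the example, and std::malloc stands in for the device allocation the plugin would actually perform.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <vector>

// Hypothetical layout of the properties the compiler hands to the plugin.
// The real struct in the patch may use different names and fields.
struct KernelComputationProps {
  int8_t ExecutionMode;     // execution mode of the kernel
  int32_t NumReductionVars; // number of variables to be reduced
  int32_t ReductionVarSize; // assumed: combined size in bytes of those variables
};

// Assumed sizing rule: one slot per team, each large enough to hold all
// reduction variables. Returns 0 when the kernel performs no reduction.
size_t scratchpadBytes(const KernelComputationProps &Props, int NumTeams) {
  assert(NumTeams > 0 && "kernel must be launched with at least one team");
  if (Props.NumReductionVars <= 0)
    return 0;
  return static_cast<size_t>(Props.ReductionVarSize) *
         static_cast<size_t>(NumTeams);
}

// Host-side stand-in for the device allocation done before the launch.
void *allocScratchpad(size_t Bytes) {
  return Bytes ? std::malloc(Bytes) : nullptr;
}

// Build the launch argument array: the user's arguments first, then the
// address of the scratchpad pointer as the last entry. Each entry points
// at the storage of one kernel parameter.
std::vector<void *> buildKernelArgs(const std::vector<void *> &UserArgs,
                                    void **ScratchpadSlot) {
  std::vector<void *> Args(UserArgs);
  Args.push_back(ScratchpadSlot);
  return Args;
}
```

In this sketch the caller would compute scratchpadBytes() from the kernel's properties and the team count, allocate the buffer, and hand the vector returned by buildKernelArgs() to the launch call, so the kernel sees the scratchpad as its final parameter.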
Repository:
rOMP OpenMP
https://reviews.llvm.org/D45326
Files:
libomptarget/plugins/cuda/src/rtl.cpp
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D45326.141159.patch
Type: text/x-patch
Size: 6391 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/openmp-commits/attachments/20180405/3ec0b1a8/attachment.bin>