[Openmp-commits] [PATCH] D45326: [OpenMP] [CUDA plugin] Add support for teams reduction via scratchpad

George Rokos via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Thu Apr 5 08:21:13 PDT 2018


grokos created this revision.
grokos added reviewers: Hahnfeld, ABataev.
grokos added a project: OpenMP.
Herald added a subscriber: guansong.

This patch adds support for teams reduction into the CUDA plugin. The number of variables to be reduced as well as their size are passed from the compiler to the plugin via a struct of kernel computation properties (which also includes the execution mode). Before a kernel is launched, the plugin allocates space for the scratchpad to be used for the reduction. A pointer to the allocated scratchpad is passed as the last parameter to the kernel at launch.


Repository:
  rOMP OpenMP

https://reviews.llvm.org/D45326

Files:
  libomptarget/plugins/cuda/src/rtl.cpp

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D45326.141159.patch
Type: text/x-patch
Size: 6391 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/openmp-commits/attachments/20180405/3ec0b1a8/attachment.bin>


More information about the Openmp-commits mailing list