[Openmp-commits] [PATCH] D74145: [OpenMP][Offloading] Added support for multiple streams so that multiple kernels can be executed concurrently

Ye Luo via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Fri Feb 7 01:33:50 PST 2020

ye-luo added a comment.

@jdoerfert I can try it on a test program. miniQMC is currently blocked by a linker failure. Is the "map" thread-safe now?

Comment at: openmp/libomptarget/plugins/cuda/src/rtl.cpp:525
         // need for device copies.
         cuMemcpyHtoD(cuptr, e->addr, sizeof(void *));
         DP("Copy linked variable host address (" DPxMOD ")"
jdoerfert wrote:
> We need the async versions on both the HtoD and the DtoH sides to use the streams. After the async call we have to wait on the stream directly, which keeps the operation synchronous but on a specific stream.
Along these lines, the H2D copy, the kernel, and the D2H copy could ideally be scheduled as a single unit in the tasking runtime and share one stream when they come from the same OpenMP pragma line.
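
To make the ordering idea concrete, here is a minimal sketch in standard C++. It is a toy model under stated assumptions, not the plugin code and not the CUDA API: `ToyStream` and `runTargetRegion` are hypothetical names, and a worker thread draining a FIFO queue stands in for a device stream. The three steps of one region are queued asynchronously on the same stream, and the caller then waits on that stream only.

```cpp
#include <cassert>
#include <condition_variable>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical stand-in for a device stream: one worker thread that runs
// queued operations strictly in FIFO order. Not a CUDA type.
class ToyStream {
public:
  ToyStream() : worker_([this] { run(); }) {}
  ~ToyStream() {
    {
      std::lock_guard<std::mutex> lock(mu_);
      done_ = true;
    }
    cv_.notify_all();
    worker_.join();
  }
  // Asynchronous: returns as soon as the operation is queued.
  void enqueue(std::function<void()> op) {
    {
      std::lock_guard<std::mutex> lock(mu_);
      ops_.push(std::move(op));
      ++pending_;
    }
    cv_.notify_all();
  }
  // Blocks until every operation queued on this stream has finished.
  void synchronize() {
    std::unique_lock<std::mutex> lock(mu_);
    idle_cv_.wait(lock, [this] { return pending_ == 0; });
  }

private:
  void run() {
    for (;;) {
      std::function<void()> op;
      {
        std::unique_lock<std::mutex> lock(mu_);
        cv_.wait(lock, [this] { return done_ || !ops_.empty(); });
        if (ops_.empty()) return; // shutting down and nothing left to run
        op = std::move(ops_.front());
        ops_.pop();
      }
      op(); // run outside the lock so enqueue() never blocks on the work
      {
        std::lock_guard<std::mutex> lock(mu_);
        assert(pending_ > 0); // every finished op was counted in enqueue()
        --pending_;
      }
      idle_cv_.notify_all();
    }
  }
  std::mutex mu_;
  std::condition_variable cv_, idle_cv_;
  std::queue<std::function<void()>> ops_;
  int pending_ = 0;
  bool done_ = false;
  std::thread worker_; // declared last so the members above exist first
};

// One modeled "target region": copy in, compute, copy out, all queued on
// the same stream, followed by a wait on that stream only.
std::vector<int> runTargetRegion(ToyStream &stream,
                                 const std::vector<int> &host_in) {
  std::vector<int> device(host_in.size()); // pretend device buffer
  std::vector<int> host_out(host_in.size());
  stream.enqueue([&] { device = host_in; });             // models H2D
  stream.enqueue([&] { for (int &x : device) x *= 2; }); // models the kernel
  stream.enqueue([&] { host_out = device; });            // models D2H
  stream.synchronize(); // references captured above stay valid until here
  return host_out;
}
```

In this model, two host threads that each own a `ToyStream` can call `runTargetRegion` at the same time without waiting on each other's work, which is the behavior the per-stream synchronization is meant to allow. Whether the real plugin can group the three steps this way depends on the tasking runtime, which the sketch does not model.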


