[Openmp-commits] [PATCH] D74145: [OpenMP][Offloading] Added support for multiple streams so that multiple kernels can be executed concurrently
Shilei Tian via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Sat Feb 8 14:20:51 PST 2020
tianshilei1992 added inline comments.
================
Comment at: openmp/libomptarget/plugins/cuda/src/rtl.cpp:246
+ // By default let's create 32 streams per device
+ EnvNumStreams = 32;
+ envStr = getenv("LIBOMPTARGET_NUM_STREAMS");
----------------
jdoerfert wrote:
> The hardware will cap the number internally anyway so we should go higher here. Maybe 256?
Sure
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D74145/new/
https://reviews.llvm.org/D74145
More information about the Openmp-commits
mailing list