[Openmp-commits] [PATCH] D74145: [OpenMP][Offloading] Added support for multiple streams so that multiple kernels can be executed concurrently
Johannes Doerfert via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Tue Feb 11 11:56:08 PST 2020
jdoerfert added a comment.
I'll commit this one and D74258 <https://reviews.llvm.org/D74258> later.
================
Comment at: openmp/libomptarget/plugins/cuda/src/rtl.cpp:182
+ // Get the next stream on a given device in a round robin manner
+ CUstream &getNextStream(const int DeviceId) {
+ assert(DeviceId < static_cast<int>(NextStreamId.size()) &&
----------------
tianshilei1992 wrote:
> JonChesterfield wrote:
> > It looks like DeviceID should be unsigned here
> Well, yes, it should be. But if you take a look at where they're used, for example at line 725, you can see the declaration is `int32_t device_id`.
We can make it an unsigned here. I can do that before I commit as well.
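To make the signedness point concrete, here is a minimal sketch of a round-robin picker that keeps the `int32_t` device id used by the callers and checks it before indexing. It is not the code from the patch: `StreamPool`, `FakeStream` (a stand-in for `CUstream` so this builds without CUDA), and the per-device atomic counter are assumptions made for illustration only.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for CUstream; the real plugin would hold CUDA handles.
using FakeStream = int;

// Hypothetical pool: one list of streams and one counter per device.
class StreamPool {
  std::vector<std::vector<FakeStream>> Streams;
  // Atomics are not copyable, so they are held through unique_ptr.
  std::vector<std::unique_ptr<std::atomic<unsigned>>> NextStreamId;

public:
  StreamPool(int NumDevices, int StreamsPerDevice) {
    for (int D = 0; D < NumDevices; ++D) {
      Streams.emplace_back();
      for (int S = 0; S < StreamsPerDevice; ++S)
        Streams.back().push_back(D * 100 + S); // Fake handle value.
      NextStreamId.push_back(std::make_unique<std::atomic<unsigned>>(0u));
    }
  }

  // Get the next stream on a given device in a round robin manner.
  // The id stays signed to match the callers, so reject negative values
  // explicitly before converting it for the size comparison.
  FakeStream &getNextStream(const int32_t DeviceId) {
    assert(DeviceId >= 0 &&
           static_cast<std::size_t>(DeviceId) < NextStreamId.size() &&
           "Unexpected device id!");
    const unsigned Id =
        NextStreamId[DeviceId]->fetch_add(1, std::memory_order_relaxed);
    return Streams[DeviceId][Id % Streams[DeviceId].size()];
  }
};
```

With two devices and three streams each, repeated calls for device 0 cycle through its three handles and then wrap around, independently of device 1. Switching the parameter to an unsigned type would drop the `DeviceId >= 0` check but push the conversion out to every caller that still holds an `int32_t`.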
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D74145/new/
https://reviews.llvm.org/D74145