[PATCH] D154822: Support '-fgpu-default-stream=per-thread' for NVIDIA CUDA
boxu.zhang via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Mon Jul 10 19:37:32 PDT 2023
boxu-zhang added a comment.
In D154822#4485700 <https://reviews.llvm.org/D154822#4485700>, @tra wrote:
> Looking at CUDA headers, it appears that changing only compiler-generated-glue may be insufficient. A lot of other CUDA API calls need to be changed to `_ptsz` variant and for that we need to have `CUDA_API_PER_THREAD_DEFAULT_STREAM` defined.
CUDA_API_PER_THREAD_DEFAULT_STREAM is defined now.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D154822/new/
https://reviews.llvm.org/D154822
More information about the cfe-commits
mailing list