[PATCH] D154822: Support '-fgpu-default-stream=per-thread' for NVIDIA CUDA

Artem Belevich via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Mon Jul 10 09:32:15 PDT 2023


tra added a reviewer: tra.
tra added a comment.

Looking at the CUDA headers, it appears that changing only the compiler-generated glue may be insufficient. Many other CUDA API calls also need to be switched to their `_ptsz` variants, and for that we need `CUDA_API_PER_THREAD_DEFAULT_STREAM` to be defined.
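
To illustrate the concern, here is a rough sketch of the kind of preprocessor remapping involved. It assumes the runtime headers redirect a public API name to its `_ptsz` entry point only when `CUDA_API_PER_THREAD_DEFAULT_STREAM` is defined. The names `demoMemcpyAsync`, `demoMemcpyAsync_ptsz`, `which_variant`, `is_per_thread` and `check_selection` are hypothetical stand-ins, not the real CUDA API, and the block does not include any CUDA header.

```cpp
// Simplified stand-in for header-level name remapping (hypothetical names,
// not the actual CUDA runtime declarations).
#include <cassert>
#include <cstring>

// Two entry points: one bound to the legacy default stream, one to the
// per-thread default stream.
inline const char *demoMemcpyAsync() { return "legacy"; }
inline const char *demoMemcpyAsync_ptsz() { return "per-thread"; }

// The "header" redirects the public name only when the macro is defined,
// so flipping compiler-generated launch glue alone would not affect it.
#if defined(CUDA_API_PER_THREAD_DEFAULT_STREAM)
#define demoMemcpyAsync demoMemcpyAsync_ptsz
#endif

// User code always spells the public name; the preprocessor picks the variant.
inline const char *which_variant() { return demoMemcpyAsync(); }

// True when the per-thread variant was selected at preprocessing time.
inline bool is_per_thread() {
  return std::strcmp(which_variant(), "per-thread") == 0;
}

// Sanity check: the selection must match the macro's presence.
inline void check_selection() {
#if defined(CUDA_API_PER_THREAD_DEFAULT_STREAM)
  assert(is_per_thread());
#else
  assert(!is_per_thread());
#endif
}
```

Compiled as-is, `which_variant()` resolves to the legacy function. Building with `-DCUDA_API_PER_THREAD_DEFAULT_STREAM` selects the `_ptsz` stand-in instead, which is why the option would presumably also need to define the macro rather than only change the launch glue.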


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D154822/new/

https://reviews.llvm.org/D154822


