[Openmp-commits] [PATCH] D86038: [OpenMP][CUDA] Cache the maximal number of threads per block (per kernel)

Jon Chesterfield via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Sun Aug 16 09:27:08 PDT 2020


JonChesterfield accepted this revision.
JonChesterfield added a comment.
This revision is now accepted and ready to land.

LGTM



================
Comment at: openmp/libomptarget/plugins/cuda/src/rtl.cpp:892
+      Err = cuFuncGetAttribute(&KernelInfo->MaxThreadsPerBlock,
+                               CU_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK,
+                               KernelInfo->Func);
----------------
tab


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D86038/new/

https://reviews.llvm.org/D86038



More information about the Openmp-commits mailing list