[Openmp-commits] [PATCH] D86038: [OpenMP][CUDA] Cache the maximal number of threads per block (per kernel)
Jon Chesterfield via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Sun Aug 16 09:27:08 PDT 2020
JonChesterfield accepted this revision.
JonChesterfield added a comment.
This revision is now accepted and ready to land.
LGTM
================
Comment at: openmp/libomptarget/plugins/cuda/src/rtl.cpp:892
+ Err = cuFuncGetAttribute(&KernelInfo->MaxThreadsPerBlock,
+ CU_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK,
+ KernelInfo->Func);
----------------
tab
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D86038/new/
https://reviews.llvm.org/D86038
More information about the Openmp-commits
mailing list