[Openmp-commits] [PATCH] D93135: [libomptarget][devicertl] Port amdgcn devicertl to openmp
Jon Chesterfield via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Fri Dec 11 11:31:42 PST 2020
JonChesterfield added inline comments.
================
Comment at: openmp/libomptarget/deviceRTLs/amdgcn/CMakeLists.txt:101
macro(add_cuda_bc_library)
set(cu_cmd ${AOMP_BINDIR}/clang++
-std=c++14
----------------
Trunk clang doesn't accept these flags just yet, but trunk clang also refuses to compile this code as -x hip so this isn't much of a regression.
================
Comment at: openmp/libomptarget/deviceRTLs/common/src/omp_data.cu:36
-DEVICE SHARED uint8_t parallelLevel[MAX_THREADS_PER_TEAM / WARPSIZE];
-DEVICE SHARED uint16_t threadLimit;
-DEVICE SHARED uint16_t threadsInTeam;
-DEVICE SHARED uint16_t nThreads;
+#ifdef _OPENMP
+DEVICE [[clang::loader_uninitialized]] uint8_t
----------------
This is the only array variable, cleaner to write the pragma allocate here than to introduce another macro
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D93135/new/
https://reviews.llvm.org/D93135
More information about the Openmp-commits
mailing list