[Openmp-commits] [PATCH] D70414: [libomptarget] Build a minimal deviceRTL for amdgcn
Jon Chesterfield via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Thu Nov 21 11:09:31 PST 2019
JonChesterfield marked 2 inline comments as done.
JonChesterfield added inline comments.
================
Comment at: openmp/libomptarget/deviceRTLs/amdgcn/CMakeLists.txt:136
+ )
+endforeach()
----------------
jdoerfert wrote:
> Nit: I'm not a cmake person but I am surprised to see "cuda" above so many times, e.g. in file names?
The source is (currently still) cuda so it's not too bad that variables refer to cuda. I didn't rename anything from Greg's original script. Happy to rename the variables before or after commit.
I'd like to put the remaining cuda intrinsics behind an API and rename the files to .cpp, but the renaming caused problems with nvcc that I am not yet sure how to solve.
================
Comment at: openmp/libomptarget/deviceRTLs/amdgcn/src/target_impl.h:114
+EXTERN int32_t __kmpc_impl_shfl_down_sync(__kmpc_impl_lanemask_t, int32_t Var,
+ uint32_t Delta, int32_t Width);
----------------
jdoerfert wrote:
> Where did these implementations go?
The previous implementation called through to functions in a support library with fairly ugly prototypes.
I'm planning to implement these in target_impl.hip, which doesn't exist yet. Until then they're just another couple of missing symbols.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D70414/new/
https://reviews.llvm.org/D70414
More information about the Openmp-commits
mailing list