[Openmp-commits] [PATCH] D65836: Factor architecture dependent code out of loop.cu
Alexey Bataev via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Wed Aug 7 07:42:16 PDT 2019
ABataev added inline comments.
================
Comment at: openmp/libomptarget/deviceRTLs/nvptx/src/target_impl.h:17-18
+
+#define FORCEINLINE __forceinline__ __device__
+
+FORCEINLINE void __kmpc_impl_unpack(int64_t val, int32_t *lo, int32_t *hi) {
----------------
JonChesterfield wrote:
> ABataev wrote:
> > Better to use original `INLINE` macro defined in the project rather than to define the new one.
> I'd prefer that too, but INLINE maps to `__inline__`, rather than `__forceinline__`, and that leaves calls to these functions in the bitcode library for nvptx.
Then better to fix original `INLINE` macro and replace `__inline__` with `__forceinline__`. I assume we'd like to inline all the functions.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D65836/new/
https://reviews.llvm.org/D65836
More information about the Openmp-commits
mailing list