[PATCH] D14254: [OpenMP] Initial implementation of OpenMP offloading library - libomptarget device RTLs.
Guansong Zhang via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 12 09:37:39 PST 2018
guansong added inline comments.
================
Comment at: libomptarget/deviceRTLs/nvptx/CMakeLists.txt:158
+ set(CUDA_ARCH "")
+ set(CUDA_ARCH --cuda-gpu-arch=sm_${LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY})
+
----------------
A CUDA install ships bitcode (bc) files for several architectures, for example:
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_35.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_30.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_50.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_20.10.bc
Should we consider building a separate bc file per architecture for the end user?
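As a rough sketch of what that could look like, the single CUDA_ARCH setting could become a loop over a list of compute capabilities. The list variable `nvptx_compute_capabilities` and the output file names below are hypothetical and not part of this patch; only the `--cuda-gpu-arch=sm_NN` flag form is taken from the diff above. The snippet only prints what it would build, so it can be tried with `cmake -P`.

```cmake
# Hypothetical list of compute capabilities; the patch itself uses the
# single value LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY.
set(nvptx_compute_capabilities 30 35 50)

foreach(sm ${nvptx_compute_capabilities})
  # One arch flag per capability, in the same form as the CUDA_ARCH line.
  set(cuda_arch_flag --cuda-gpu-arch=sm_${sm})
  # Hypothetical output name: one bitcode file per architecture.
  set(bc_output libomptarget-nvptx-sm_${sm}.bc)
  message(STATUS "Would build ${bc_output} with ${cuda_arch_flag}")
endforeach()
```

In the real CMakeLists.txt the message() call would be replaced by whatever compile and link commands the patch already uses for the single-architecture case.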
Repository:
rL LLVM
https://reviews.llvm.org/D14254