[PATCH] D14254: [OpenMP] Initial implementation of OpenMP offloading library - libomptarget device RTLs.

Guansong Zhang via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jan 12 09:37:39 PST 2018


guansong added inline comments.


================
Comment at: libomptarget/deviceRTLs/nvptx/CMakeLists.txt:158
+      set(CUDA_ARCH "")
+      set(CUDA_ARCH --cuda-gpu-arch=sm_${LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY})
+
----------------
For CUDA bitcode (bc) files, a CUDA install ships separate bc files for different architectures, such as

/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_35.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_30.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_50.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_20.10.bc

Should we consider building a separate bc file per architecture for the end user?
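
To make the question concrete, here is a rough sketch of what a per-architecture build loop might look like. It is only an illustration, not what the patch does: the list variable LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES and the output file names are hypothetical, and only the --cuda-gpu-arch=sm_XX flag form is taken from the patch. It can be tried standalone with `cmake -P`.

```cmake
# Hypothetical list of compute capabilities to build for.
# The patch itself uses a single LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY value.
set(LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES 30 35 50)

foreach(sm ${LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES})
  # Same flag form as in the patch, but set once per architecture.
  set(CUDA_ARCH --cuda-gpu-arch=sm_${sm})

  # Hypothetical output name, one bc file per architecture,
  # loosely mirroring how libdevice ships one file per compute_XX.
  set(bc_name libomptarget-nvptx-sm_${sm}.bc)

  # A real build would add the compile and link commands here;
  # this sketch only reports what would be built.
  message(STATUS "would build ${bc_name} with ${CUDA_ARCH}")
endforeach()
```

With something along these lines, the user would pick up the bc file matching their device at link time instead of rebuilding the library for each compute capability.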



Repository:
  rL LLVM

https://reviews.llvm.org/D14254




