[PATCH] D14254: [OpenMP] Initial implementation of OpenMP offloading library - libomptarget device RTLs.
    Guansong Zhang via Phabricator via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Fri Jan 12 09:37:39 PST 2018
guansong added inline comments.
================
Comment at: libomptarget/deviceRTLs/nvptx/CMakeLists.txt:158
+      set(CUDA_ARCH "")
+      set(CUDA_ARCH --cuda-gpu-arch=sm_${LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY})
+
----------------
For CUDA bitcode (bc) files, a CUDA install ships libdevice bc files for several architectures, for example:
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_35.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_30.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_50.10.bc
/usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_20.10.bc
Should we consider building bc files for several architectures for the end user as well, rather than only for the single LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY value?
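
To make the question concrete, here is a rough sketch of a per-architecture loop. It is only an illustration, not part of the patch: the list variable LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES, the chosen values, and the output file name are all assumptions, and the actual compile/link rules are left out.

```cmake
# Hypothetical variable: a list of compute capabilities rather than one value.
set(LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES "35;50;60")

foreach(sm IN LISTS LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITIES)
  # Same flag as in the patch, computed once per architecture.
  set(CUDA_ARCH --cuda-gpu-arch=sm_${sm})
  # Assumed output name; the real per-arch compile/link rules would go here.
  set(bclib_name "libomptarget-nvptx-sm_${sm}.bc")
  message(STATUS "${bclib_name}: ${CUDA_ARCH}")
endforeach()
```

With something along these lines, the end user would get one bc library per listed architecture, similar to how the CUDA install lays out libdevice.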
Repository:
  rL LLVM
https://reviews.llvm.org/D14254