[Openmp-commits] [PATCH] D14254: [OpenMP] Initial implementation of OpenMP offloading library - libomptarget device RTLs.
Gheorghe-Teodor Bercea via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Fri Jan 12 09:49:51 PST 2018
gtbercea added inline comments.
================
Comment at: libomptarget/deviceRTLs/nvptx/CMakeLists.txt:158
+ set(CUDA_ARCH "")
+ set(CUDA_ARCH --cuda-gpu-arch=sm_${LIBOMPTARGET_NVPTX_COMPUTE_CAPABILITY})
+
----------------
guansong wrote:
> For CUDA bc files, a CUDA install will have bc files for different architectures, such as:
>
> /usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_35.10.bc
> /usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_30.10.bc
> /usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_50.10.bc
> /usr/local/cuda-8.0/nvvm/libdevice/libdevice.compute_20.10.bc
>
> Should we consider building different bc files for the end user?
>
What do you mean by that?
Does this patch do what you have in mind? https://reviews.llvm.org/D41724
Repository:
rL LLVM
https://reviews.llvm.org/D14254