[Openmp-commits] [PATCH] D41724: [OpenMP][libomptarget] Enable the compilation of multiple bc libraries for runtime inlining
Gheorghe-Teodor Bercea via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Thu Jan 4 04:50:53 PST 2018
gtbercea created this revision.
gtbercea added reviewers: Hahnfeld, hfinkel, carlo.bertolli, caomhin.
Herald added a subscriber: mgorny.
Different NVIDIA GPUs support different compute capabilities. To enable inlining of runtime functions and get the best performance on each generation of NVIDIA GPUs, a bitcode (bc) library must be compiled for each compute capability. The same compiler build can then be used with multiple generations of NVIDIA GPUs.
To differentiate between versions of the same bc lib, the output file name will contain the compute capability ID.
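As a rough illustration of the naming scheme described above, here is a minimal Python sketch. The patch itself changes CMakeLists.txt, not Python. The base name "libomptarget-nvptx", the "sm_" infix, and both function names are assumptions made for this example and are not taken from the patch.

```python
# Hypothetical sketch: derive one bc library file name per compute capability.
# The base name and the "-sm_<cc>.bc" pattern are assumptions, not the
# literal names used by the patch.

def bc_library_name(compute_capability: int, base: str = "libomptarget-nvptx") -> str:
    """Return a file name that embeds the compute capability ID."""
    return f"{base}-sm_{compute_capability}.bc"


def bc_library_names(compute_capabilities):
    """Map each distinct compute capability to its own bc library file name."""
    return {cc: bc_library_name(cc) for cc in sorted(set(compute_capabilities))}


if __name__ == "__main__":
    # Example list of compute capabilities to build for (illustrative values).
    for cc, name in bc_library_names([35, 60, 70]).items():
        print(f"sm_{cc}: {name}")
```

Because the ID is part of the file name, libraries for several compute capabilities can coexist in one install directory, and the one matching the target GPU can be picked at compile time.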
Repository:
rOMP OpenMP
https://reviews.llvm.org/D41724
Files:
libomptarget/deviceRTLs/nvptx/CMakeLists.txt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D41724.128607.patch
Type: text/x-patch
Size: 5521 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/openmp-commits/attachments/20180104/be764aa1/attachment.bin>