[all-commits] [llvm/llvm-project] 3530c3: [OpenMP] Use CUDA's non-RDC mode when LTO has whol...

Joseph Huber via All-commits all-commits at lists.llvm.org
Sat Apr 23 09:43:03 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 3530c35c660919b9367f1ac598abfb9a569e7606
      https://github.com/llvm/llvm-project/commit/3530c35c660919b9367f1ac598abfb9a569e7606
  Author: Joseph Huber <jhuber6 at vols.utk.edu>
  Date:   2022-04-23 (Sat, 23 Apr 2022)

  Changed paths:
    M clang/test/Driver/linker-wrapper.c
    M clang/tools/clang-linker-wrapper/ClangLinkerWrapper.cpp

  Log Message:
  -----------
  [OpenMP] Use CUDA's non-RDC mode when LTO has whole program visibility

When we do LTO we consider ourselves to have whole program visibility if
every single input file we have contains LLVM bitcode. If we have whole
program visibliity then we can create a single image and utilize CUDA's
non-RDC mode by not passing `-c` to `ptxas` and ignoring the `nvlink`
job. This should be faster for some situations and also saves us the
time executing `nvlink`.

Reviewed By: tra

Differential Revision: https://reviews.llvm.org/D124292




More information about the All-commits mailing list