[PATCH] D142232: Increase inline threshold multiplier to 11 in nvptx backend.

Tue Jan 24 10:57:26 PST 2023

tra added a comment.

In D142232#4076580 <https://reviews.llvm.org/D142232#4076580>, @JackAKirk wrote:

> OK, I've looked into building thrust with clang cuda. Ran into some issues: https://github.com/NVIDIA/thrust/issues/1853

Newer versions of thrust may have issues w/ clang. In the past it regularly needed portability fixes. An old thrust revision f5ea60fd3aa3828c0eb8991a54acdfbed6707bd7 should be buildable w/ clang, though the CMakeFiles there may be too old to support clang as the cuda compiler. If you run into too much trouble, just skip it.

cutlass and nccl may be in better shape. Sorry about being vague -- we do compile cutlass/nccl/thrust with clang, but not always a recent version and we're not relying on cmake to do it, so I can't say what's the state of the official build of those projects when it comes to using clang as a CUDA compiler.
For the purposes of this experiment a quick-and-dirty solution of configuring the build to use nvcc, capturing the commands run by the build, editing them to replace NVCC and NVCC-specific options with clang equivalents, and running those commands as a script may do the trick.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D142232/new/

https://reviews.llvm.org/D142232