[PATCH] D142232: Increase inline threshold multiplier to 11 in nvptx backend.

Tue Jan 24 04:00:12 PST 2023

JackAKirk added a comment.

In D142232#4074418 <https://reviews.llvm.org/D142232#4074418>, @tra wrote:

> I would expect thrust to be close to the worst-case scenario, and it also has a pretty extensive set of tests to compile. If there's no major compile-time regression on thrust tests, I'll be fine with the patch.
>
>> I am currently investigating tensorflow. Will post updates soon.
>
> Keep in mind that GPU-related part of TF compilation is relatively small, compared to the rest of the build, so results there may be noisy. 
> The other projects on the list have more GPU code to compile. They may also be much easier to compile, compared to tensorflow.
>
> You may also add https://github.com/NVIDIA/nccl to the list (It's also part of TF build).

OK, I've looked into building thrust with clang cuda. Ran into some issues: https://github.com/NVIDIA/thrust/issues/1853

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D142232/new/

https://reviews.llvm.org/D142232