[PATCH] D142232: Increase inline threshold multiplier to 11 in nvptx backend.

Artem Belevich via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Jan 23 11:05:22 PST 2023


tra added a comment.

I would expect thrust to be close to the worst-case scenario, and it also has a pretty extensive set of tests to compile. If there's no major compile-time regression on thrust tests, I'll be fine with the patch.

> I am currently investigating tensorflow. Will post updates soon.

Keep in mind that the GPU-related part of the TF compilation is relatively small compared to the rest of the build, so results there may be noisy.
The other projects on the list have more GPU code to compile. They may also be much easier to build than tensorflow.

You may also add https://github.com/NVIDIA/nccl to the list (it's also part of the TF build).


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D142232/new/

https://reviews.llvm.org/D142232


