[PATCH] D142232: Increase inline threshold multiplier to 11 in nvptx backend.
Artem Belevich via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jan 23 11:05:22 PST 2023
tra added a comment.
I would expect thrust to be close to the worst-case scenario, and it also has a pretty extensive set of tests to compile. If there's no major compile-time regression on thrust tests, I'll be fine with the patch.
> I am currently investigating tensorflow. Will post updates soon.
Keep in mind that GPU-related part of TF compilation is relatively small, compared to the rest of the build, so results there may be noisy.
The other projects on the list have more GPU code to compile. They may also be much easier to compile, compared to tensorflow.
You may also add https://github.com/NVIDIA/nccl to the list (It's also part of TF build).
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D142232/new/
https://reviews.llvm.org/D142232
More information about the llvm-commits
mailing list