[Openmp-commits] [PATCH] D95752: [OpenMP][DeviceRTL] Extract shuffle idiom and port it to declare variant
Johannes Doerfert via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Mon Feb 1 15:27:55 PST 2021
jdoerfert added a comment.
In D95752#2535155 <https://reviews.llvm.org/D95752#2535155>, @JonChesterfield wrote:
> We could presumably replace the `#if CUDA_VERSION >= 9000` in the target_impl.cu file (we should rename these!) with variant, orthogonal to this change. Doing that for the five instances, even just within that file, would let us significantly reduce the number of devicertl libraries compiled.
yes, it's case by case though. We should check what ptx version, or other criterion, is a good selector and replace them. Here it was rather easy in the end.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D95752/new/
https://reviews.llvm.org/D95752
More information about the Openmp-commits
mailing list