[PATCH] D120129: [NVPTX] Enhance vectorization of ld.param & st.param
Artem Belevich via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 18 10:35:47 PDT 2022
tra added a comment.
In D120129#3392681 <https://reviews.llvm.org/D120129#3392681>, @yaxunl wrote:
> For HIP, we mark non-kernel device functions with hidden visibility and internalize them in a LLVM pass for -fno-gpu-rdc.
Looks like now we may have a reason to do so for CUDA, too. Could you point me to where we do it for HIP?
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D120129/new/
https://reviews.llvm.org/D120129
More information about the llvm-commits
mailing list