[PATCH] D120129: [NVPTX] Enhance vectorization of ld.param & st.param

Artem Belevich via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 18 10:35:47 PDT 2022


tra added a comment.

In D120129#3392681 <https://reviews.llvm.org/D120129#3392681>, @yaxunl wrote:

> For HIP, we mark non-kernel device functions with hidden visibility and internalize them in an LLVM pass for -fno-gpu-rdc.

It looks like we may now have a reason to do the same for CUDA, too. Could you point me to where we do it for HIP?


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120129/new/

https://reviews.llvm.org/D120129
