[PATCH] D120129: [NVPTX] Enhance vectorization of ld.param & st.param
Johannes Doerfert via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 25 09:18:08 PDT 2022
jdoerfert added a comment.
In D120129#3408174 <https://reviews.llvm.org/D120129#3408174>, @kovdan01 wrote:
> In D120129#3408168 <https://reviews.llvm.org/D120129#3408168>, @jdoerfert wrote:
>
>> You don't need to run it. If you use this command line it doesn't crash?
>
> Yes, I run the same command (with sm_75 instead of sm_80 because I don't have libomptarget-nvptx-sm_80.bc), and it doesn't crash.
So it's not sm_80 but the cuda version that is important. 11.4.0 works fine for me, 11.0.2 breaks also for sm_70.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D120129/new/
https://reviews.llvm.org/D120129
More information about the llvm-commits
mailing list