[PATCH] D120129: [NVPTX] Enhance vectorization of ld.param & st.param

Johannes Doerfert via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Fri Mar 25 09:18:08 PDT 2022


jdoerfert added a comment.

In D120129#3408174 <https://reviews.llvm.org/D120129#3408174>, @kovdan01 wrote:

> In D120129#3408168 <https://reviews.llvm.org/D120129#3408168>, @jdoerfert wrote:
>
>> You don't need to run it. If you use this command line it doesn't crash?
>
> Yes, I run the same command (with sm_75 instead of sm_80 because I don't have libomptarget-nvptx-sm_80.bc), and it doesn't crash.

So it's not sm_80 but the cuda version that is important. 11.4.0 works fine for me, 11.0.2 breaks also for sm_70.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120129/new/

https://reviews.llvm.org/D120129



More information about the cfe-commits mailing list