[Openmp-commits] [PATCH] D127505: [Libomptarget] Add checks for CUDA subarchitecture using new info

Joseph Huber via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Fri Jun 10 08:56:37 PDT 2022


jhuber6 added a comment.

Should I add a test for this? Something like

  clang --offload-arch=sm_35 --offload-arch=sm_37 --offload-arch=sm_50 --offload-arch=sm_52 --offload-arch=sm_53 --offload-arch=sm_60 --offload-arch=sm_61 --offload-arch=sm_62 --offload-arch=sm_70 --offload-arch=sm_72 --offload-arch=sm_75 --offload-arch=sm_80 --offload-arch=sm_86



================
Comment at: openmp/libomptarget/plugins/cuda/src/rtl.cpp:1518
+    if (ArchStr == image->Info.Arch)
+      return true;
+  }
----------------
tianshilei1992 wrote:
> I think here we should check if all devices are compatible instead of one.
> ```
>   if (ArchStr != image->Info.Arch)
>     return true;
> }
> 
> return true.
> ```
Yeah I wasn't sure how to handle this in the case of multiple devices, but it's probably more reasonable to expect each one to match the architecture for it to be compatible. I'll change it.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D127505/new/

https://reviews.llvm.org/D127505



More information about the Openmp-commits mailing list