[PATCH] D95976: [OpenMP] Simplify offloading parallel call codegen

Michael Kruse via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Apr 16 11:36:02 PDT 2021


Meinersbur requested changes to this revision.
Meinersbur added inline comments.
This revision now requires changes to proceed.


================
Comment at: clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp:569-570
+  else
+    ThreadLimit = Bld.CreateNUWSub(RT.getGPUNumThreads(CGF),
+                                   RT.getGPUWarpSize(CGF), "thread_limit");
+  assert(ThreadLimit != nullptr && "Expected non-null ThreadLimit");
----------------
getGPUNumThreads and getGPUWarpSize still have undefined call order.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D95976/new/

https://reviews.llvm.org/D95976



More information about the llvm-commits mailing list