[Openmp-commits] [PATCH] D95976: [OpenMP] Simplify offloading parallel call codegen
Michael Kruse via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Fri Apr 16 11:36:02 PDT 2021

Meinersbur requested changes to this revision.
Meinersbur added inline comments.
This revision now requires changes to proceed.
================
Comment at: clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp:569-570
+  else
+    ThreadLimit = Bld.CreateNUWSub(RT.getGPUNumThreads(CGF),
+                                   RT.getGPUWarpSize(CGF), "thread_limit");
+  assert(ThreadLimit != nullptr && "Expected non-null ThreadLimit");
----------------
getGPUNumThreads and getGPUWarpSize are both evaluated as arguments of the same CreateNUWSub call, so their call order is still unspecified.
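
One way to pin the order down is to bind each result to a local variable before the subtraction is built. The sketch below is a standalone illustration, not the actual patch: emitNumThreads, emitWarpSize, createSub and EmittedCalls are hypothetical stand-ins for the RT.getGPUNumThreads, RT.getGPUWarpSize and Bld.CreateNUWSub calls in the hunk above, and the values 128 and 32 are made up.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical log that records the order in which the helpers run.
// It stands in for the order in which instructions would be emitted.
static std::vector<std::string> EmittedCalls;

// Hypothetical stand-in for RT.getGPUNumThreads(CGF); 128 is a made-up value.
static int emitNumThreads() {
  EmittedCalls.push_back("num_threads");
  return 128;
}

// Hypothetical stand-in for RT.getGPUWarpSize(CGF); 32 is a made-up value.
static int emitWarpSize() {
  EmittedCalls.push_back("warp_size");
  return 32;
}

// Hypothetical stand-in for Bld.CreateNUWSub(LHS, RHS, ...).
static int createSub(int LHS, int RHS) {
  assert(LHS >= RHS && "subtraction must not wrap");
  EmittedCalls.push_back("sub");
  return LHS - RHS;
}

// Both helpers are arguments of one call, so C++ leaves their relative
// evaluation order unspecified; compilers may legitimately differ.
static int threadLimitUnsequenced() {
  return createSub(emitNumThreads(), emitWarpSize());
}

// Separate statements are sequenced: emitNumThreads always runs first,
// then emitWarpSize, then createSub.
static int threadLimitSequenced() {
  int NumThreads = emitNumThreads();
  int WarpSize = emitWarpSize();
  return createSub(NumThreads, WarpSize);
}
```

Both functions return the same value, but only threadLimitSequenced guarantees the order of the entries in EmittedCalls, which is the property that matters when the helpers emit code as a side effect.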
Repository:
  rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D95976/new/
https://reviews.llvm.org/D95976