[Openmp-commits] [PATCH] D95976: [OpenMP] Simplify offloading parallel call codegen
Michael Kruse via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Fri Apr 16 11:36:02 PDT 2021
Meinersbur requested changes to this revision.
Meinersbur added inline comments.
This revision now requires changes to proceed.
================
Comment at: clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp:569-570
+  else
+    ThreadLimit = Bld.CreateNUWSub(RT.getGPUNumThreads(CGF),
+                                   RT.getGPUWarpSize(CGF), "thread_limit");
+  assert(ThreadLimit != nullptr && "Expected non-null ThreadLimit");
----------------
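getGPUNumThreads and getGPUWarpSize are both called in the argument list of CreateNUWSub, so their call order is still unspecified; the order of the emitted IR can differ between host compilers.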
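
A minimal, self-contained sketch of the issue and one possible fix. It does not use the LLVM API: emitNumThreads, emitWarpSize, emitSub and the values 128 and 32 are hypothetical stand-ins for the two runtime helpers and for CreateNUWSub. Each stand-in records its own call, much as the real helpers append instructions to the function being built. Hoisting the two calls into separate statements sequences them, so the recorded order no longer depends on the compiler.

```cpp
#include <string>
#include <vector>

// Hypothetical log of "emitted instructions", in emission order.
static std::vector<std::string> Emitted;

// Hypothetical stand-ins for the two helpers; each has a visible side effect.
static int emitNumThreads() { Emitted.push_back("num_threads"); return 128; }
static int emitWarpSize() { Emitted.push_back("warp_size"); return 32; }

// Hypothetical stand-in for the subtraction builder call.
static int emitSub(int LHS, int RHS) {
  Emitted.push_back("sub");
  return LHS - RHS;
}

// Writing emitSub(emitNumThreads(), emitWarpSize()) would leave the order of
// the two inner calls unspecified. Separate statements fix the order.
static int emitThreadLimit() {
  int NumThreads = emitNumThreads(); // always recorded first
  int WarpSize = emitWarpSize();     // always recorded second
  return emitSub(NumThreads, WarpSize);
}
```

Calling emitThreadLimit() records num_threads, warp_size, sub in that order on every conforming compiler, whereas the nested form may swap the first two entries.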
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D95976/new/
https://reviews.llvm.org/D95976