[Openmp-commits] [PATCH] D92463: [libomptarget][AMDGPU] Remove MaxParallelLevel
Jon Chesterfield via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Wed Dec 2 05:31:40 PST 2020
JonChesterfield accepted this revision.
JonChesterfield added a comment.
This revision is now accepted and ready to land.
Very good, thank you.
Some context. Nested parallel is implemented by a transform to sequential execution. This transform uses linked call frames, allocated by malloc from the gpu within deviceRTL.
A previous version of amdgpu's deviceRTL had a buggy malloc implementation. The code that this patch deletes worked around that by allocating the gpu memory on the host instead.
That work around is now gone from the downstream fork (because malloc is no longer buggy), thus also removed from here.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D92463/new/
https://reviews.llvm.org/D92463
More information about the Openmp-commits
mailing list