[Openmp-commits] [PATCH] D102107: [OpenMP] Codegen aggregate for outlined function captures
Alexey Bataev via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Thu Jun 17 06:40:18 PDT 2021
ABataev added a comment.
In D102107#2823706 <https://reviews.llvm.org/D102107#2823706>, @jdoerfert wrote:
> In D102107#2821976 <https://reviews.llvm.org/D102107#2821976>, @ABataev wrote:
>
>> We used this kind of codegen initially but later found out that it causes a large overhead when gathering pointers into a record. What about hybrid scheme where the first args are passed as arguments and others (if any) are gathered into a record?
>
> I'm confused, maybe I misunderstand the problem. The parallel function arguments need to go from the main thread to the workers somehow, I don't see how this is done w/o a record. This patch makes it explicit though.
Pass it in a record for workers only? And use a hybrid scheme for all other parallel regions.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D102107/new/
https://reviews.llvm.org/D102107
More information about the Openmp-commits
mailing list