[all-commits] [llvm/llvm-project] a349ac: [flang][OpenMP] Allocate `allocatable` init temps ...
Kareem Ergawy via All-commits
all-commits at lists.llvm.org
Fri Oct 17 01:51:49 PDT 2025
Branch: refs/heads/users/ergawy/alloc_on_stack
Home: https://github.com/llvm/llvm-project
Commit: a349ac2be7e54c65193e9c70cdcbd3ca8a7a4248
https://github.com/llvm/llvm-project/commit/a349ac2be7e54c65193e9c70cdcbd3ca8a7a4248
Author: ergawy <kareem.ergawy at amd.com>
Date: 2025-10-17 (Fri, 17 Oct 2025)
Changed paths:
M flang/lib/Lower/Support/PrivateReductionUtils.cpp
M flang/test/Lower/OpenMP/DelayedPrivatization/target-private-allocatable.f90
Log Message:
-----------
[flang][OpenMP] Allocate `allocatable` init temps on the stack for GPUs
Temps needed for the allocatable reduction/privatization init regions
are currently always allocated on the heap. However, this is a performance
killer for GPUs since malloc calls are prohibitively expensive.
Therefore, we should do these allocations on the stack for GPU reductions.
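
The decision described above can be sketched roughly as follows. This is a
minimal, standalone illustration and not the code in PrivateReductionUtils.cpp:
the enum, both function names, and the `isTargetGPU` flag are hypothetical.
The only real names are `fir.alloca` and `fir.allocmem`, the FIR operations
for stack and heap allocation; whether the patch emits exactly these ops is
an assumption.

```cpp
#include <string>

// Hypothetical names throughout; this is not the flang lowering API.
enum class TempStorage { Heap, Stack };

// Assumption: the caller already knows whether it is lowering for a GPU target.
TempStorage chooseInitTempStorage(bool isTargetGPU) {
  // malloc is very expensive on GPUs, so prefer the stack there;
  // keep the existing heap behaviour everywhere else.
  return isTargetGPU ? TempStorage::Stack : TempStorage::Heap;
}

// Maps each choice to the FIR op that usually performs that kind of allocation.
std::string allocationOpFor(TempStorage storage) {
  return storage == TempStorage::Stack ? "fir.alloca" : "fir.allocmem";
}
```

A stack temp also needs no matching free at the end of the region, which a
heap temp does.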