[PATCH] D145586: [AMDGPU] Tweak PromoteAlloca limits
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 8 11:34:30 PST 2023
rampitec added a comment.
Increasing the limit may result in more spilling in other cases. In general, good performance testing is needed to judge whether this is beneficial.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp:181
- MaxVGPRs = ST.getMaxNumVGPRs(ST.getWavesPerEU(F).first);
- // A non-entry function has only 32 caller preserved registers.
- // Do not promote alloca which will force spilling.
----------------
Isn't this still true? You might not see alloca, but you will see spills.
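To make the concern concrete, here is a minimal standalone sketch of the cap that the removed comment describes. It is not the pass's actual code: `promoteAllocaVGPRBudget`, `fitsInBudget`, and `kNonEntryVGPRCap` are hypothetical names, the occupancy-derived maximum is passed in as a plain integer instead of being queried from the subtarget, and the size check assumes the whole budget is available to one alloca, which the real pass may not.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical constant: the review comment states that a non-entry
// function has only 32 caller-preserved registers.
constexpr unsigned kNonEntryVGPRCap = 32;

// Hypothetical helper: the VGPR budget considered when deciding whether
// to promote an alloca. `maxVGPRsForOccupancy` stands in for the value
// the subtarget would report for the function's waves-per-EU setting.
constexpr unsigned promoteAllocaVGPRBudget(unsigned maxVGPRsForOccupancy,
                                           bool isEntryFunction) {
  return isEntryFunction
             ? maxVGPRsForOccupancy
             : std::min(maxVGPRsForOccupancy, kNonEntryVGPRCap);
}

// Simplifying assumption: one VGPR holds 4 bytes per lane, and the
// entire budget may be spent on a single promoted alloca.
constexpr bool fitsInBudget(unsigned allocaBytes, unsigned budgetVGPRs) {
  return (allocaBytes + 3) / 4 <= budgetVGPRs;
}

// An 8 x i64 alloca is 64 bytes, i.e. 16 registers under this model.
static_assert(fitsInBudget(64, promoteAllocaVGPRBudget(256, false)),
              "16 registers fit under the 32-register cap");
// A 256-byte alloca needs 64 registers: fine in an entry function with a
// 256-register budget, but over the cap in a non-entry function.
static_assert(fitsInBudget(256, promoteAllocaVGPRBudget(256, true)),
              "fits in an entry function");
static_assert(!fitsInBudget(256, promoteAllocaVGPRBudget(256, false)),
              "exceeds the non-entry cap");
```

Under this model, dropping the cap would let the 256-byte case be promoted in a non-entry function, where the extra registers would then have to be spilled instead.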
================
Comment at: llvm/test/CodeGen/AMDGPU/vector-alloca-limits.ll:6
-; OPT-LABEL: @alloca_8xi64_max1024(
-; OPT-NOT: alloca
----------------
I would not drop or change existing tests, but rather add new ones as needed.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D145586/new/
https://reviews.llvm.org/D145586