[llvm] [AMDGPU] Use table strategy for LowerModuleLDSPass at O0 (PR #160181)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Sep 22 22:21:46 PDT 2025
================
@@ -588,14 +588,17 @@ class AMDGPULowerModuleLDS {
return OrderedKernels;
}
- static void partitionVariablesIntoIndirectStrategies(
+ void partitionVariablesIntoIndirectStrategies(
Module &M, LDSUsesInfoTy const &LDSUsesInfo,
VariableFunctionMap &LDSToKernelsThatNeedToAccessItIndirectly,
DenseSet<GlobalVariable *> &ModuleScopeVariables,
DenseSet<GlobalVariable *> &TableLookupVariables,
DenseSet<GlobalVariable *> &KernelAccessVariables,
DenseSet<GlobalVariable *> &DynamicVariables) {
+ if (TM.getOptLevel() == CodeGenOptLevel::None)
+ LoweringKindLoc = LoweringKind::table;
----------------
arsenm wrote:
This should not override the explicit flag. This also seems like a dubious way to avoid going over the limit; we can't rely on other optimizations without -O0 either. It it possible to compute the size usage with the different strategies before committing to one?
https://github.com/llvm/llvm-project/pull/160181
More information about the llvm-commits
mailing list