[llvm] [AMDGPU] Use table strategy for LowerModuleLDSPass at O0 (PR #160181)

Jon Chesterfield via llvm-commits llvm-commits at lists.llvm.org
Tue Sep 30 13:35:12 PDT 2025


JonChesterfield wrote:

There is an underlying error here. We are too eager to promote variables to the structure that is allocated at address zero and that can lead to allocating variables in kernels that should not do so.

There's an easy correctness fix, I'm still considering whether there's a reasonable way to get a better result. Forcing table mode for everything would get you correct behaviour I believe, but you should get performance regressions from it relative to fixing the underlying error

https://github.com/llvm/llvm-project/pull/160181


More information about the llvm-commits mailing list