[llvm] [AMDGPU] Introduce "amdgpu-sw-lower-lds" pass to lower LDS accesses. (PR #87265)
via llvm-commits
llvm-commits at lists.llvm.org
Thu Aug 22 21:03:21 PDT 2024
b-sumner wrote:
> > The runtime doesn't split the dispatch into machine-sized chunks. If it does have a limit, then it is probably much larger than we want to allocate for.
>
> I thought it already had to do this if stack was enabled to avoid going over a device wide limit
Yes, there is a special mode when scratch space is low but something like that would not be desirable to impose on every dispatch.
https://github.com/llvm/llvm-project/pull/87265
More information about the llvm-commits
mailing list