[clang] [llvm] [mlir] [AMDGPU] Add a new amdgcn.load.to.lds intrinsic (PR #137425)
Jay Foad via cfe-commits
cfe-commits at lists.llvm.org
Tue Apr 29 01:50:48 PDT 2025
jayfoad wrote:
> I still think we need an intrinsic here because a load + an addtid store can be scheduled much different from the asynchronous "gather to LDS" - and because we don't want this load/store to not be optimized
IMO the intrinsic should only be added as a last resort if we really can't get the pattern based codegen to work well enough.
https://github.com/llvm/llvm-project/pull/137425
More information about the cfe-commits
mailing list