[llvm] [AMDGPU] Set glc/slc on volatile/nontemporal SMEM loads (PR #77443)

Carl Ritson via llvm-commits llvm-commits at lists.llvm.org
Tue Jan 9 21:33:40 PST 2024


https://github.com/perlfu commented:

The code changes make sense to me.
It does seem the memory model does not explicitly cover SMEM at the moment.
Adding cache bypass bits certainly shouldn't make the code gen any less correct.

Not really a problem with this patch, but I cannot convince myself that `RT_NT` is the right temporal hint on GFX12 for volatile -- I would have expected perhaps `LU`. It seems some kind of scope might also be needed.

https://github.com/llvm/llvm-project/pull/77443

