[llvm] [AMDGPU] Set glc/slc on volatile/nontemporal SMEM loads (PR #77443)
Tony Tye via llvm-commits
llvm-commits at lists.llvm.org
Thu Jan 11 15:39:11 PST 2024
================
@@ -5813,6 +5813,18 @@ in table :ref:`amdgpu-amdhsa-memory-model-code-sequences-gfx6-gfx9-table`.
be reordered by
hardware.
+ load *none* *none* - constant - !volatile & !nontemporal
----------------
t-tye wrote:
AFAIK we do not generate scalar stores as the instruction was removed from the isa.
I do not think you need to say "only is the address is uniform" as the compiler cannot generate scalar loads unless that condition is already met.
https://github.com/llvm/llvm-project/pull/77443
More information about the llvm-commits
mailing list