[llvm] [AMDGPU] Set glc/slc on volatile/nontemporal SMEM loads (PR #77443)

Tony Tye via llvm-commits llvm-commits at lists.llvm.org
Thu Jan 11 15:39:11 PST 2024


================
@@ -5813,6 +5813,18 @@ in table :ref:`amdgpu-amdhsa-memory-model-code-sequences-gfx6-gfx9-table`.
                                                               be reordered by
                                                               hardware.
 
+     load         *none*       *none*         - constant - !volatile & !nontemporal
----------------
t-tye wrote:

AFAIK we do not generate scalar stores as the instruction was removed from the isa.

I do not think you need to say "only is the address is uniform" as the compiler cannot generate scalar loads unless that condition is already met.

https://github.com/llvm/llvm-project/pull/77443


More information about the llvm-commits mailing list