[clang] [flang] [llvm] [AMDGPU] Use a target feature to enable __builtin_amdgcn_global_load_lds on gfx9/10 (PR #133055)

Juan Manuel Martinez CaamaƱo via llvm-commits llvm-commits at lists.llvm.org
Mon Mar 31 05:12:05 PDT 2025


================
@@ -260,7 +260,7 @@ AMDGPUTargetInfo::AMDGPUTargetInfo(const llvm::Triple &Triple,
 
   MaxAtomicPromoteWidth = MaxAtomicInlineWidth = 64;
   CUMode = !(GPUFeatures & llvm::AMDGPU::FEATURE_WGP);
-  for (auto F : {"image-insts", "gws"})
+  for (auto F : {"image-insts", "gws", "mem-to-lds-load-insts"})
----------------
jmmartinez wrote:

I wasn't aware there were `buffer_store_lds` instructions. They seem to be removed after gfx940 and not available on gfx10.

I've renamed the attribute to match "vmem-to-lds-load-insts".

https://github.com/llvm/llvm-project/pull/133055


More information about the llvm-commits mailing list