[Mlir-commits] [mlir] [mlir][ROCDL] Add tensor load and store instructions to ROCDL (PR #165016)

Alan Li llvmlistbot at llvm.org
Mon Oct 27 12:45:55 PDT 2025


================
@@ -663,6 +663,82 @@ def ROCDL_GlobalLoadLDSOp :
   }];
 }
 
+//===---------------------------------------------------------------------===//
+// Tensor load/store intrinsics (available in GFX1250)
+//===---------------------------------------------------------------------===//
+
+def ROCDL_TensorLoadToLDSIntrOp :
+  ROCDL_IntrOp<"tensor.load.to.lds", [], [], [], 0, 0, 1, 0, [4], ["cachePolicy"]> {
+  dag args = (ins Arg<LLVM_VectorOf<I32>, "", [MemRead]>:$dgroup0,
----------------
lialan wrote:

I believe here is a design question: per the manual, `TENSOR_LOAD_TO_LDS` will be decomposed into a series of `GLOBAL_LOAD_ASYNC_TO_LDS` internally as it is using TDM. So it is intrinsically doing something more.

But looking around, I don't see other ROCDL ops that works on mem actually have related memory traits.  So I guess it is okay to skip it.

https://github.com/llvm/llvm-project/pull/165016


More information about the Mlir-commits mailing list