[Mlir-commits] [mlir] [mlir][ROCDL] Add tensor load and store instructions to ROCDL (PR #165016)
Alan Li
llvmlistbot at llvm.org
Mon Oct 27 12:45:55 PDT 2025
================
@@ -663,6 +663,82 @@ def ROCDL_GlobalLoadLDSOp :
}];
}
+//===---------------------------------------------------------------------===//
+// Tensor load/store intrinsics (available in GFX1250)
+//===---------------------------------------------------------------------===//
+
+def ROCDL_TensorLoadToLDSIntrOp :
+ ROCDL_IntrOp<"tensor.load.to.lds", [], [], [], 0, 0, 1, 0, [4], ["cachePolicy"]> {
+ dag args = (ins Arg<LLVM_VectorOf<I32>, "", [MemRead]>:$dgroup0,
----------------
lialan wrote:
I believe there is a design question here: per the manual, `TENSOR_LOAD_TO_LDS` is decomposed internally into a series of `GLOBAL_LOAD_ASYNC_TO_LDS` operations, since it uses TDM. So it is intrinsically doing more than a single load.
But looking around, I don't see other ROCDL ops that work on memory actually carrying the related memory traits. So I guess it is okay to skip it.
https://github.com/llvm/llvm-project/pull/165016