[Mlir-commits] [mlir] [MLIR][AMDGPU] Add amdgpu.global_transpose_load op for gfx1200+ global memory transpose loads (PR #195287)

Jakub Kuderski llvmlistbot at llvm.org
Sat May 2 07:47:20 PDT 2026


================
@@ -1079,6 +1079,43 @@ LogicalResult TransposeLoadOp::verify() {
   return success();
 }
 
+//===----------------------------------------------------------------------===//
+// GlobalTransposeLoadOp
+//===----------------------------------------------------------------------===//
+
+LogicalResult GlobalTransposeLoadOp::verify() {
+  MemRefType srcType = cast<MemRefType>(getSrc().getType());
+
+  if (!hasGlobalMemorySpace(srcType.getMemorySpace()))
+    return emitOpError("source memory address space must be Global");
+
+  auto resultType = cast<VectorType>(getType());
+  size_t numElements = resultType.getNumElements();
+  size_t elementTypeSize = resultType.getElementType().getIntOrFloatBitWidth();
+
+  // ElementSize -> NumElements. Chipset gating (gfx1200 vs gfx1250) is
+  // enforced in the lowering.
+  const llvm::SmallDenseMap<size_t, size_t> kValidLoadSizeMap = {
----------------
kuhar wrote:

```suggestion
  static const llvm::SmallDenseMap<size_t, size_t> kValidLoadSizeMap = {
```
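
A minimal sketch of what the suggestion changes, under stated assumptions: it uses `std::unordered_map` in place of `llvm::SmallDenseMap` so that it compiles without LLVM, and the names `SizeMap`, `gBuildCount`, `buildTable`, `isValidRebuilt`, and `isValidCached` as well as the table entries are hypothetical placeholders, not taken from the PR. A non-static local table is constructed on every call, whereas a `static const` local is constructed once, on the first call.

```cpp
#include <cstddef>
#include <unordered_map>

// Hypothetical stand-in for the verifier's table type
// (the PR uses llvm::SmallDenseMap<size_t, size_t>).
using SizeMap = std::unordered_map<std::size_t, std::size_t>;

// Counts how often the table is constructed; for demonstration only.
static int gBuildCount = 0;

// Builds the element-bit-width -> element-count table.
// The entries are placeholders, not the values from the PR.
static SizeMap buildTable() {
  ++gBuildCount;
  return {{8, 8}, {16, 4}};
}

// Non-static local: the table is rebuilt on every call.
bool isValidRebuilt(std::size_t bitWidth, std::size_t numElements) {
  const SizeMap table = buildTable();
  auto it = table.find(bitWidth);
  return it != table.end() && it->second == numElements;
}

// Static local: the table is built once, the first time control
// reaches the declaration, and reused by later calls.
bool isValidCached(std::size_t bitWidth, std::size_t numElements) {
  static const SizeMap table = buildTable();
  auto it = table.find(bitWidth);
  return it != table.end() && it->second == numElements;
}
```

Calling `isValidRebuilt` twice increments `gBuildCount` twice, while any number of calls to `isValidCached` increments it once. Since C++11 the initialization of a function-local static is also thread-safe.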

https://github.com/llvm/llvm-project/pull/195287

