[Mlir-commits] [mlir] [MLIR][AMDGPU] Add amdgpu.global_transpose_load op for gfx1200+ global memory transpose loads (PR #195287)
Jakub Kuderski
llvmlistbot at llvm.org
Sat May 2 07:47:20 PDT 2026
================
@@ -1079,6 +1079,43 @@ LogicalResult TransposeLoadOp::verify() {
return success();
}
+//===----------------------------------------------------------------------===//
+// GlobalTransposeLoadOp
+//===----------------------------------------------------------------------===//
+
+LogicalResult GlobalTransposeLoadOp::verify() {
+ MemRefType srcType = cast<MemRefType>(getSrc().getType());
+
+ if (!hasGlobalMemorySpace(srcType.getMemorySpace()))
+ return emitOpError("source memory address space must be Global");
+
+ auto resultType = cast<VectorType>(getType());
+ size_t numElements = resultType.getNumElements();
+ size_t elementTypeSize = resultType.getElementType().getIntOrFloatBitWidth();
+
+ // ElementSize -> NumElements. Chipset gating (gfx1200 vs gfx1250) is
+ // enforced in the lowering.
+ const llvm::SmallDenseMap<size_t, size_t> kValidLoadSizeMap = {
----------------
kuhar wrote:
```suggestion
static const llvm::SmallDenseMap<size_t, size_t> kValidLoadSizeMap = {
```
https://github.com/llvm/llvm-project/pull/195287
More information about the Mlir-commits
mailing list