[Mlir-commits] [mlir] [MLIR][NVGPU] Improve and Cleanup verifier of TMA OPs (PR #70923)

Guray Ozen llvmlistbot at llvm.org
Wed Nov 1 09:55:14 PDT 2023


================
@@ -25,6 +25,8 @@ constexpr int kWarpSize = 32;
 
 /// M size of wgmma.mma_async instruction
 constexpr int kWgmmaSizeM = 64;
+/// Maximum tensor dimension that TMA supports
+constexpr int kMaxTMATensorDimension = 5;
----------------
grypp wrote:

tma supports max 5dim
[see the instruction](https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cp-async-bulk-tensor)

https://github.com/llvm/llvm-project/pull/70923


More information about the Mlir-commits mailing list