[Mlir-commits] [mlir] [MLIR][NVGPU] Improve and Cleanup verifier of TMA OPs (PR #70923)
Guray Ozen
llvmlistbot at llvm.org
Wed Nov 1 09:55:14 PDT 2023
================
@@ -25,6 +25,8 @@ constexpr int kWarpSize = 32;
/// M size of wgmma.mma_async instruction
constexpr int kWgmmaSizeM = 64;
+/// Maximum tensor dimension that TMA supports
+constexpr int kMaxTMATensorDimension = 5;
----------------
grypp wrote:
tma supports max 5dim
[see the instruction](https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cp-async-bulk-tensor)
https://github.com/llvm/llvm-project/pull/70923
More information about the Mlir-commits
mailing list