[llvm] [MLIR][NVGPU] Introduce `nvgpu.wargroup.mma.store` Op for Hopper GPUs (PR #65441)
via llvm-commits
llvm-commits at lists.llvm.org
Wed Oct 4 07:20:32 PDT 2023
================
@@ -1306,6 +1319,116 @@ struct NVGPUWarpgroupMmaOpLowering
}
};
+struct NVGPUWarpgroupMmaStoreOpLowering
+ : public ConvertOpToLLVMPattern<nvgpu::WarpgroupMmaStoreOp> {
+ using ConvertOpToLLVMPattern<
+ nvgpu::WarpgroupMmaStoreOp>::ConvertOpToLLVMPattern;
+
+ /// This function stores a fragmented register matrix owned by a warp group
+ /// (128 threads) into a memref. Each thread has 64 registers, each the size
+ /// of a struct.
+ /// Here is what each threads (T) holds, each `d` is struct value with a
+ /// number.
+ ///
+ /// Threads in warp-group (128 threads) and what they owns in the matriD:
----------------
qcolombet wrote:
Nit: matri**x**D
https://github.com/llvm/llvm-project/pull/65441
More information about the llvm-commits
mailing list