[Mlir-commits] [mlir] [MLIR][NVVM] Add Ops for tcgen05 cp and shift (PR #127798)
Guray Ozen
llvmlistbot at llvm.org
Thu Feb 20 04:27:04 PST 2025
================
@@ -2810,6 +2810,113 @@ def NVVM_Tcgen05CommitOp : NVVM_Op<"tcgen05.commit"> {
}];
}
+def NVVM_Tcgen05ShiftOp : NVVM_Op<"tcgen05.shift"> {
+ let summary = "Tcgen05 shift operation";
+ let description = [{
+ The `tcgen05.shift` is an asynchronous instruction which initiates
+ the shifting of 32-byte elements downwards across all the rows,
+ except the last, by one row. The operand `taddr` specifies the base
+ address of the matrix in Tensor Memory whose rows must be down shifted.
+ [For more information refer to the PTX ISA]
+ (https://docs.nvidia.com/cuda/parallel-thread-execution/#tcgen05-instructions-tcgen05-shift)
----------------
grypp wrote:
```suggestion
[For more information, see PTX ISA](https://docs.nvidia.com/cuda/parallel-thread-execution/#tcgen05-instructions-tcgen05-shift)
```
https://github.com/llvm/llvm-project/pull/127798
More information about the Mlir-commits
mailing list