[llvm] [LLVM][NVPTX] Add movmatrix intrinsic and PTX instruction support (PR #190109)
Alex MacLean via llvm-commits
llvm-commits at lists.llvm.org
Thu Apr 2 10:55:04 PDT 2026
================
@@ -3936,6 +3936,34 @@ an event.
For more information on the pmevent instructions, refer to the `PTX ISA
<https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#miscellaneous-instructions-pmevent>`__.
+movmatrix Intrinsics
----------------
AlexMaclean wrote:
This category seems a little too narrow. Maybe we should follow PTX and define something like "warp level matrix for mma Intrinsics" https://docs.nvidia.com/cuda/parallel-thread-execution/#warp-level-matrix-instructions-for-mma
https://github.com/llvm/llvm-project/pull/190109
More information about the llvm-commits
mailing list