[all-commits] [llvm/llvm-project] 1b4c85: [NVPTX] Add NVPTX intrinsics for CUDA PTX 6.5 ldma...
Artem Belevich via All-commits
all-commits at lists.llvm.org
Fri Aug 6 16:14:25 PDT 2021
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 1b4c85fc02cc87b4abcd794c98e6ff91a3d3766b
https://github.com/llvm/llvm-project/commit/1b4c85fc02cc87b4abcd794c98e6ff91a3d3766b
Author: Steffen Larsen <steffen.larsen at codeplay.com>
Date: 2021-08-06 (Fri, 06 Aug 2021)
Changed paths:
M llvm/include/llvm/IR/IntrinsicsNVVM.td
M llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
M llvm/test/CodeGen/NVPTX/wmma.py
Log Message:
-----------
[NVPTX] Add NVPTX intrinsics for CUDA PTX 6.5 ldmatrix instructions
Adds NVPTX intrinsics for the CUDA PTX `ldmatrix.sync.aligned` instructions added in PTX 6.5.
PTX ISA description of `ldmatrix.sync.aligned`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#warp-level-matrix-instructions-ldmatrix
Authored-by: Steffen Larsen <steffen.larsen at codeplay.com>
Reviewed By: tra
Differential Revision: https://reviews.llvm.org/D107046
More information about the All-commits
mailing list