[all-commits] [llvm/llvm-project] 1b4c85: [NVPTX] Add NVPTX intrinsics for CUDA PTX 6.5 ldma...

Artem Belevich via All-commits all-commits at lists.llvm.org
Fri Aug 6 16:14:25 PDT 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 1b4c85fc02cc87b4abcd794c98e6ff91a3d3766b
      https://github.com/llvm/llvm-project/commit/1b4c85fc02cc87b4abcd794c98e6ff91a3d3766b
  Author: Steffen Larsen <steffen.larsen at codeplay.com>
  Date:   2021-08-06 (Fri, 06 Aug 2021)

  Changed paths:
    M llvm/include/llvm/IR/IntrinsicsNVVM.td
    M llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
    M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
    M llvm/test/CodeGen/NVPTX/wmma.py

  Log Message:
  -----------
  [NVPTX] Add NVPTX intrinsics for CUDA PTX 6.5 ldmatrix instructions

Adds NVPTX intrinsics for the CUDA PTX `ldmatrix.sync.aligned` instructions added in PTX 6.5.

PTX ISA description of `ldmatrix.sync.aligned`: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#warp-level-matrix-instructions-ldmatrix

Authored-by: Steffen Larsen <steffen.larsen at codeplay.com>

Reviewed By: tra

Differential Revision: https://reviews.llvm.org/D107046




More information about the All-commits mailing list