[all-commits] [llvm/llvm-project] d774b4: [NVPTX, CUDA] Add .and.popc variant of the b1 MMA ...

Artem Belevich via All-commits all-commits at lists.llvm.org
Thu Jul 15 12:02:59 PDT 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: d774b4aa5eac785ffe40009091667521e183df40
      https://github.com/llvm/llvm-project/commit/d774b4aa5eac785ffe40009091667521e183df40
  Author: Artem Belevich <tra at google.com>
  Date:   2021-07-15 (Thu, 15 Jul 2021)

  Changed paths:
    M clang/include/clang/Basic/BuiltinsNVPTX.def
    M clang/lib/CodeGen/CGBuiltin.cpp
    M clang/test/CodeGen/builtins-nvptx-mma.cu
    M clang/test/CodeGen/builtins-nvptx-mma.py
    M llvm/include/llvm/IR/IntrinsicsNVVM.td
    M llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
    M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
    M llvm/test/CodeGen/NVPTX/wmma.py

  Log Message:
  -----------
  [NVPTX, CUDA] Add .and.popc variant of the b1 MMA instruction.

That should allow clang to compile mma.h from CUDA-11.3.

Differential Revision: https://reviews.llvm.org/D105384




More information about the All-commits mailing list