[all-commits] [llvm/llvm-project] d774b4: [NVPTX, CUDA] Add .and.popc variant of the b1 MMA ...
Artem Belevich via All-commits
all-commits at lists.llvm.org
Thu Jul 15 12:02:59 PDT 2021
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: d774b4aa5eac785ffe40009091667521e183df40
https://github.com/llvm/llvm-project/commit/d774b4aa5eac785ffe40009091667521e183df40
Author: Artem Belevich <tra at google.com>
Date: 2021-07-15 (Thu, 15 Jul 2021)
Changed paths:
M clang/include/clang/Basic/BuiltinsNVPTX.def
M clang/lib/CodeGen/CGBuiltin.cpp
M clang/test/CodeGen/builtins-nvptx-mma.cu
M clang/test/CodeGen/builtins-nvptx-mma.py
M llvm/include/llvm/IR/IntrinsicsNVVM.td
M llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
M llvm/test/CodeGen/NVPTX/wmma.py
Log Message:
-----------
[NVPTX, CUDA] Add .and.popc variant of the b1 MMA instruction.
That should allow clang to compile mma.h from CUDA-11.3.
Differential Revision: https://reviews.llvm.org/D105384
More information about the All-commits
mailing list