[PATCH] D105384: [NVPTX, CUDA] Add .and.popc variant of the b1 MMA instruction.

Artem Belevich via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jul 2 17:07:55 PDT 2021


tra created this revision.
tra added a reviewer: steffenlarsen.
Herald added subscribers: bixia, hiraditya, yaxunl, jholewinski.
tra requested review of this revision.
Herald added a subscriber: jdoerfert.
Herald added projects: clang, LLVM.

Extends the changes in D104847 <https://reviews.llvm.org/D104847> and adds another MMA instruction variant and corresponding intrinsics & builtins.

That should allow clang to compile mma.h from CUDA-11.3.

Didn't test it much yet. There may still be some sharp corners.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D105384

Files:
  clang/include/clang/Basic/BuiltinsNVPTX.def
  clang/lib/CodeGen/CGBuiltin.cpp
  clang/test/CodeGen/builtins-nvptx-mma.cu
  clang/test/CodeGen/builtins-nvptx-mma.py
  llvm/include/llvm/IR/IntrinsicsNVVM.td
  llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
  llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
  llvm/test/CodeGen/NVPTX/wmma.py

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D105384.356302.patch
Type: text/x-patch
Size: 32442 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20210703/47ccb68e/attachment.bin>


More information about the llvm-commits mailing list