[PATCH] D60015: [NVPTX] Added intrinsics/instructions for MMA ops on (sub-)integers

Artem Belevich via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 29 14:56:48 PDT 2019


tra created this revision.
tra added a reviewer: timshen.
Herald added subscribers: jdoerfert, bixia, hiraditya, jlebar, sanjoy, jholewinski.
Herald added a project: LLVM.

PTX 6.3 (CUDA-10.0) extends `wmma` instruction to support s8/u8/s4/u4/b1 -> s32.

      

All of the new instructions are still handled mostly by tablegen. I've slightly
refactored the code to drive intrinsic/instruction generation from a master
list of supported variants, so all irregularities have to be implemented in one place only.

      

The test generation script wmma.py has been refactored in a similar way.
I've added additional checks to verify the sanity of the set of tests generated 
by the script for particular PTX and SM combination.


https://reviews.llvm.org/D60015

Files:
  llvm/include/llvm/IR/IntrinsicsNVVM.td
  llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
  llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
  llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
  llvm/test/CodeGen/NVPTX/wmma.py

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D60015.192921.patch
Type: text/x-patch
Size: 50788 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20190329/fee99a32/attachment.bin>


More information about the llvm-commits mailing list