[PATCH] D60015: [NVPTX] Added intrinsics/instructions for MMA ops on (sub-)integers
Artem Belevich via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 29 14:56:48 PDT 2019
tra created this revision.
tra added a reviewer: timshen.
Herald added subscribers: jdoerfert, bixia, hiraditya, jlebar, sanjoy, jholewinski.
Herald added a project: LLVM.
PTX 6.3 (CUDA-10.0) extends `wmma` instruction to support s8/u8/s4/u4/b1 -> s32.
All of the new instructions are still handled mostly by tablegen. I've slightly
refactored the code to drive intrinsic/instruction generation from a master
list of supported variants, so all irregularities have to be implemented in one place only.
The test generation script wmma.py has been refactored in a similar way.
I've added additional checks to verify the sanity of the set of tests generated
by the script for particular PTX and SM combination.
https://reviews.llvm.org/D60015
Files:
llvm/include/llvm/IR/IntrinsicsNVVM.td
llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
llvm/test/CodeGen/NVPTX/wmma.py
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D60015.192921.patch
Type: text/x-patch
Size: 50788 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20190329/fee99a32/attachment.bin>
More information about the llvm-commits
mailing list