[PATCH] D118977: [NVPTX] Add more FMA intriniscs/builtins
Jakub Chlanda via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Feb 4 02:00:08 PST 2022
jchlanda created this revision.
jchlanda added reviewers: tra, csigg, bkramer.
Herald added subscribers: asavonic, hiraditya, jholewinski.
jchlanda requested review of this revision.
Herald added subscribers: llvm-commits, cfe-commits, jdoerfert.
Herald added projects: clang, LLVM.
This patch adds builtins/intrinsics for the following variants of FMA:
- f16
- rn
- rn_ftz
- rn_sat
- rn_ftz_sat
- rn_relu
- rn_ftz_relu
- f16x2
- rn
- rn_ftz
- rn_sat
- rn_ftz_sat
- rn_relu
- rn_ftz_relu
- bf16
- rn
- rn_relu
- bf16x2
- rn
- rn_relu
They all require PTX 7.0, SM_80.
`ptxas` (Cuda compilation tools, release 11.0, V11.0.194) is happy with the generated assembly.
Depends on D<117887>
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D118977
Files:
clang/include/clang/Basic/BuiltinsNVPTX.def
clang/test/CodeGen/builtins-nvptx-native-half-type.c
clang/test/CodeGen/builtins-nvptx.c
llvm/include/llvm/IR/IntrinsicsNVVM.td
llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
llvm/test/CodeGen/NVPTX/math-intrins-sm80-ptx70-instcombine.ll
llvm/test/CodeGen/NVPTX/math-intrins-sm80-ptx70.ll
llvm/test/CodeGen/NVPTX/math-intrins-sm86-ptx72.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D118977.405891.patch
Type: text/x-patch
Size: 40262 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20220204/27834754/attachment.bin>
More information about the llvm-commits
mailing list