[PATCH] D56596: Enable fma formation for fp16 on x86 and aarch64
Florian Hahn via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 11 09:16:50 PST 2019
fhahn added reviewers: SjoerdMeijer, olista01.
fhahn added a comment.
The AArch64 side LGTM. Also adding Oliver and Sjoerd as they worked on FP16 side as well, in case they have any thoughts.
The only public documentation for Arm v8.2+ CPUs I could find indicates FADD,FMUL and FMADD have the same latencies and throughputs on Cortex-A75: https://static.docs.arm.com/101398/0200/arm_cortex_a75_software_optimization_guide_v2.pdf
For the X86, I think a test would be great.
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D56596/new/
https://reviews.llvm.org/D56596
More information about the llvm-commits
mailing list