[PATCH] D56596: Enable fma formation for fp16 on x86 and aarch64

Florian Hahn via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jan 11 09:16:50 PST 2019


fhahn added reviewers: SjoerdMeijer, olista01.
fhahn added a comment.

The AArch64 side LGTM. Also adding Oliver and Sjoerd as they worked on FP16 side as well, in case they have any thoughts.

The only public documentation for Arm v8.2+ CPUs I could find indicates FADD,FMUL and FMADD have the same latencies and throughputs on Cortex-A75:  https://static.docs.arm.com/101398/0200/arm_cortex_a75_software_optimization_guide_v2.pdf

For the X86, I think a test would be great.


Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D56596/new/

https://reviews.llvm.org/D56596





More information about the llvm-commits mailing list