[PATCH] D23583: [AArch64] Add feature has-fast-fma

James Molloy via llvm-commits llvm-commits at lists.llvm.org
Wed Aug 17 06:02:44 PDT 2016


jmolloy added a subscriber: jmolloy.
jmolloy added a comment.

Hi,

I also have concerns here. The TargetLowering hook states:

  /// Return true if target always beneficiates from combining into FMA for a
  /// given value type. This must typically return false on targets where FMA
  /// takes more cycles to execute than *FADD*.

Whereas you say:

  In spite of what the original author intended, I observed that the extra folds are worth it if FMA is as quick *FMUL* instead.

Which is correct? Or are you using this hook in a way the hook users don't intend? The wording used is vague and I really think we need to have more detail about what property of Exynos-M1 makes this good for Exynos but not for any other microarchitecture.

Cheers,

James


Repository:
  rL LLVM

https://reviews.llvm.org/D23583





More information about the llvm-commits mailing list