[PATCH] D23583: [AArch64] Add feature has-fast-fma

Renato Golin via llvm-commits llvm-commits at lists.llvm.org
Wed Aug 17 13:59:23 PDT 2016


rengolin added a comment.

In https://reviews.llvm.org/D23583#518142, @evandro wrote:

> However, if you look at the foldings in `DAGCompiner.cpp` performed only if  `enableAggressiveFMAFusion()` is true you will find that `FMUL` is the pivot of the folding, not `FADD`, in spite of what the comments states.


This makes sense. FMA being faster-than/as-fast-as FADD doesn't make sense.

> For instance, whether it should fold into FMA twice, back to back.


This may be remnants of an ARMv7 issue around OOO cores having a hazard for FMA+FMA...

> As a matter of fact, it seems to me that most "big" targets would benefit from folding into FMA more aggressively.


I tend to agree, but I don't have evidence. The best evidence would be some benchmark results or scheduling descriptions.

I'm not against this change in itself, but I really think this is a matter for the scheduler, not a target feature.


Repository:
  rL LLVM

https://reviews.llvm.org/D23583





More information about the llvm-commits mailing list