[PATCH] D148068: [AArch64] Lower fused complex multiply-add intrinsic to AArch64::FCMA

Hugo Trachino via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Apr 12 18:26:30 PDT 2023


nujaa added a comment.

In D148068#4260805 <https://reviews.llvm.org/D148068#4260805>, @igor.kirillov wrote:

> Could you clarify if there will be additional work in the future? The thing is there is a pass at llvm/lib/CodeGen/ComplexDeinterleaving.cpp that generates FCMLA/FCADD architecture specific intrinsics using TargetLower::createComplexDeinterleavingIR

Hi,
I did not know about this pass, I will look into it and check whether this fits our needs. Thank you.
As additional work within LLVM, we added complex multiplication without accumulator and conjugate recognition to fuse with the FCMLA.
For a bit of context, we are generating this complex code from MLIR where we handle vectors of complex. It currently works but is not ready for upstreaming.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D148068/new/

https://reviews.llvm.org/D148068



More information about the llvm-commits mailing list