[PATCH] D148068: [AArch64] Lower fused complex multiply-add intrinsic to AArch64::FCMA

Wed Apr 12 18:26:30 PDT 2023

nujaa added a comment.

In D148068#4260805 <https://reviews.llvm.org/D148068#4260805>, @igor.kirillov wrote:

> Could you clarify if there will be additional work in the future? The thing is there is a pass at llvm/lib/CodeGen/ComplexDeinterleaving.cpp that generates FCMLA/FCADD architecture specific intrinsics using TargetLower::createComplexDeinterleavingIR

Hi,
I did not know about this pass, I will look into it and check whether this fits our needs. Thank you.
As additional work within LLVM, we added complex multiplication without accumulator and conjugate recognition to fuse with the FCMLA.
For a bit of context, we are generating this complex code from MLIR where we handle vectors of complex. It currently works but is not ready for upstreaming.

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D148068/new/

https://reviews.llvm.org/D148068