[PATCH] D148068: [AArch64] Lower fused complex multiply-add intrinsic to AArch64::FCMA
Hugo Trachino via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Apr 12 18:26:30 PDT 2023
nujaa added a comment.
In D148068#4260805 <https://reviews.llvm.org/D148068#4260805>, @igor.kirillov wrote:
> Could you clarify if there will be additional work in the future? The thing is there is a pass at llvm/lib/CodeGen/ComplexDeinterleaving.cpp that generates FCMLA/FCADD architecture specific intrinsics using TargetLower::createComplexDeinterleavingIR
Hi,
I did not know about this pass, I will look into it and check whether this fits our needs. Thank you.
As additional work within LLVM, we added complex multiplication without accumulator and conjugate recognition to fuse with the FCMLA.
For a bit of context, we are generating this complex code from MLIR where we handle vectors of complex. It currently works but is not ready for upstreaming.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D148068/new/
https://reviews.llvm.org/D148068
More information about the llvm-commits
mailing list