[llvm] [AArch64][SVE] Add dot product lowering for PARTIAL_REDUCE_MLA node (PR #130933)
Benjamin Maxwell via llvm-commits
llvm-commits at lists.llvm.org
Thu Mar 20 05:52:04 PDT 2025
================
@@ -1585,6 +1585,21 @@ AArch64TargetLowering::AArch64TargetLowering(const TargetMachine &TM,
setOperationAction(ISD::MSTORE, VT, Custom);
}
+ for (MVT VT : MVT::integer_scalable_vector_valuetypes()) {
+ if (!EnablePartialReduceNodes)
+ break;
+ for (MVT InnerVT : MVT::integer_scalable_vector_valuetypes()) {
+ ElementCount VTElemCount = VT.getVectorElementCount();
+ if (VTElemCount.getKnownMinValue() == 1)
+ continue;
+ if (VTElemCount * 4 == InnerVT.getVectorElementCount())
+ setPartialReduceMLAAction(VT, InnerVT, Custom);
+ if (InnerVT.getVectorElementType().getSizeInBits() * 4 ==
+ VT.getVectorElementType().getSizeInBits())
+ setPartialReduceMLAAction(VT, InnerVT, Legal);
+ }
+ }
----------------
MacDue wrote:
My understanding is the actual number of type combinations where we would do something special is actually quite small, e.g. something like:
```
// These are lowered directly to UDOT/SDOT.
setPartialReduceMLAAction(MVT::nxv2i64, MVT::nxv8i16, Legal);
setPartialReduceMLAAction(MVT::nxv4i32, MVT::nxv16i8, Legal);
// These can be lowered to wide adds in LowerOperation.
setPartialReduceMLAAction(MVT::nxv2i64, MVT::nxv4i32, Custom);
setPartialReduceMLAAction(MVT::nxv4i32, MVT::nxv8i16, Custom);
setPartialReduceMLAAction(MVT::nxv8i16, MVT::nxv16i8, Custom);
```
https://github.com/llvm/llvm-project/pull/130933
More information about the llvm-commits
mailing list