[llvm] [AArch64] Add partial reduce patterns for new fdot instructions (PR #184659)

David Green via llvm-commits llvm-commits at lists.llvm.org
Thu Mar 5 03:38:05 PST 2026


================
@@ -2031,6 +2038,8 @@ AArch64TargetLowering::AArch64TargetLowering(const TargetMachine &TM,
       // We can use SVE2p1 fdot to emulate the fixed-length variant.
       setPartialReduceMLAAction(ISD::PARTIAL_REDUCE_FMLA, MVT::v4f32,
                                 MVT::v8f16, Custom);
+      setPartialReduceMLAAction(ISD::PARTIAL_REDUCE_FMLA, MVT::v2f32,
+                                MVT::v4f16, Custom);
----------------
davemgreen wrote:

> as it might introduce faulting behaviour for the inactive lanes if the runtime vector length > 128bits.

This isn't a thing. it sounds like a misunderstanding of how strictfp is treated in llvm.  There is not observable  "faulting" fp instructions outside of strict-fp.

https://github.com/llvm/llvm-project/pull/184659


More information about the llvm-commits mailing list