[PATCH] D112548: [LoopVectorize] Propagate fast-math flags for inloop reductions
David Sherwood via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Nov 1 02:32:43 PDT 2021
david-arm added inline comments.
================
Comment at: llvm/test/Transforms/LoopVectorize/reduction-inloop-pred.ll:1020
+; CHECK-NEXT: [[TMP40:%.*]] = select fast <4 x i1> [[TMP0]], <4 x float> [[TMP38]], <4 x float> zeroinitializer
+; CHECK-NEXT: [[TMP41:%.*]] = call fast float @llvm.vector.reduce.fadd.v4f32(float [[VEC_PHI]], <4 x float> [[TMP40]])
+; CHECK-NEXT: [[TMP42:%.*]] = select fast <4 x i1> [[TMP0]], <4 x float> [[TMP39]], <4 x float> zeroinitializer
----------------
Nice! Instcombine has folded the call and the fadd together into one!
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D112548/new/
https://reviews.llvm.org/D112548
More information about the llvm-commits
mailing list