[PATCH] D112548: [LoopVectorize] Propagate fast-math flags for inloop reductions

David Sherwood via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Nov 1 02:32:43 PDT 2021


david-arm added inline comments.


================
Comment at: llvm/test/Transforms/LoopVectorize/reduction-inloop-pred.ll:1020
+; CHECK-NEXT:    [[TMP40:%.*]] = select fast <4 x i1> [[TMP0]], <4 x float> [[TMP38]], <4 x float> zeroinitializer
+; CHECK-NEXT:    [[TMP41:%.*]] = call fast float @llvm.vector.reduce.fadd.v4f32(float [[VEC_PHI]], <4 x float> [[TMP40]])
+; CHECK-NEXT:    [[TMP42:%.*]] = select fast <4 x i1> [[TMP0]], <4 x float> [[TMP39]], <4 x float> zeroinitializer
----------------
Nice! Instcombine has folded the call and the fadd together into one!


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D112548/new/

https://reviews.llvm.org/D112548



More information about the llvm-commits mailing list