[PATCH] D137936: [AArch64] Optimize cmp chain before legalization

Allen zhong via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Nov 21 22:14:08 PST 2022


Allen marked 5 inline comments as done.
Allen added inline comments.


================
Comment at: llvm/lib/Target/AArch64/AArch64ISelLowering.cpp:8614
     // Exit early by inverting the condition, which help reduce indentations.
-    SDValue TVal = DAG.getConstant(1, DL, VT);
-    SDValue FVal = DAG.getConstant(0, DL, VT);
-    AArch64CC::CondCode CC = changeIntCCToAArch64CC(Cond);
-    AArch64CC::CondCode InvCC = AArch64CC::getInvertedCondCode(CC);
-    return DAG.getNode(AArch64ISD::CSEL, DL, VT, FVal, TVal,
-                       DAG.getConstant(InvCC, DL, MVT::i32), CCmp);
+    return DAG.getSetCC(DL, VT, Cmp, DAG.getConstant(0, DL, VT), Cond);
   }
----------------
bcl5980 wrote:
> bcl5980 wrote:
> > Allen wrote:
> > > bcl5980 wrote:
> > > > It looks like this can be simplified further to
> > > > 
> > > > ```
> > > >     unsigned LogicOp = (Cond == ISD::SETEQ) ? ISD::AND : ISD::OR;
> > > >     SDValue Cmp = DAG.getSetCC(DL, VT, XOR0, XOR1, Cond);
> > > >     for (unsigned I = 1; I < WorkList.size(); I++) {
> > > >       std::tie(XOR0, XOR1) = WorkList[I];
> > > >       SDValue CmpChain = DAG.getSetCC(DL, VT, XOR0, XOR1, Cond);
> > > >       Cmp = DAG.getNode(LogicOp, DL, VT, Cmp, CmpChain);
> > > >     }
> > > > 
> > > >     return Cmp;
> > > > ```
> > > > It looks like more cases can benefit from it.
> > > Yes, this change fixes the case @PR58675, but it regresses the case combine_setcc_glue, so I'll need to do more work on it.
> > > ```
> > > +++ b/llvm/test/CodeGen/AArch64/dag-combine-setcc.ll
> > > @@ -191,9 +191,11 @@ define i32 @combine_setcc_glue(i128 noundef %x, i128 noundef %y) {
> > >  ; CHECK-LABEL: combine_setcc_glue:
> > >  ; CHECK:       // %bb.0: // %entry
> > >  ; CHECK-NEXT:    cmp x1, x3
> > > -; CHECK-NEXT:    ccmp x0, x2, #0, eq
> > > -; CHECK-NEXT:    ccmp x0, x2, #4, ne
> > > -; CHECK-NEXT:    cset w0, eq
> > > +; CHECK-NEXT:    cset w8, eq
> > > +; CHECK-NEXT:    cmp x0, x2
> > > +; CHECK-NEXT:    cset w9, eq
> > > +; CHECK-NEXT:    and w8, w9, w8
> > > +; CHECK-NEXT:    orr w0, w8, w9
> > >  ; CHECK-NEXT:    ret
> > > ```
> > Don't worry about the combine_setcc_glue regression. D138401 already fixed that.
> Sorry, the change that fixes combine_setcc_glue is D138398.
Thanks for the fix. I have rebased on that.
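
For readers following the thread, here is a minimal scalar sketch of what the suggested loop computes. It is not LLVM code: `Cond`, `WorkListT`, `foldCmpChain` and `xorOrReference` are hypothetical names made up for illustration, and it assumes the chain being matched is an OR of XORs compared against zero, which is how I read `XOR0`/`XOR1` in the snippet above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical stand-in for ISD::SETEQ / ISD::SETNE.
enum class Cond { EQ, NE };

// Hypothetical stand-in for the work list of (XOR0, XOR1) operand pairs.
using WorkListT = std::vector<std::pair<uint64_t, uint64_t>>;

// Scalar model of the suggested loop: compare each pair with C, then fold
// the results with AND for EQ and with OR for NE.
// Assumes a non-empty work list, as the quoted loop does.
bool foldCmpChain(const WorkListT &WorkList, Cond C) {
  auto Compare = [C](uint64_t L, uint64_t R) {
    return C == Cond::EQ ? L == R : L != R;
  };
  bool Cmp = Compare(WorkList[0].first, WorkList[0].second);
  for (std::size_t I = 1; I < WorkList.size(); ++I) {
    bool CmpChain = Compare(WorkList[I].first, WorkList[I].second);
    Cmp = (C == Cond::EQ) ? (Cmp && CmpChain) : (Cmp || CmpChain);
  }
  return Cmp;
}

// Assumed shape of the original pattern: OR together every (L ^ R) and
// compare the accumulated value against zero.
bool xorOrReference(const WorkListT &WorkList, Cond C) {
  uint64_t Acc = 0;
  for (const auto &P : WorkList)
    Acc |= (P.first ^ P.second);
  return C == Cond::EQ ? Acc == 0 : Acc != 0;
}
```

Under that assumption the two functions agree for both conditions: every pair is equal exactly when the OR of the XORs is zero, which is why AND pairs with SETEQ and OR pairs with SETNE.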


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D137936/new/

https://reviews.llvm.org/D137936


