[PATCH] D137936: [AArch64] Optimize cmp chain when the result is tested for [in]equality with 0
Allen zhong via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Nov 16 07:46:31 PST 2022
Allen marked 2 inline comments as done.
Allen added a comment.
In D137936#3930036 <https://reviews.llvm.org/D137936#3930036>, @dmgreen wrote:
> There is code in DAGCombiner::BackwardsPropagateMask that can propagate And's back to loads, and would usually handle patterns like this but it can't look through any_extends. It has seemed to be useful in the past though.
> You could also imagine transforming `i64 and (any_extend(i32 x), mask)` into `i64 zext(i32 and(x, mask)` under AArch64, as we know the zext will be free. I think that would run into other problems though, as the zext between the And isn't handled for all the BFI cases. Without improving BFI at the same time it would lead to other regressions.
>
> So I'm not sure either of those methods would be better than this, even if they are more general. I think it would be useful to add deliberate tests for this though if we can, especially for the edge cases.
a) visitAND already try the i64 and (any_extend(i32 x), mask)` into `i64 zext(i32 and(x, mask)`, but fail as checking **MaskedValueIsZero**, https://github.com/llvm/llvm-project/blob/main/llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp#L6302
b) I also tryed to adjust `ISD::ANY_EXTEND --> ISD::ZERO_EXTEND` in **DAGTypeLegalizer::PromoteIntOp_ZERO_EXTEND**, then we'll also get the expect zext without AND, but some **x86 cases** regressions for chains of ANDs, so I limited to both operands are Loads.
================
Comment at: llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp:11982
+// (zext (and/or/xor (zextload x, zextload x)))
+SDValue DAGCombiner::CombineZExtLogicopDupExtLoad(SDNode *N) {
+ assert(N->getOpcode() == ISD::AND && "Unexpected opcode");
----------------
dmgreen wrote:
> Why "Dup" in the name? Because there are two loads? When I see Dup I think of vector splats.
Yes, Both operands are Load. I'll rename it to CombineZExtLogicopDoubleExtLoad
================
Comment at: llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp:12013
+ if (!Load0->hasOneUse() || !Load1->hasOneUse() || MemVT0 != MemVT1 ||
+ !DAG.MaskedValueIsZero(N1, Mask))
+ return SDValue();
----------------
dmgreen wrote:
> Does it not need to check other bits about the Mask?
Done, thanks
================
Comment at: llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp:12030
+
+ SDValue Logic = DAG.getNode(ISD::XOR, SDLoc(N), OrigVT, ExtLoad0, ExtLoad1);
+ return DAG.getNode(ISD::ZERO_EXTEND, SDLoc(N), VT, Logic);
----------------
dmgreen wrote:
> Should XOR be Logicop.getOpcode()?
Oh, good catch. thanks
================
Comment at: llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp:20666
// (trunc (srl $1 half-width))
-// (trunc (srl $1 (2 * half-width))) …)
+// (trunc (srl $1 (2 * half-width))) ...)
// to (bitcast $1)
----------------
dmgreen wrote:
> This can be done separately.
ok, https://reviews.llvm.org/D138124
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D137936/new/
https://reviews.llvm.org/D137936
More information about the llvm-commits
mailing list