[PATCH] D147518: [DAGCombiner] Fix (shl (ctlz x) n) for non-power-of-two Data
Sam Elliott via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 4 03:47:12 PDT 2023
lenary created this revision.
Herald added subscribers: ecnelises, steven.zhang, hiraditya.
Herald added a project: All.
lenary requested review of this revision.
Herald added a project: LLVM.
Herald added a subscriber: llvm-commits.
This DAGCombine is not valid for some combinations of the known bits
of x and non-power-of-two widths of x. As shown in the bug:
- The bitwidth of x is 35 (n=5)
- The unknown bits of x is only the least significant bit
- This gives the result of the ctlz two possible values: 34 or 35, both of which will give 1 when left-shifted 5 bits.
- So the `eor x, 1` that this optimisation would give is not correct.
A similar instcombine optimisation is only applied when the width of x is
a power-of-two. GlobalISel does not have this bug, as shown by the testcase.
Fixes #61549
Depends on D147517 <https://reviews.llvm.org/D147517>
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D147518
Files:
llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
llvm/test/CodeGen/AArch64/pr61549.ll
Index: llvm/test/CodeGen/AArch64/pr61549.ll
===================================================================
--- llvm/test/CodeGen/AArch64/pr61549.ll
+++ llvm/test/CodeGen/AArch64/pr61549.ll
@@ -9,7 +9,9 @@
; CHECK-NEXT: sbfx x9, x0, #0, #35
; CHECK-NEXT: sdiv x10, x8, x9
; CHECK-NEXT: msub x8, x10, x9, x8
-; CHECK-NEXT: eor x0, x8, #0x1
+; CHECK-NEXT: clz x8, x8
+; CHECK-NEXT: sub x8, x8, #29
+; CHECK-NEXT: ubfx x0, x8, #5, #30
; CHECK-NEXT: ret
;
; GISEL-LABEL: f:
Index: llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
===================================================================
--- llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+++ llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
@@ -10304,8 +10304,10 @@
return DAG.getNode(ISD::SRL, SDLoc(N), VT, N0.getOperand(0), N1);
}
- // fold (srl (ctlz x), "5") -> x iff x has one bit set (the low bit).
+ // fold (srl (ctlz x), "5") -> x iff x has one bit set (the low bit), and x has a power
+ // of two bitwidth. The "5" represents (log2 (bitwidth x)).
if (N1C && N0.getOpcode() == ISD::CTLZ &&
+ isPowerOf2_32(OpSizeInBits) &&
N1C->getAPIntValue() == Log2_32(OpSizeInBits)) {
KnownBits Known = DAG.computeKnownBits(N0.getOperand(0));
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D147518.510740.patch
Type: text/x-patch
Size: 1263 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230404/c9cc75c6/attachment.bin>
More information about the llvm-commits
mailing list