[PATCH] D93092: [PowerPC] KnownBits should be constant when performing non-sign comparison
Kai Luo via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Dec 11 00:45:26 PST 2020
lkail created this revision.
lkail added reviewers: nemanjai, PowerPC, jsji.
Herald added subscribers: steven.zhang, shchenz, kbarton, hiraditya.
lkail requested review of this revision.
Herald added a project: LLVM.
Herald added a subscriber: llvm-commits.
In `PPCTargetLowering::DAGCombineTruncBoolExt`, when checking if it's correct to perform the transformation for non-sign comparison, as the comment says
// This is neither a signed nor an unsigned comparison, just make sure
// that the high bits are equal.
Origin check
if (Op1Known.Zero != Op2Known.Zero || Op1Known.One != Op2Known.One)
return SDValue();
is not strong enough. For example,
Op1Known = 111x000x;
Op2Known = 111x000x;
Bit 4, besides bit 0, is still unknown and affects the final result.
This patch fixes https://bugs.llvm.org/show_bug.cgi?id=48388.
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D93092
Files:
llvm/lib/Target/PowerPC/PPCISelLowering.cpp
llvm/test/CodeGen/PowerPC/pr48388.ll
Index: llvm/test/CodeGen/PowerPC/pr48388.ll
===================================================================
--- /dev/null
+++ llvm/test/CodeGen/PowerPC/pr48388.ll
@@ -0,0 +1,42 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
+; RUN: llc -verify-machineinstrs -mtriple=powerpc64le -ppc-asm-full-reg-names \
+; RUN: < %s | FileCheck %s
+
+define i64 @julia_div_i64(i64 %0, i64 %1) local_unnamed_addr #0 {
+; CHECK-LABEL: julia_div_i64:
+; CHECK: # %bb.0: # %entry
+; CHECK-NEXT: divd r6, r3, r4
+; CHECK-NEXT: li r5, 32767
+; CHECK-NEXT: sldi r5, r5, 32
+; CHECK-NEXT: oris r7, r5, 40069
+; CHECK-NEXT: oris r5, r5, 40079
+; CHECK-NEXT: cmpdi r3, 0
+; CHECK-NEXT: ori r7, r7, 13456
+; CHECK-NEXT: ori r5, r5, 65264
+; CHECK-NEXT: iselgt r9, r5, r7
+; CHECK-NEXT: cmpdi r4, 0
+; CHECK-NEXT: mulld r8, r6, r4
+; CHECK-NEXT: iselgt r4, r5, r7
+; CHECK-NEXT: xor r4, r9, r4
+; CHECK-NEXT: cntlzd r4, r4
+; CHECK-NEXT: rldicl r4, r4, 58, 63
+; CHECK-NEXT: xor r3, r8, r3
+; CHECK-NEXT: addic r5, r3, -1
+; CHECK-NEXT: subfe r3, r5, r3
+; CHECK-NEXT: and r3, r4, r3
+; CHECK-NEXT: add r3, r6, r3
+; CHECK-NEXT: blr
+entry:
+ %2 = sdiv i64 %0, %1
+ %3 = icmp sgt i64 %0, 0
+ %4 = icmp sgt i64 %1, 0
+ %5 = select i1 %3, i64 140735820070640, i64 140735819363472
+ %6 = select i1 %4, i64 140735820070640, i64 140735819363472
+ %7 = icmp eq i64 %5, %6
+ %8 = mul i64 %2, %1
+ %9 = icmp ne i64 %8, %0
+ %10 = and i1 %7, %9
+ %11 = zext i1 %10 to i64
+ %12 = add i64 %2, %11
+ ret i64 %12
+}
Index: llvm/lib/Target/PowerPC/PPCISelLowering.cpp
===================================================================
--- llvm/lib/Target/PowerPC/PPCISelLowering.cpp
+++ llvm/lib/Target/PowerPC/PPCISelLowering.cpp
@@ -13189,7 +13189,8 @@
Op1Known.Zero.clearBit(0); Op1Known.One.clearBit(0);
Op2Known.Zero.clearBit(0); Op2Known.One.clearBit(0);
- if (Op1Known.Zero != Op2Known.Zero || Op1Known.One != Op2Known.One)
+ if (!Op1Known.isConstant() || !Op2Known.isConstant() ||
+ Op1Known.getConstant() != Op2Known.getConstant())
return SDValue();
}
}
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D93092.311131.patch
Type: text/x-patch
Size: 2198 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20201211/7f3fe71a/attachment.bin>
More information about the llvm-commits
mailing list