[llvm] [AMDGPU][CodeGenPrepare] Narrow 64 bit math to 32 bit if profitable (PR #130577)
via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 12 09:47:36 PDT 2025
================
@@ -8594,6 +8594,68 @@ static bool optimizeBranch(BranchInst *Branch, const TargetLowering &TLI,
   return false;
 }
+static bool tryNarrowMathIfNoOverflow(Instruction *I,
+                                      const TargetTransformInfo *TTI,
+                                      const DataLayout &DL) {
+  unsigned Opc = I->getOpcode();
+  Type *OldType = I->getType();
+
+  if (Opc != Instruction::Add && Opc != Instruction::Mul)
+    return false;
+
+  unsigned OrigBit = OldType->getScalarSizeInBits();
+  unsigned MaxBitsNeed = OrigBit;
+  switch (Opc) {
+  case Instruction::Add:
+    MaxBitsNeed = KnownBits::add(computeKnownBits(I->getOperand(0), DL),
+                                 computeKnownBits(I->getOperand(1), DL))
+                      .countMaxActiveBits();
+    break;
+  case Instruction::Mul:
+    MaxBitsNeed = KnownBits::mul(computeKnownBits(I->getOperand(0), DL),
+                                 computeKnownBits(I->getOperand(1), DL))
+                      .countMaxActiveBits();
+    break;
+  default:
+    break;
+  }
+
+  MaxBitsNeed = std::max<unsigned>(bit_ceil(MaxBitsNeed), 8);
----------------
Shoreshen wrote:
Thanks @nikic. Should I also check for trunc and zext?
https://github.com/llvm/llvm-project/pull/130577
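
For readers following the hunk, here is a minimal standalone sketch of the width computation it performs. It does not use LLVM. `ceilPow2`, `maxBitsForAdd`, `maxBitsForMul` and `roundedNarrowWidth` are hypothetical names, and the add/mul bounds are a conservative model of what `KnownBits::add(...)` and `KnownBits::mul(...)` followed by `countMaxActiveBits()` report, assuming only the maximum number of active bits of each operand is known (the real analysis can be tighter).

```cpp
#include <algorithm>

// Hypothetical stand-in for llvm::bit_ceil: smallest power of two >= n.
// n is a bit width here, so it is assumed to be small (well below 2^31).
inline unsigned ceilPow2(unsigned n) {
  unsigned p = 1;
  while (p < n)
    p <<= 1;
  return p;
}

// Conservative bound (assumption, not the patch's code): the sum of an
// a-bit and a b-bit unsigned value fits in max(a, b) + 1 bits.
inline unsigned maxBitsForAdd(unsigned a, unsigned b) {
  return std::max(a, b) + 1;
}

// Conservative bound (assumption, not the patch's code): the product of
// an a-bit and a b-bit unsigned value fits in a + b bits.
inline unsigned maxBitsForMul(unsigned a, unsigned b) {
  return a + b;
}

// Mirrors the last line of the hunk:
//   MaxBitsNeed = std::max<unsigned>(bit_ceil(MaxBitsNeed), 8);
// i.e. round up to a power-of-two width, never below 8 bits.
inline unsigned roundedNarrowWidth(unsigned maxBitsNeeded) {
  return std::max<unsigned>(ceilPow2(maxBitsNeeded), 8);
}
```

Under this model, adding two operands with at most 31 active bits each needs at most 32 bits, so `roundedNarrowWidth(maxBitsForAdd(31, 31))` yields 32 and a 64-bit add would be a narrowing candidate. With two 32-bit operands the bound is 33 bits, which rounds up to 64, so nothing would be narrowed.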