[llvm] [AMDGPU][CodeGenPrepare] Narrow 64 bit math to 32 bit if profitable (PR #130577)
Nikita Popov via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 12 07:37:36 PDT 2025
================
@@ -8594,6 +8594,68 @@ static bool optimizeBranch(BranchInst *Branch, const TargetLowering &TLI,
return false;
}
+static bool tryNarrowMathIfNoOverflow(Instruction *I,
+                                      const TargetTransformInfo *TTI,
+                                      const DataLayout &DL) {
+  unsigned Opc = I->getOpcode();
+  Type *OldType = I->getType();
+
+  if (Opc != Instruction::Add && Opc != Instruction::Mul)
+    return false;
+
+  unsigned OrigBit = OldType->getScalarSizeInBits();
+  unsigned MaxBitsNeed = OrigBit;
+  switch (Opc) {
+  case Instruction::Add:
+    MaxBitsNeed = KnownBits::add(computeKnownBits(I->getOperand(0), DL),
+                                 computeKnownBits(I->getOperand(1), DL))
+                      .countMaxActiveBits();
+    break;
+  case Instruction::Mul:
+    MaxBitsNeed = KnownBits::mul(computeKnownBits(I->getOperand(0), DL),
+                                 computeKnownBits(I->getOperand(1), DL))
+                      .countMaxActiveBits();
+    break;
+  default:
+    break;
+  }
+
+  MaxBitsNeed = std::max<unsigned>(bit_ceil(MaxBitsNeed), 8);
----------------
nikic wrote:
Oh yeah, your code doesn't have the non-pow2 type issue, but it could still pick an illegal type. E.g. if you have a target with 64-bit and 32-bit types, this may pick 8-bit, which may unnecessarily inflate the costs by using an illegal type.
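
To make the concern concrete, here is a minimal standalone sketch (plain C++, not LLVM code). The `LegalWidths` table and both helper names are hypothetical and stand in for whatever the target actually reports; in-tree, the legality information would presumably come from a target query such as `TargetLowering::isTypeLegal` rather than a fixed list. It contrasts the rounding in the hunk above with one possible alternative that only ever selects a width the target supports.

```cpp
#include <algorithm>
#include <array>
#include <cassert>

// Hypothetical target: only 32-bit and 64-bit integers are legal.
// Sorted ascending so the first match is the narrowest usable width.
constexpr std::array<unsigned, 2> LegalWidths = {32, 64};

// Mirrors the rounding in the patch: next power of two, at least 8 bits.
// (Hand-rolled loop instead of bit_ceil to keep the sketch dependency-free.)
constexpr unsigned roundLikePatch(unsigned MaxBitsNeed) {
  unsigned Pow2 = 1;
  while (Pow2 < MaxBitsNeed)
    Pow2 <<= 1;
  return std::max<unsigned>(Pow2, 8);
}

// Hypothetical alternative: the narrowest legal width that still holds the
// result and is strictly narrower than the original type. Returns OrigBit
// unchanged when no such width exists, i.e. "do not narrow".
constexpr unsigned pickLegalNarrowWidth(unsigned MaxBitsNeed,
                                        unsigned OrigBit) {
  assert(MaxBitsNeed <= OrigBit && "result must fit the original type");
  for (unsigned W : LegalWidths)
    if (W >= MaxBitsNeed && W < OrigBit)
      return W;
  return OrigBit;
}
```

With a result that needs only 5 bits, `roundLikePatch(5)` yields 8, which is not in `LegalWidths`, while `pickLegalNarrowWidth(5, 64)` yields 32. When the result needs 40 bits, the second helper returns 64 and the narrowing is simply skipped.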
https://github.com/llvm/llvm-project/pull/130577