[llvm] [AMDGPU][SDAG] Try folding "lshr i64 + mad" to "mad_u64_u32" (PR #119218)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Sun Jan 5 23:03:44 PST 2025
================
@@ -13857,6 +13857,36 @@ static SDValue getMad64_32(SelectionDAG &DAG, const SDLoc &SL, EVT VT,
return DAG.getNode(ISD::TRUNCATE, SL, VT, Mad);
}
+// Fold
+// y = lshr i64 x, 32
+// res = add (mul i64 y, Const), x where "Const" is a 64-bit constant
+// with Const.hi == -1
+// To
+// res = mad_u64_u32 y.lo ,Const.lo, x.lo
+static SDValue tryFoldMADwithSRL(SelectionDAG &DAG, const SDLoc &SL,
+ SDValue MulLHS, SDValue MulRHS,
+ SDValue AddRHS) {
+
+ if (MulLHS.getValueType() != MVT::i64 || MulLHS.getOpcode() != ISD::SRL)
+ return SDValue();
+
+ ConstantSDNode *ShiftVal = dyn_cast<ConstantSDNode>(MulLHS.getOperand(1));
+ if (!ShiftVal || MulLHS.getOperand(0) != AddRHS)
+ return SDValue();
+
+ if (ShiftVal->getAsZExtVal() != 32)
+ return SDValue();
+
+ uint64_t Const = dyn_cast<ConstantSDNode>(MulRHS.getNode())->getZExtValue();
----------------
arsenm wrote:
Unchecked dyn_cast. You should only check the constantness once with dyn_cast instead of splitting the handling
https://github.com/llvm/llvm-project/pull/119218
More information about the llvm-commits
mailing list