[llvm] [NVPTX] Improve 64bit FSH/ROT lowering when shift amount is constant (PR #131371)
Justin Fargnoli via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 17 11:30:59 PDT 2025
================
@@ -2758,6 +2764,53 @@ static SDValue lowerCTLZCTPOP(SDValue Op, SelectionDAG &DAG) {
return DAG.getNode(ISD::ZERO_EXTEND, DL, MVT::i64, CT, SDNodeFlags::NonNeg);
}
+static SDValue expandFSH64(SDValue A, SDValue B, SDValue AmtVal, SDLoc DL,
+ unsigned Opcode, SelectionDAG &DAG) {
+ assert(A.getValueType() == MVT::i64 && B.getValueType() == MVT::i64);
+
+ const auto *AmtConst = dyn_cast<ConstantSDNode>(AmtVal);
+ if (!AmtConst)
+ return SDValue();
+ const auto Amt = AmtConst->getZExtValue() & 63;
+
+ SDValue UnpackA =
+ DAG.getNode(NVPTXISD::UNPACK_VECTOR, DL, {MVT::i32, MVT::i32}, A);
+ SDValue UnpackB =
+ DAG.getNode(NVPTXISD::UNPACK_VECTOR, DL, {MVT::i32, MVT::i32}, B);
+
+ // Arch is Little endiain: 0 = low bits, 1 = high bits
+ SDValue ALo = UnpackA.getValue(0);
+ SDValue AHi = UnpackA.getValue(1);
+ SDValue BLo = UnpackB.getValue(0);
+ SDValue BHi = UnpackB.getValue(1);
+
+ // The bitfeild consists of { AHi : ALo : BHi : BLo }
+ // FSHL, Amt < 32 - The window will contain { AHi : ALo : BHi }
+ // FSHL, Amt >= 32 - The window will contain { ALo : BHi : BLo }
+ // FSHR, Amt < 32 - The window will contain { ALo : BHi : BLo }
+ // FSHR, Amt >= 32 - The window will contain { AHi : ALo : BHi }
----------------
justinfargnoli wrote:
Great comments :)
https://github.com/llvm/llvm-project/pull/131371
More information about the llvm-commits
mailing list