[llvm] [RISCV] Decompose locally repeating shuffles (without exact VLEN) (PR #125735)
Luke Lau via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 4 18:34:47 PST 2025
================
@@ -5685,10 +5700,42 @@ static SDValue lowerVECTOR_SHUFFLE(SDValue Op, SelectionDAG &DAG,
: DAG.getUNDEF(XLenVT));
}
SDValue LHSIndices = DAG.getBuildVector(IndexVT, DL, GatherIndicesLHS);
- LHSIndices = convertToScalableVector(IndexContainerVT, LHSIndices, DAG,
- Subtarget);
- SDValue Gather = DAG.getNode(GatherVVOpc, DL, ContainerVT, V1, LHSIndices,
- DAG.getUNDEF(ContainerVT), TrueMask, VL);
+ LHSIndices =
+ convertToScalableVector(IndexContainerVT, LHSIndices, DAG, Subtarget);
+
+ SDValue Gather;
+ // If we have a locally repeating mask, then we can reuse the first register
+ // in the index register group for all registers within the source register
+ // group. TODO:: This generalizes to m2, and m4. Also, this is currently
+ // picking up cases with a fully undef tail which could be more directly
+ // handled with fewer redundant vrgathers
+ const MVT M1VT = getLMUL1VT(ContainerVT);
+ auto VLMAX = RISCVTargetLowering::computeVLMAXBounds(M1VT, Subtarget).first;
+ if (ContainerVT.bitsGT(M1VT) && isLocalRepeatingShuffle(Mask, VLMAX)) {
+ EVT SubIndexVT = M1VT.changeVectorElementType(IndexVT.getScalarType());
+ SDValue SubIndex =
+ DAG.getNode(ISD::EXTRACT_SUBVECTOR, DL, SubIndexVT, LHSIndices,
+ DAG.getVectorIdxConstant(0, DL));
+ auto [InnerTrueMask, InnerVL] =
----------------
lukel97 wrote:
Nit, would SubTrueMask + SubVL match the naming of the rest of the SDValues better?
https://github.com/llvm/llvm-project/pull/125735
More information about the llvm-commits
mailing list