[llvm] [X86] Use an FP-based expansion for v4i32 ctlz on SSE2-only targets (PR #167034)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Nov 10 10:05:53 PST 2025
================
@@ -29039,6 +29045,13 @@ static SDValue LowerVectorCTLZ(SDValue Op, const SDLoc &DL,
   if (VT.is512BitVector() && !Subtarget.hasBWI())
     return splitVectorIntUnary(Op, DAG, DL);
+  if (VT == MVT::v4i32 && Subtarget.hasSSE2() && !Subtarget.hasSSSE3()) {
+    const TargetLowering &TLI = DAG.getTargetLoweringInfo();
+    SDValue New = TLI.expandCTLZWithFP(Op.getNode(), DAG);
----------------
arsenm wrote:
```suggestion
    SDValue New = expandCTLZWithFP(Op.getNode(), DAG);
```
https://github.com/llvm/llvm-project/pull/167034
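
For readers unfamiliar with the idea named in the subject line, here is a minimal scalar sketch of counting leading zeros through a floating-point exponent. It is an illustration only: the body of `expandCTLZWithFP` is not shown in this message, so nothing below should be read as the PR's actual expansion. The helper name `ctlz32_via_fp` is hypothetical, and the use of `double` (rather than a 4 x float vector) is an assumption made so that every 32-bit input converts exactly.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper, not LLVM code: count the leading zeros of a 32-bit
// value by reading the exponent of its floating-point representation.
inline unsigned ctlz32_via_fp(std::uint32_t X) {
  // Zero has no set bit, so there is no exponent to read; handle it up front.
  if (X == 0)
    return 32;

  // Assumption: convert to double, which represents every uint32_t exactly,
  // so rounding can never push the value up to the next power of two.
  double D = static_cast<double>(X);

  // Reinterpret the double as raw bits without violating aliasing rules.
  std::uint64_t Bits;
  std::memcpy(&Bits, &D, sizeof(Bits));

  // IEEE-754 binary64: the biased exponent sits in bits 62..52, bias 1023.
  // The unbiased exponent is the index of the highest set bit of X.
  unsigned HighBit = static_cast<unsigned>((Bits >> 52) & 0x7FF) - 1023;

  return 31 - HighBit;
}
```

A single-precision variant would need extra care, since a value such as 0x7FFFFFFF rounds up to 2^31 when converted to float and would report one leading zero too few.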