[llvm] [LoopVectorize] Add cost of generating tail-folding mask to the loop (PR #130565)
Mel Chen via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 10 07:02:51 PDT 2025
================
@@ -743,6 +743,22 @@ InstructionCost VPInstruction::computeCost(ElementCount VF,
return Ctx.TTI.getArithmeticReductionCost(
Instruction::Or, cast<VectorType>(VecTy), std::nullopt, Ctx.CostKind);
}
+ case VPInstruction::ActiveLaneMask: {
+ Type *Arg0Ty = Ctx.Types.inferScalarType(getOperand(0));
+ Type *Arg1Ty = Ctx.Types.inferScalarType(getOperand(1));
+ Type *RetTy = toVectorTy(Ctx.Types.inferScalarType(this), VF);
----------------
Mel-Chen wrote:
IIUC, `llvm.get.active.lane.mask` only returns a `VF x i1` type.
Could we use `getInt1Ty` directly here?
https://github.com/llvm/llvm-project/pull/130565
More information about the llvm-commits
mailing list