[llvm] [NVPTX] Improve modeling of inline PTX (PR #130675)
Alex MacLean via llvm-commits
llvm-commits at lists.llvm.org
Tue Mar 25 10:06:32 PDT 2025
================
@@ -483,6 +484,34 @@ NVPTXTTIImpl::instCombineIntrinsic(InstCombiner &IC, IntrinsicInst &II) const {
return std::nullopt;
}
+InstructionCost
+NVPTXTTIImpl::getInstructionCost(const User *U,
+ ArrayRef<const Value *> Operands,
+ TTI::TargetCostKind CostKind) {
+ if (const auto *CI = dyn_cast<CallInst>(U))
+ if (const auto *IA = dyn_cast<InlineAsm>(CI->getCalledOperand())) {
+ // Without this implementation getCallCost() would return the number
+ // of arguments+1 as the cost. Because the cost-model assumes it is a call
+ // since it is classified as a call in the IR. A better cost model would
+ // be to return the number of asm instructions embedded in the asm
+ // string.
+ auto &AsmStr = IA->getAsmString();
+ SmallVector<StringRef, 4> AsmPieces;
+ SplitString(AsmStr, AsmPieces, ";\n");
----------------
AlexMaclean wrote:
Yea, that is true. No matter what, there are going to be edge cases that are not correctly estimated here. I've switched to splitting only on ";" but my assumption is this hasn't fixed all the examples you listed above. I do think it will be good enough for most inline PTX though.
https://github.com/llvm/llvm-project/pull/130675
More information about the llvm-commits
mailing list