[llvm-branch-commits] [llvm] AMDGPU: Reduce cost of f64 copysign (PR #141944)
Pierre van Houtryve via llvm-branch-commits
llvm-branch-commits at lists.llvm.org
Mon Jun 2 00:42:03 PDT 2025
================
@@ -741,8 +743,8 @@ GCNTTIImpl::getIntrinsicInstrCost(const IntrinsicCostAttributes &ICA,
case Intrinsic::copysign:
return NElts * getFullRateInstrCost();
case Intrinsic::canonicalize: {
- assert(SLT != MVT::f64);
- InstRate = getFullRateInstrCost();
+ InstRate =
+ SLT == MVT::f64 ? get64BitInstrCost(CostKind) : getFullRateInstrCost();
break;
}
case Intrinsic::uadd_sat:
----------------
Pierre-vh wrote:
Are the cases below this one also fine with handling f64 now?
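
For context, the rate selection that the hunk introduces for the canonicalize case can be modelled on its own. This is a minimal standalone sketch, not LLVM code: the `ScalarTy` enum, the two constants, and the helpers `instRateFor` and `canonicalizeCost` are hypothetical names, and the cost values are placeholders (the real ones come from `getFullRateInstrCost()` and `get64BitInstrCost(CostKind)`, which depend on the subtarget and cost kind). Scaling the rate linearly by the element count is also an assumption.

```cpp
#include <cassert>

// Hypothetical stand-ins for the scalar types the switch distinguishes.
enum class ScalarTy { F16, F32, F64 };

// Placeholder rates, assumed for illustration only.
constexpr int kFullRateCost = 1; // stands in for getFullRateInstrCost()
constexpr int k64BitCost = 4;    // stands in for get64BitInstrCost(CostKind)

// Mirrors the new selection in the hunk: f64 takes the 64-bit rate,
// every other scalar type keeps the full rate.
constexpr int instRateFor(ScalarTy SLT) {
  return SLT == ScalarTy::F64 ? k64BitCost : kFullRateCost;
}

// Assumption: the total cost is the per-element rate times the element count.
constexpr int canonicalizeCost(ScalarTy SLT, int NElts) {
  return NElts * instRateFor(SLT);
}

// Compile-time checks under the placeholder rates above.
static_assert(canonicalizeCost(ScalarTy::F32, 2) == 2, "f32 uses full rate");
static_assert(canonicalizeCost(ScalarTy::F64, 2) == 8, "f64 uses 64-bit rate");
```

With the assert removed, an f64 query reaches the rate selection instead of aborting, which is why any later case that shares the same post-switch path and assumed a non-f64 type is worth re-checking.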
https://github.com/llvm/llvm-project/pull/141944