[llvm-branch-commits] [llvm] AMDGPU: Reduce cost of f64 copysign (PR #141944)

Matt Arsenault via llvm-branch-commits llvm-branch-commits at lists.llvm.org
Mon Jun 2 00:48:47 PDT 2025


================
@@ -741,8 +743,8 @@ GCNTTIImpl::getIntrinsicInstrCost(const IntrinsicCostAttributes &ICA,
   case Intrinsic::copysign:
     return NElts * getFullRateInstrCost();
   case Intrinsic::canonicalize: {
-    assert(SLT != MVT::f64);
-    InstRate = getFullRateInstrCost();
+    InstRate =
+        SLT == MVT::f64 ? get64BitInstrCost(CostKind) : getFullRateInstrCost();
     break;
   }
   case Intrinsic::uadd_sat:
----------------
arsenm wrote:

They are only integer intrinsics 

https://github.com/llvm/llvm-project/pull/141944


More information about the llvm-branch-commits mailing list