[llvm-branch-commits] [llvm] AMDGPU: Reduce cost of f64 copysign (PR #141944)

Pierre van Houtryve via llvm-branch-commits llvm-branch-commits at lists.llvm.org
Mon Jun 2 00:42:03 PDT 2025


================
@@ -741,8 +743,8 @@ GCNTTIImpl::getIntrinsicInstrCost(const IntrinsicCostAttributes &ICA,
   case Intrinsic::copysign:
     return NElts * getFullRateInstrCost();
   case Intrinsic::canonicalize: {
-    assert(SLT != MVT::f64);
-    InstRate = getFullRateInstrCost();
+    InstRate =
+        SLT == MVT::f64 ? get64BitInstrCost(CostKind) : getFullRateInstrCost();
     break;
   }
   case Intrinsic::uadd_sat:
----------------
Pierre-vh wrote:

are those cases below fine with handling f64 now?

https://github.com/llvm/llvm-project/pull/141944


More information about the llvm-branch-commits mailing list