[llvm-branch-commits] [llvm] AMDGPU: Reduce cost of f64 copysign	(PR #141944)
    Pierre van Houtryve via llvm-branch-commits 
    llvm-branch-commits at lists.llvm.org
       
    Mon Jun  2 00:42:03 PDT 2025
    
    
  
================
@@ -741,8 +743,8 @@ GCNTTIImpl::getIntrinsicInstrCost(const IntrinsicCostAttributes &ICA,
   case Intrinsic::copysign:
     return NElts * getFullRateInstrCost();
   case Intrinsic::canonicalize: {
-    assert(SLT != MVT::f64);
-    InstRate = getFullRateInstrCost();
+    InstRate =
+        SLT == MVT::f64 ? get64BitInstrCost(CostKind) : getFullRateInstrCost();
     break;
   }
   case Intrinsic::uadd_sat:
----------------
Pierre-vh wrote:
Are the cases below fine with handling f64 now?
https://github.com/llvm/llvm-project/pull/141944
    
    