[llvm] [ConstantFolding] Fix nvvm_round folding on PPC (PR #149837)

Mon Jul 21 08:27:45 PDT 2025

https://github.com/LewisCrawford created https://github.com/llvm/llvm-project/pull/149837

Fix a failing test for constant-folding the nvvm_round intrinsic. The original implementation added in #141233 used a native libm call to the "round" function, but on PPC this produces +0.0 if the input is -0.0, which caused a test failure.

This patch updates it to use APFloat functions instead of native libm calls to ensure cross-platform consistency.

>From 310d89d1835e5726bece32a59787b85853209ad3 Mon Sep 17 00:00:00 2001
From: Lewis Crawford <lcrawford at nvidia.com>
Date: Mon, 21 Jul 2025 15:23:03 +0000
Subject: [PATCH] [ConstantFolding] Fix nvvm_round folding on PPC

Fix a failing test for constant-folding the nvvm_round intrinsic.
The original implementation added in #141233 used a native libm
call to the "round" function, but on PPC this produces +0.0 if the
input is -0.0, which caused a test failure.

This patch updates it to use APFloat functions instead of native
libm calls to ensure cross-platform consistency.
---
 llvm/lib/Analysis/ConstantFolding.cpp | 14 +++++++++-----
 1 file changed, 9 insertions(+), 5 deletions(-)

diff --git a/llvm/lib/Analysis/ConstantFolding.cpp b/llvm/lib/Analysis/ConstantFolding.cpp
index f5a88b6e0368e..eb7369fcc7513 100644
--- a/llvm/lib/Analysis/ConstantFolding.cpp
+++ b/llvm/lib/Analysis/ConstantFolding.cpp
@@ -2677,11 +2677,15 @@ static Constant *ConstantFoldScalarCall1(StringRef Name,
 
       case Intrinsic::nvvm_round_ftz_f:
       case Intrinsic::nvvm_round_f:
-      case Intrinsic::nvvm_round_d:
-        return ConstantFoldFP(
-            round, APF, Ty,
-            nvvm::GetNVVMDenromMode(
-                nvvm::UnaryMathIntrinsicShouldFTZ(IntrinsicID)));
+      case Intrinsic::nvvm_round_d: {
+        // Use APFloat implementation instead of native libm call, as some
+        // implementations (e.g. on PPC) do not preserve the sign of negative 0.
+        APFloat Res = nvvm::UnaryMathIntrinsicShouldFTZ(IntrinsicID)
+                          ? FTZPreserveSign(APF)
+                          : APF;
+        Res.roundToIntegral(APFloat::rmNearestTiesToAway);
+        return ConstantFP::get(Ty->getContext(), U);
+      }
 
       case Intrinsic::nvvm_saturate_ftz_f:
       case Intrinsic::nvvm_saturate_d: