[Mlir-commits] [mlir] [MLIR][NVVM] Add support for f32 in redux.sync Op (PR #128137)
Durgadoss R
llvmlistbot at llvm.org
Fri Feb 21 02:19:01 PST 2025
================
@@ -25,9 +25,17 @@ using namespace mlir;
using namespace mlir::LLVM;
using mlir::LLVM::detail::createIntrinsicCall;
+#define REDUX_F32_ID_IMPL(op, abs, nan) \
+ hasNaN ? llvm::Intrinsic::nvvm_redux_sync_f##op##abs##nan \
+ : llvm::Intrinsic::nvvm_redux_sync_f##op##abs
+
+#define GET_REDUX_F32_ID(op, abs, nan) \
+ hasAbs ? REDUX_F32_ID_IMPL(op, abs, nan) : REDUX_F32_ID_IMPL(op, , nan)
+
----------------
durga4github wrote:
optional:
Since the concat is straight-forward, passing the booleans to the macros directly, makes them self-contained.
https://github.com/llvm/llvm-project/pull/128137
More information about the Mlir-commits
mailing list