[llvm] [NVPTX] Add support for f16 fabs (PR #116107)

Justin Fargnoli via llvm-commits llvm-commits at lists.llvm.org
Wed Nov 13 13:52:58 PST 2024


================
@@ -864,10 +864,15 @@ NVPTXTargetLowering::NVPTXTargetLowering(const NVPTXTargetMachine &TM,
     AddPromotedToType(Op, MVT::bf16, MVT::f32);
   }
   for (const auto &Op : {ISD::FABS}) {
-    setOperationAction(Op, MVT::f16, Promote);
     setOperationAction(Op, MVT::f32, Legal);
     setOperationAction(Op, MVT::f64, Legal);
----------------
justinfargnoli wrote:

> abs.f64 requires sm_13 or higher.

9.7.3.9. [Floating Point Instructions: abs](https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#floating-point-instructions-abs)

https://github.com/llvm/llvm-project/pull/116107


More information about the llvm-commits mailing list