[llvm] [NVPTX] Add mix precision arith intrinsics (PR #136657)

Rajat Bajpai via llvm-commits llvm-commits at lists.llvm.org
Wed Apr 23 02:05:56 PDT 2025


================
@@ -1306,6 +1306,48 @@ let TargetPrefix = "nvvm" in {
       DefaultAttrsIntrinsic<[llvm_double_ty], [llvm_double_ty, llvm_double_ty],
         [IntrNoMem, IntrSpeculatable, Commutative]>;
 
+  // Mixed-precision add intrinsics for half and bfloat16 to float
----------------
rajatbajpai wrote:

Yes, they are semantically equivalent. If we are not inclined on adding newer intrinsics, may be we can evaluate adding transformation to xfrm `fpext` + `fadd` => `add.f32.f16` because both the scenarios leads to different SASS. What do you think?

https://github.com/llvm/llvm-project/pull/136657


More information about the llvm-commits mailing list