[llvm] [NVPTX] Add mix precision arith intrinsics (PR #136657)
Rajat Bajpai via llvm-commits
llvm-commits at lists.llvm.org
Wed Apr 23 02:05:56 PDT 2025
================
@@ -1306,6 +1306,48 @@ let TargetPrefix = "nvvm" in {
DefaultAttrsIntrinsic<[llvm_double_ty], [llvm_double_ty, llvm_double_ty],
[IntrNoMem, IntrSpeculatable, Commutative]>;
+ // Mixed-precision add intrinsics for half and bfloat16 to float
----------------
rajatbajpai wrote:
Yes, they are semantically equivalent. If we are not inclined to add new intrinsics, maybe we can evaluate adding a transformation that rewrites `fpext` + `fadd` => `add.f32.f16`, because the two forms currently lead to different SASS. What do you think?
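To make the equivalence concrete, here is a small reference model in Python. It is only a sketch and not part of the PR: the helper names are hypothetical, and it assumes the mixed-precision add computes `float(half) + float` with a single round-to-nearest-even f32 rounding, which is what the `fpext` + `fadd` sequence does.

```python
import struct


def to_f16(x: float) -> float:
    """Round a Python float to the nearest IEEE binary16 value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_f32(x: float) -> float:
    """Round a Python float to the nearest IEEE binary32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def fpext_f16_to_f32(h: float) -> float:
    """Model of `fpext half -> float` (hypothetical helper).

    Every binary16 value is exactly representable in binary32,
    so widening never changes the value.
    """
    assert to_f16(h) == h, "input must already be a binary16 value"
    return to_f32(h)


def fadd_f32(a: float, b: float) -> float:
    """Model of `fadd float`: add, then round once to binary32."""
    return to_f32(a + b)


def mixed_add_reference(h: float, f: float) -> float:
    """Assumed semantics of a half + float add that yields float."""
    return fadd_f32(fpext_f16_to_f32(h), f)
```

Under that assumption the two forms produce the same value for every input, so a rewrite from the `fpext` + `fadd` pattern to the mixed-precision instruction would only change the generated code, not the result.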
https://github.com/llvm/llvm-project/pull/136657