[clang] [llvm] [NVPTX] Add intrinsics for cvt .f6x2 and .ue8m0x2 variants (PR #134345)

Artem Belevich via llvm-commits llvm-commits at lists.llvm.org
Fri Apr 4 13:58:46 PDT 2025


================
@@ -580,6 +580,15 @@ def __nvvm_f2bf16_rz : NVPTXBuiltinSMAndPTX<"__bf16(float)", SM_80, PTX70>;
 def __nvvm_f2bf16_rz_relu : NVPTXBuiltinSMAndPTX<"__bf16(float)", SM_80, PTX70>;
 
 def __nvvm_f2tf32_rna : NVPTXBuiltinSMAndPTX<"int32_t(float)", SM_80, PTX70>;
+def __nvvm_f2tf32_rna_satfinite : NVPTXBuiltinSMAndPTX<"int32_t(float)", SM_89, PTX81>;
----------------
Artem-B wrote:

The patch summary may need to be updated:
* we're also adding clang builtins, not just LLVM intrinsics.
* we're also adding them for tf32 (e.g. `__nvvm_f2tf32_rna_satfinite`).

Perhaps make the summary generic, e.g. "Added builtins/intrinsics for conversions between new FP types", and include an itemized list in the details.

https://github.com/llvm/llvm-project/pull/134345

