[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)
Alex MacLean via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 10 09:30:40 PDT 2025
================
@@ -903,6 +921,11 @@ NVPTXTargetLowering::NVPTXTargetLowering(const NVPTXTargetMachine &TM,
}
}
+ // Expand v2f32 = fp_extend
+ setOperationAction(ISD::FP_EXTEND, MVT::v2f32, Expand);
+ // Expand v2[b]f16 = fp_round v2f32
+ setOperationAction(ISD::FP_ROUND, {MVT::v2bf16, MVT::v2f16}, Expand);
----------------
AlexMaclean wrote:
This looks unrelated to v2f32 support. Why is this needed as part of this patch? Can it be landed separately?
https://github.com/llvm/llvm-project/pull/126337