[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)

Alex MacLean via llvm-commits llvm-commits at lists.llvm.org
Sat Feb 8 12:59:34 PST 2025


https://github.com/AlexMaclean commented:

> Rather than legalizing v2f32, we handle these four instructions ad hoc, so that codegen remains the same unless these instructions are present.

Supporting `v2f32` (similar to how we support `v2f16` for example) would be a cleaner and more extensible way to implement this change. 

https://github.com/llvm/llvm-project/pull/126337


More information about the llvm-commits mailing list