[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)

Nikolay Panchenko via llvm-commits llvm-commits at lists.llvm.org
Fri Jul 18 12:49:06 PDT 2025


npanchen wrote:

> Looks like we need to add the following:
> 
> ```
>   setTruncStoreAction(MVT::v2f32, MVT::v2f16, Expand);
>   setTruncStoreAction(MVT::v2f32, MVT::v2bf16, Expand);
> ```
> 
> Otherwise the fp_round will get folded into the store, producing a truncating store that we can't handle.

With both https://github.com/llvm/llvm-project/pull/149393 and https://github.com/llvm/llvm-project/pull/149571 applied, my local testing is clean!
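
For anyone following along, here is a rough toy model of why the quoted `Expand` actions matter. It is not LLVM code: `Action`, `TruncStoreTable`, `lowerStoreOfRound`, and `example` are hypothetical names made up for this sketch, and treating unlisted type pairs as `Legal` is an assumption for illustration only. The idea is just that a store of an fp_round result is folded into a single truncating store unless that (value type, memory type) pair is marked `Expand`.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Toy stand-in for a legalize action; not LLVM's real enum.
enum class Action { Legal, Expand };

// Hypothetical table keyed by (value type, memory type), loosely modeled on
// the information a call like setTruncStoreAction records.
class TruncStoreTable {
  std::map<std::pair<std::string, std::string>, Action> Actions;

public:
  void set(const std::string &ValVT, const std::string &MemVT, Action A) {
    Actions[std::make_pair(ValVT, MemVT)] = A;
  }

  // Assumption for this sketch: pairs never set are treated as Legal.
  Action get(const std::string &ValVT, const std::string &MemVT) const {
    auto It = Actions.find(std::make_pair(ValVT, MemVT));
    return It == Actions.end() ? Action::Legal : It->second;
  }
};

// Returns the node sequence this toy model picks for store(fp_round(x)).
std::vector<std::string> lowerStoreOfRound(const TruncStoreTable &T,
                                           const std::string &ValVT,
                                           const std::string &MemVT) {
  // Legal: the round is folded into one truncating store.
  if (T.get(ValVT, MemVT) == Action::Legal)
    return {"truncstore " + ValVT + " -> " + MemVT};
  // Expand: keep the round and the store as separate nodes.
  return {"fp_round " + ValVT + " -> " + MemVT, "store " + MemVT};
}

// Small usage example mirroring the two quoted type pairs.
inline void example() {
  TruncStoreTable T;
  assert(lowerStoreOfRound(T, "v2f32", "v2f16").size() == 1); // folded
  T.set("v2f32", "v2f16", Action::Expand);
  T.set("v2f32", "v2bf16", Action::Expand);
  assert(lowerStoreOfRound(T, "v2f32", "v2f16").size() == 2);  // kept apart
  assert(lowerStoreOfRound(T, "v2f32", "v2bf16").size() == 2); // kept apart
}
```

In this model, marking the pair `Expand` is what keeps the separate fp_round node alive so it can be selected on its own, which is the effect the quoted suggestion is after.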

https://github.com/llvm/llvm-project/pull/126337

