[llvm] [NVPTX] support packed f32 instructions for sm_100+ (PR #126337)
Nikolay Panchenko via llvm-commits
llvm-commits at lists.llvm.org
Fri Jul 18 12:22:51 PDT 2025
npanchen wrote:
> Looks like we need to add the following:
>
> ```
> setTruncStoreAction(MVT::v2f32, MVT::v2f16, Expand);
> setTruncStoreAction(MVT::v2f32, MVT::v2bf16, Expand);
> ```
>
> Otherwise the fp_round will get folded into the store which we can't handle.
Yes, I finally got a chance to debug it and realized there's incorrect handling of truncating stores for a long period of time. Will you(@AlexMaclean) or @Artem-B or @Prince781 create a fix ?
https://github.com/llvm/llvm-project/pull/126337
More information about the llvm-commits
mailing list