[llvm] AMDGPU: Improve codegen for intrinsic llvm.fptrunc.round (PR #104486)
Changpeng Fang via llvm-commits
llvm-commits at lists.llvm.org
Fri Aug 16 10:15:51 PDT 2024
changpeng wrote:
> Do you have an example where codegen is improved?
Same code generated for round.upward ad round.downward. Add two additional cases round.tonearst and round.towardzero.
> How about calling it `FPTRUNC_ROUND_F16_F32` to match `V_CVT_F16_F32`?
Will do as suggested. Thanks.
https://github.com/llvm/llvm-project/pull/104486
More information about the llvm-commits
mailing list