[llvm] AMDGPU: Improve codegen for intrinsic llvm.fptrunc.round (PR #104486)
Jay Foad via llvm-commits
llvm-commits at lists.llvm.org
Fri Aug 16 03:07:42 PDT 2024
jayfoad wrote:
> Improve codegen for intrinsic llvm.fptrunc.round
Do you have an example where codegen is improved?
> We name the pseudo instruction as FPTRUNC_ROUND_F32_TO_F16 (for f32 -> f16)
How about calling it `FPTRUNC_ROUND_F16_F32` to match `V_CVT_F16_F32`?
https://github.com/llvm/llvm-project/pull/104486
More information about the llvm-commits
mailing list