[PATCH] D75392: [AMDGPU] Fix the gfx10 scheduling model for f32 conversions

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 28 16:07:20 PST 2020


rampitec added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/VOP1Instructions.td:199
 let FPDPRounding = 1 in {
 defm V_CVT_F16_F32 : VOP1Inst <"v_cvt_f16_f32", VOP_F16_F32, fpround>;
 } // End FPDPRounding = 1
----------------
As far as I can tell, this is quarter rate.


================
Comment at: llvm/lib/Target/AMDGPU/VOP1Instructions.td:201
 } // End FPDPRounding = 1
 defm V_CVT_F32_F16 : VOP1Inst <"v_cvt_f32_f16", VOP_F32_F16, fpextend>;
 defm V_CVT_RPI_I32_F32 : VOP1Inst <"v_cvt_rpi_i32_f32", VOP_I32_F32, cvt_rpi_i32_f32>;
----------------
And this one is full rate, as are the 3 opcodes below it.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D75392/new/

https://reviews.llvm.org/D75392




