[PATCH] D75392: [AMDGPU] Fix the gfx10 scheduling model for f32 conversions
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Mar 10 12:33:52 PDT 2020
foad marked 2 inline comments as done.
foad added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/VOP1Instructions.td:199
let FPDPRounding = 1 in {
defm V_CVT_F16_F32 : VOP1Inst <"v_cvt_f16_f32", VOP_F16_F32, fpround>;
} // End FPDPRounding = 1
----------------
rampitec wrote:
> As far as I can tell this is quarter rate.
For anyone following at home, we got confirmation from an authority at AMD that V_CVT_F16_F32 is full rate on gfx10.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D75392/new/
https://reviews.llvm.org/D75392
More information about the llvm-commits
mailing list