[PATCH] D154528: [AMDGPU][GlobalISel] Generate fast fp64-to-fp16 conversions in unsafe mode.
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 6 03:21:57 PDT 2023
foad added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/VOP1Instructions.td:519-522
+def : GCNPat<
+ (f16 (fpround f64:$src)),
+ (V_CVT_F16_F32_e64 SRCMODS.NONE, (V_CVT_F32_F64_e64 SRCMODS.NONE, $src))
+>;
----------------
kosarev wrote:
> arsenm wrote:
> > Should just let the expansion split them up, this is missing the source modifiers
> Updated to match modifiers. The expansion would combine back to a single fpround f64, so doesn't work.
"The expansion would combine back to a single fpround f64" - that sounds like the combine is broken then? It should respect legality.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D154528/new/
https://reviews.llvm.org/D154528
More information about the llvm-commits
mailing list