[PATCH] D133489: [AMDGPU] Fix shrinking of F16 FMA on newer subtargets
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Sep 8 08:24:23 PDT 2022
foad added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/gfx10-shrink-mad-fma.mir:195
%1:vgpr_32 = IMPLICIT_DEF
- %2:vgpr_32 = V_FMA_F16_e64 0, 18688, 0, %0, 0, %1, 0, 0, implicit $mode, implicit $exec
SI_RETURN implicit %2
----------------
Joe_Nash wrote:
> I guess the test case before was technically invalid, because V_FMA_F16_e64 should not exist on GFX10, only GFX8. But afaik we do not check whether an instruction is valid on the subtarget in SIInstrInfo::verifyInstruction.
Right. See https://reviews.llvm.org/D129506#inline-1246267.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D133489/new/
https://reviews.llvm.org/D133489
More information about the llvm-commits
mailing list