[PATCH] D106023: [AMDGPU] Mark relevant rematerializable VOP2 instructions
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 15 15:23:17 PDT 2021
arsenm added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/VOP2Instructions.td:765
+let SubtargetPredicate = HasFmaakFmamkF32Insts, isReMaterializable = 1 in {
def V_FMAMK_F32 : VOP2_Pseudo<"v_fmamk_f32", VOP_MADMK_F32, [], "">;
----------------
rampitec wrote:
> arsenm wrote:
> > This and the other fma flavors preserve the high bits on gfx9
> This is f32, both f16 fma shall also zero hi bits because these are VOP2 only.
I wrote inline asm tests a few weeks ago for all of these. The gfx9 manual says it didn't change the existing instruction behavior, but I think this was wrong. mad/mac/fma all seem to preserve (see GCNSubtarget::zeroesHigh16BitsOfDest)
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D106023/new/
https://reviews.llvm.org/D106023
More information about the llvm-commits
mailing list