[PATCH] D120769: [AMDGPU] gfx940: support V_FMAMK_F32 and V_FMAAK_F32

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 2 10:25:56 PST 2022


rampitec marked an inline comment as done.
rampitec added inline comments.


================
Comment at: llvm/test/CodeGen/AMDGPU/madak.ll:162
+; GFX10-FMA:     v_fmaak_f32 v{{[0-9]+}}, [[SB]], [[VA]], 0x41200000
+; GFX940-FMA:    v_fmac_f32_e32 v{{[0-9]+}}, [[SB]], [[VA]]
 define amdgpu_kernel void @v_s_madak_f32(float addrspace(1)* noalias %out, float %a, float addrspace(1)* noalias %in.b) #0 {
----------------
foad wrote:
> Isn't this supposed to use fmaak now?
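No, fmaak here would exceed the constant bus limit on gfx940: the SGPR operand [[SB]] and the literal 0x41200000 each take a constant bus slot. This is OK for gfx10, where the limit is 2.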
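
For readers unfamiliar with the restriction, here is a minimal sketch of the counting argument in Python. It is a simplified model and not LLVM's actual check: the `Operand` type, the `fits` helper, and the limit table are hypothetical names made up for illustration. The gfx10 limit of 2 comes from the reply above; the gfx940 limit of 1 is an assumption inferred from it. The model counts each distinct SGPR and each literal as one constant bus use and treats VGPR reads as free.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Operand:
    kind: str  # "vgpr", "sgpr", or "literal"
    name: str  # register name or literal value; repeated reads count once


# Assumed limits: 2 on gfx10 (from the reply), 1 on gfx940 (inferred).
CONSTANT_BUS_LIMIT = {"gfx10": 2, "gfx940": 1}


def constant_bus_uses(operands):
    """Count distinct SGPR and literal sources; VGPRs do not use the bus."""
    return len({(op.kind, op.name) for op in operands
                if op.kind in ("sgpr", "literal")})


def fits(target, operands):
    """True if the source operands stay within the target's limit."""
    return constant_bus_uses(operands) <= CONSTANT_BUS_LIMIT[target]


# v_fmaak_f32 as it appears in the GFX10-FMA check line:
# one SGPR source, one VGPR source, and the mandatory literal.
fmaak_srcs = [
    Operand("sgpr", "SB"),
    Operand("vgpr", "VA"),
    Operand("literal", "0x41200000"),
]

# v_fmac_f32_e32 as it appears in the GFX940-FMA check line. The third
# source is the destination VGPR; that it already holds the constant is
# an assumption of this sketch.
fmac_srcs = [
    Operand("sgpr", "SB"),
    Operand("vgpr", "VA"),
    Operand("vgpr", "VDST"),
]

for target in ("gfx10", "gfx940"):
    print(target, "fmaak fits:", fits(target, fmaak_srcs),
          "| fmac fits:", fits(target, fmac_srcs))
```

Under these assumptions the fmaak form needs two constant bus slots, which fits only the gfx10 limit, while the fmac form needs one and fits both.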


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120769/new/

https://reviews.llvm.org/D120769


