[all-commits] [llvm/llvm-project] 10d918: AMDGPU: Handle cvt_scale F32/F16->F4/F8 gfx950 hazard

Pravin Jagtap via All-commits all-commits at lists.llvm.org
Tue Nov 26 20:18:51 PST 2024


  Branch: refs/heads/users/arsenm/gfx950/cvt_scale_hazard
  Home:   https://github.com/llvm/llvm-project
  Commit: 10d91893557ca0f2213a2614782f49c4da4cecb2
      https://github.com/llvm/llvm-project/commit/10d91893557ca0f2213a2614782f49c4da4cecb2
  Author: Pravin Jagtap <Pravin.Jagtap at amd.com>
  Date:   2024-11-26 (Tue, 26 Nov 2024)

  Changed paths:
    M llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp
    M llvm/lib/Target/AMDGPU/GCNSubtarget.h
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
    M llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h
    M llvm/lib/Target/AMDGPU/VOP3Instructions.td
    M llvm/test/CodeGen/AMDGPU/hazards-gfx950.mir
    M llvm/test/CodeGen/AMDGPU/llvm.amdgcn.cvt.scalef32.pk.gfx950.ll
    M llvm/test/CodeGen/AMDGPU/materialize-frame-index-sgpr.ll

  Log Message:
  -----------
  AMDGPU: Handle cvt_scale F32/F16->F4/F8 gfx950 hazard

gfx950 SP changes doc says:
No 4 clk forwarding on opcodes that convert from
F32/F16->F8 or F32/F16->F4. Must insert a NOP or
instruction writing some other destination VREG
after a conversion to F4/F8 since it writes either
low/high half or bytes.

Co-authored-by: Pravin Jagtap <Pravin.Jagtap at amd.com>
Co-authored-by: Jeffrey Byrnes <Jeffrey.Byrnes at amd.com>



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list