[all-commits] [llvm/llvm-project] 1e15ad: [AMDGPU] Add s_nop WaitStates between neighboring ...

Austin Kerbow via All-commits all-commits at lists.llvm.org
Wed Mar 23 13:57:33 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 1e15adba62a9fbc00a9999d75818ef8b1fbb8cd7
      https://github.com/llvm/llvm-project/commit/1e15adba62a9fbc00a9999d75818ef8b1fbb8cd7
  Author: Austin Kerbow <Austin.Kerbow at amd.com>
  Date:   2022-03-23 (Wed, 23 Mar 2022)

  Changed paths:
    M llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp
    M llvm/lib/Target/AMDGPU/GCNHazardRecognizer.h
    A llvm/test/CodeGen/AMDGPU/neighboring-mfma-padding.mir

  Log Message:
  -----------
  [AMDGPU] Add s_nop WaitStates between neighboring mfma

In some cases padding bubbles between sequential MFMA instructions may
lead to increased inter-wave performance. Add option to request to pad
some portion of these stall cycles with s_nops.

Fixes: SWDEV-326925

Reviewed By: rampitec

Differential Revision: https://reviews.llvm.org/D121437




More information about the All-commits mailing list