[PATCH] D121437: [AMDGPU] Add s_nop WaitStates between neighboring mfma
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 21 11:03:13 PDT 2022
rampitec added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/GCNHazardRecognizer.cpp:1397
if (WaitStatesNeeded == MaxWaitStates)
return WaitStatesNeeded; // Early exit.
----------------
Longest MAI is 64 cycles. You may want to move your code to the top as it can bring longest nop sequence.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D121437/new/
https://reviews.llvm.org/D121437
More information about the llvm-commits
mailing list