[llvm] AMDGPU: Handle gfx950 XDL Write VGPR wait state change (PR #126132)
via llvm-commits
llvm-commits at lists.llvm.org
Fri Feb 7 09:06:58 PST 2025
================
@@ -2290,12 +2290,14 @@ GFX940_SMFMA_N_PassWritesVGPROverlappedSrcABWaitStates(int NumPasses) {
   return NumPasses + 2;
 }
-static int GFX940_XDL_N_PassWritesVGPROverlappedSrcABWaitStates(int NumPasses) {
-  // 2 pass -> 5
-  // 4 pass -> 7
-  // 8 pass -> 11
-  // 16 pass -> 19
-  return NumPasses + 3;
+static int GFX940_XDL_N_PassWritesVGPROverlappedSrcABWaitStates(int NumPasses,
+                                                                bool IsGFX950) {
+  // xdl def cycles | gfx940 | gfx950
+  // 2 pass         |  5     |  5
+  // 4 pass         |  7     |  8
+  // 8 pass         |  11    |  12
+  // 16 pass        |  19    |  20
+  return NumPasses + 3 + (NumPasses != 2 && IsGFX950);
 }
----------------
bcahoon wrote:
Hi @scchan, that's a good point about generic targets. Should that be handled with a separate patch?
https://github.com/llvm/llvm-project/pull/126132
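
For readers checking the table in the hunk above, here is a minimal standalone sketch of the same arithmetic. It is not the LLVM source: the name `xdlWriteVGPRWaitStates` is a hypothetical stand-in for the patched helper, and the function is marked `constexpr` only so that the table rows can be verified at compile time. The extra wait state comes from the boolean term, which converts to 1 only when the target is gfx950 and the instruction is not a 2-pass one.

```cpp
#include <cassert>

// Hypothetical standalone copy of the patched helper, for illustration only.
// The bool expression promotes to int: 1 on gfx950 for 4/8/16-pass XDL ops,
// 0 otherwise (gfx940, or any 2-pass op).
constexpr int xdlWriteVGPRWaitStates(int NumPasses, bool IsGFX950) {
  return NumPasses + 3 + (NumPasses != 2 && IsGFX950);
}

// gfx940 column of the table in the diff comment.
static_assert(xdlWriteVGPRWaitStates(2, false) == 5, "2 pass, gfx940");
static_assert(xdlWriteVGPRWaitStates(4, false) == 7, "4 pass, gfx940");
static_assert(xdlWriteVGPRWaitStates(8, false) == 11, "8 pass, gfx940");
static_assert(xdlWriteVGPRWaitStates(16, false) == 19, "16 pass, gfx940");

// gfx950 column: one extra wait state, except for the 2-pass case.
static_assert(xdlWriteVGPRWaitStates(2, true) == 5, "2 pass, gfx950");
static_assert(xdlWriteVGPRWaitStates(4, true) == 8, "4 pass, gfx950");
static_assert(xdlWriteVGPRWaitStates(8, true) == 12, "8 pass, gfx950");
static_assert(xdlWriteVGPRWaitStates(16, true) == 20, "16 pass, gfx950");
```

With `IsGFX950` false, the sketch reduces to the pre-patch `NumPasses + 3`, so the gfx940 values are unchanged by this patch.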