[llvm] [AMDGPU] Fix hidden kernarg preload count inconsistency (PR #116759)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Tue Dec 3 13:56:04 PST 2024
================
@@ -771,6 +775,9 @@ GCNUserSGPRUsageInfo::GCNUserSGPRUsageInfo(const Function &F,
if (hasPrivateSegmentSize())
NumUsedUserSGPRs += getNumUserSGPRForField(PrivateSegmentSizeID);
+
+ if (hasLDSKernelId())
+ NumSyntheticSGPRs += getNumUserSGPRForField(LDSKernelIdID);
----------------
arsenm wrote:
I'm not sure I understand why anything is changing with the kernel ID. Is this just changing bookkeeping to split out the kernel's user SGPRs. vs. the implicit SGPRs in a function?
https://github.com/llvm/llvm-project/pull/116759
More information about the llvm-commits
mailing list