[llvm] [AMDGPU] Fix hidden kernarg preload count inconsistency (PR #116759)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Tue Dec 3 13:56:04 PST 2024


================
@@ -771,6 +775,9 @@ GCNUserSGPRUsageInfo::GCNUserSGPRUsageInfo(const Function &F,
 
   if (hasPrivateSegmentSize())
     NumUsedUserSGPRs += getNumUserSGPRForField(PrivateSegmentSizeID);
+
+  if (hasLDSKernelId())
+    NumSyntheticSGPRs += getNumUserSGPRForField(LDSKernelIdID);
----------------
arsenm wrote:

I'm not sure I understand why anything is changing with the kernel ID. Is this just changing the bookkeeping to split out the kernel's user SGPRs vs. the implicit SGPRs in a function?

https://github.com/llvm/llvm-project/pull/116759

