[llvm] aebd338 - [AMDGPU] Fix user SGPR alloc order in docs (#119092)
via llvm-commits
llvm-commits at lists.llvm.org
Sat Dec 7 13:08:39 PST 2024
Author: Austin Kerbow
Date: 2024-12-07T13:08:35-08:00
New Revision: aebd3389a9e694f7087d55e159186734d4559ca6
URL: https://github.com/llvm/llvm-project/commit/aebd3389a9e694f7087d55e159186734d4559ca6
DIFF: https://github.com/llvm/llvm-project/commit/aebd3389a9e694f7087d55e159186734d4559ca6.diff
LOG: [AMDGPU] Fix user SGPR alloc order in docs (#119092)
NFC. Preload kernarg SGPRs are allocated after the private segment size
SGPR. This patch updates AMDGPUUsage.rst to reflect this.
Added:
Modified:
llvm/docs/AMDGPUUsage.rst
Removed:
################################################################################
diff --git a/llvm/docs/AMDGPUUsage.rst b/llvm/docs/AMDGPUUsage.rst
index 7dbfa8c085b91a..5c6034753eb4af 100644
--- a/llvm/docs/AMDGPUUsage.rst
+++ b/llvm/docs/AMDGPUUsage.rst
@@ -5756,9 +5756,6 @@ SGPR register initial state is defined in
then Flat Scratch Init 2 See
(enable_sgpr_flat_scratch :ref:`amdgpu-amdhsa-kernel-prolog-flat-scratch`.
_init)
- then Preloaded Kernargs N/A See
- (kernarg_preload_spec :ref:`amdgpu-amdhsa-kernarg-preload`.
- _length)
then Private Segment Size 1 The 32-bit byte size of a
(enable_sgpr_private single work-item's memory
_segment_size) allocation. This is the
@@ -5779,6 +5776,9 @@ SGPR register initial state is defined in
may be needed for GFX9-GFX11 which
changes the meaning of the
Flat Scratch Init value.
+ then Preloaded Kernargs N/A See
+ (kernarg_preload_spec :ref:`amdgpu-amdhsa-kernarg-preload`.
+ _length)
then Work-Group Id X 1 32-bit work-group id in X
(enable_sgpr_workgroup_id dimension of grid for
_X) wavefront.
More information about the llvm-commits
mailing list