[PATCH] D42554: AMDGPU: Clarify ReqdWorkGroupSize and MaxFlatWorkGroupSize metadata
Tony Tye via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Jan 25 13:01:57 PST 2018
t-tye created this revision.
t-tye added reviewers: llvm-commits, kzhuravl, scchan.
Herald added subscribers: tpr, dstuttard, yaxunl, nhaehnle, wdng.
- If ReqdWorkGroupSize is present it must have all elements >=1.
- If MaxFlatWorkGroupSize must be consistent with ReqdWorkGroupSize
- Remove FixedWorkGroupSize as now equivalent to ReqdWorkGroupSize.
- Remove wor- group size information from the kernel descriptor.
https://reviews.llvm.org/D42554
Files:
docs/AMDGPUUsage.rst
Index: docs/AMDGPUUsage.rst
===================================================================
--- docs/AMDGPUUsage.rst
+++ docs/AMDGPUUsage.rst
@@ -991,9 +991,11 @@
=================== ============== ========= ==============================
String Key Value Type Required? Description
=================== ============== ========= ==============================
- "ReqdWorkGroupSize" sequence of The dispatch work-group size
- 3 integers X, Y, Z must correspond to the
- specified values.
+ "ReqdWorkGroupSize" sequence of If not 0, 0, 0 then all values
+ 3 integers must be >=1 and the dispatch
+ work-group size X, Y, Z must
+ correspond to the specified
+ values. Defaults to 0, 0, 0.
Corresponds to the OpenCL
``reqd_work_group_size``
@@ -1286,19 +1288,9 @@
supported by the
kernel in work-items.
Must be >=1 and
- consistent with any
- non-0 values in
- FixedWorkGroupSize.
- "FixedWorkGroupSize" sequence of Corresponds to the
- 3 integers dispatch work-group
- size X, Y, Z. If
- omitted, defaults to
- 0, 0, 0. If an
- element is non-0 then
- the kernel must only
- be launched with a
- matching corresponding
- work-group size.
+ consistent with
+ ReqdWorkGroupSize if
+ not 0, 0, 0.
"NumSpilledSGPRs" integer Number of stores from
a scalar register to
a register allocator
@@ -1530,30 +1522,7 @@
entry point instruction
which must be 256 byte
aligned.
- 223:192 4 bytes MaxFlatWorkGroupSize Maximum flat work-group
- size supported by the
- kernel in work-items. If
- an exact work-group size
- is required then must be
- omitted or 0 and
- ReqdWorkGroupSize* must
- be set to non-0.
- 239:224 2 bytes ReqdWorkGroupSizeX If present and non-0 then
- the kernel
- must be executed with the
- specified work-group size
- for X.
- 255:240 2 bytes ReqdWorkGroupSizeY If present and non-0 then
- the kernel
- must be executed with the
- specified work-group size
- for Y.
- 271:256 2 bytes ReqdWorkGroupSizeZ If present and non-0 then
- the kernel
- must be executed with the
- specified work-group size
- for Z.
- 383:272 14 Reserved, must be 0.
+ 383:192 24 Reserved, must be 0.
bytes
415:384 4 bytes ComputePgmRsrc1 Compute Shader (CS)
program settings used by
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D42554.131498.patch
Type: text/x-patch
Size: 5216 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180125/786552aa/attachment.bin>
More information about the llvm-commits
mailing list