[llvm] AMDGPU/Docs: Fix target properties for gfx9-4-generic (PR #125593)

Joseph Greathouse via llvm-commits llvm-commits at lists.llvm.org
Mon Feb 3 15:38:31 PST 2025


================
@@ -583,8 +583,8 @@ Generic processor code objects are versioned. See :ref:`amdgpu-generic-processor
                                                                                                   - ``v_dot2_f32_f16``
 
 
-     ``gfx9-4-generic``   ``amdgcn``     - ``gfx940``      - xnack            - Absolute flat   FP8 and BF8 instructions,
-                                         - ``gfx941``      - sramecc            scratch         FP8 and BF8 conversion instructions,
+     ``gfx9-4-generic``   ``amdgcn``     - ``gfx940``      - xnack            - Architected     FP8 and BF8 instructions,
+                                         - ``gfx941``      - sramecc            flat scratch    FP8 and BF8 conversion instructions,
----------------
jlgreathouse wrote:

Yes, gfx94x and gfx950 have packed work-item IDs, so I think the gfx9-4-generic processor requires them as well.

gfx94x and gfx950 offer both tgsplit and kernarg preload. so gfx9-4-generic can support the feature. These are optional, so gfx9-4-generic is not necessarily _required_ to offer them. But it probably should and they probably should be in this table.

https://github.com/llvm/llvm-project/pull/125593


More information about the llvm-commits mailing list