[llvm] AMDGPU/Docs: Fix target properties for gfx9-4-generic (PR #125593)
Konstantin Zhuravlyov via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 4 16:07:26 PST 2025
https://github.com/kzhuravl updated https://github.com/llvm/llvm-project/pull/125593
>From e3fa0c489906c3c5b9790969561a5253e9cebc26 Mon Sep 17 00:00:00 2001
From: Konstantin Zhuravlyov <kzhuravl_dev at outlook.com>
Date: Mon, 3 Feb 2025 17:48:49 -0500
Subject: [PATCH 1/2] AMDGPU/Docs: Fix target properties for gfx9-4-generic
gfx9-4-generic has architected flat scratch, not absolute
---
llvm/docs/AMDGPUUsage.rst | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/llvm/docs/AMDGPUUsage.rst b/llvm/docs/AMDGPUUsage.rst
index b646621d12eb0d..41c7aa5a112544 100644
--- a/llvm/docs/AMDGPUUsage.rst
+++ b/llvm/docs/AMDGPUUsage.rst
@@ -583,8 +583,8 @@ Generic processor code objects are versioned. See :ref:`amdgpu-generic-processor
- ``v_dot2_f32_f16``
- ``gfx9-4-generic`` ``amdgcn`` - ``gfx940`` - xnack - Absolute flat FP8 and BF8 instructions,
- - ``gfx941`` - sramecc scratch FP8 and BF8 conversion instructions,
+ ``gfx9-4-generic`` ``amdgcn`` - ``gfx940`` - xnack - Architected FP8 and BF8 instructions,
+ - ``gfx941`` - sramecc flat scratch FP8 and BF8 conversion instructions,
- ``gfx942`` as well as instructions with XF32 format support
- ``gfx950`` are not available.
>From a18fb710513b12569f8c274a55b2ce53311fec9b Mon Sep 17 00:00:00 2001
From: Konstantin Zhuravlyov <kzhuravl_dev at outlook.com>
Date: Tue, 4 Feb 2025 14:54:39 -0500
Subject: [PATCH 2/2] Address review feedback from Joe and Shilei
---
llvm/docs/AMDGPUUsage.rst | 10 +++++-----
1 file changed, 5 insertions(+), 5 deletions(-)
diff --git a/llvm/docs/AMDGPUUsage.rst b/llvm/docs/AMDGPUUsage.rst
index 41c7aa5a112544..dde6adf7248329 100644
--- a/llvm/docs/AMDGPUUsage.rst
+++ b/llvm/docs/AMDGPUUsage.rst
@@ -583,11 +583,11 @@ Generic processor code objects are versioned. See :ref:`amdgpu-generic-processor
- ``v_dot2_f32_f16``
- ``gfx9-4-generic`` ``amdgcn`` - ``gfx940`` - xnack - Architected FP8 and BF8 instructions,
- - ``gfx941`` - sramecc flat scratch FP8 and BF8 conversion instructions,
- - ``gfx942`` as well as instructions with XF32 format support
- - ``gfx950`` are not available.
-
+ ``gfx9-4-generic`` ``amdgcn`` - ``gfx940`` - sramecc - Architected FP8 and BF8 instructions,
+ - ``gfx941`` - tgsplit flat scratch FP8 and BF8 conversion instructions,
+ - ``gfx942`` - xnack - Packed as well as instructions with XF32 format support
+ - ``gfx950`` - kernarg preload work-item are not available.
+ IDs
``gfx10-1-generic`` ``amdgcn`` - ``gfx1010`` - xnack - Absolute flat - The following instructions are
- ``gfx1011`` - wavefrontsize64 scratch not available on ``gfx1011``
More information about the llvm-commits
mailing list