[llvm] 83d47ba - [AMDGPU] Add _e64_dpp asm suffix to docs
Joe Nash via llvm-commits
llvm-commits at lists.llvm.org
Thu Jun 22 11:31:25 PDT 2023
Author: Joe Nash
Date: 2023-06-22T14:31:09-04:00
New Revision: 83d47ba15a1229a21aaca8a8d6a33e0e90aabfd4
URL: https://github.com/llvm/llvm-project/commit/83d47ba15a1229a21aaca8a8d6a33e0e90aabfd4
DIFF: https://github.com/llvm/llvm-project/commit/83d47ba15a1229a21aaca8a8d6a33e0e90aabfd4.diff
LOG: [AMDGPU] Add _e64_dpp asm suffix to docs
The _e64_dpp suffix can be added to an instruction to force the
AsmParser to encode it as VOP3 with DPP if possible on GFX11+. This has
been the behavior since GFX11 was introduced; this patch only updates
the documentation.
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D153564
Added:
Modified:
llvm/docs/AMDGPUUsage.rst
Removed:
################################################################################
diff --git a/llvm/docs/AMDGPUUsage.rst b/llvm/docs/AMDGPUUsage.rst
index db3b3cda9f23c..e2eadad7bb4bf 100644
--- a/llvm/docs/AMDGPUUsage.rst
+++ b/llvm/docs/AMDGPUUsage.rst
@@ -14717,6 +14717,7 @@ force specific encoding, one can add a suffix to the opcode of the instruction:
* _e32 for 32-bit VOP1/VOP2/VOPC
* _e64 for 64-bit VOP3
* _dpp for VOP_DPP
+* _e64_dpp for VOP3 with DPP
* _sdwa for VOP_SDWA
VOP1/VOP2/VOP3/VOPC examples:
@@ -14749,6 +14750,15 @@ VOP_DPP examples:
v_add_f32 v0, v0, |v0| row_shl:1 row_mask:0xa bank_mask:0x1 bound_ctrl:0
v_max_f16 v1, v2, v3 row_shl:1 row_mask:0xa bank_mask:0x1 bound_ctrl:0
+
+VOP3_DPP examples (Available on GFX11+):
+
+.. code-block:: nasm
+
+ v_add_f32_e64_dpp v0, v1, v2 dpp8:[0,1,2,3,4,5,6,7]
+ v_sqrt_f32_e64_dpp v0, v1 row_shl:1 row_mask:0xa bank_mask:0x1 bound_ctrl:0
+ v_ldexp_f32 v0, v1, v2 dpp8:[0,1,2,3,4,5,6,7]
+
VOP_SDWA examples:
.. code-block:: nasm
More information about the llvm-commits
mailing list