[PATCH] D158468: [AMDGPU] Add sdot4 / sdot8 intrinsics for gfx11
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Aug 22 11:06:17 PDT 2023
arsenm added a comment.
The title is confusing: this isn't adding new intrinsics.
================
Comment at: llvm/lib/Target/AMDGPU/VOP3PInstructions.td:437-452
defm V_DOT4_I32_IU8 : VOP3PDOTIUInst<"v_dot4_i32_iu8", int_amdgcn_sudot4>;
defm V_DOT8_I32_IU4 : VOP3PDOTIUInst<"v_dot8_i32_iu4", int_amdgcn_sudot8>;
+
+def : GCNPat < (int_amdgcn_sdot8 i32:$src0,
+ i32:$src1,
+ i32:$src2, (i1 timm:$clamp)),
+ (V_DOT8_I32_IU4 (i32 8), i32:$src0,
----------------
I don't understand how these cases are different; the intrinsic name is just slightly different from the instruction name?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D158468/new/
https://reviews.llvm.org/D158468