[llvm] [AMDGPU][MC] Restrict op_sel in VOP3P dot instructions (PR #100485)
Jun Wang via llvm-commits
llvm-commits at lists.llvm.org
Thu Nov 7 18:05:54 PST 2024
================
@@ -389,345 +389,23 @@ v_dot8_i32_i4 v0, v1, v2, v3
v_dot8_u32_u4 v0, v1, v2, v3
//
-// Test op_sel/op_sel_hi.
+// Test op_sel/op_sel_hi: in VOP3P dot, op_sel must be 0, op_sel_hi cannot appear
//
// CHECK: encoding: [0x00,0x40,0xa3,0xd3,0x01,0x05,0x0e,0x1c]
v_dot2_f32_f16 v0, v1, v2, v3 op_sel:[0,0]
-// CHECK: encoding: [0x00,0x50,0xa3,0xd3,0x01,0x05,0x0e,0x1c]
-v_dot2_f32_f16 v0, v1, v2, v3 op_sel:[0,1]
----------------
jwanggit86 wrote:
Thanks! However, `v_dot4_` and `v_dot8_` still don't support opsel pre-gfx940, right? What's the best way to differentiate between `v_dot2_` and, say `v_dot4_`?
https://github.com/llvm/llvm-project/pull/100485
More information about the llvm-commits
mailing list