[PATCH] D121401: [AMDGPU] Support gfx940 v_lshl_add_u64 instruction

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 11 12:16:01 PST 2022


rampitec added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/VOP3Instructions.td:689
+  (ThreeOpFrag<shl_0_to_4, add> i64:$src0, i32:$src1, i64:$src2),
+  (V_LSHL_ADD_U64_e64 VSrc_b64:$src0, VSrc_b32:$src1, VSrc_b64:$src2)
+>;
----------------
foad wrote:
> If this is a normal VOP3 then surely src1 can be anything, so why restrict it to immediates 0..4 ? If this is a real restriction on the allowed operands, then you should probably mention that where the instruction is defined.
This is a real limitation:
LSHL_ADD is a 64-bit which left-shifts the result by 0-4 (use “0” to get Add_U64).


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D121401/new/

https://reviews.llvm.org/D121401



More information about the llvm-commits mailing list