[PATCH] D121401: [AMDGPU] Support gfx940 v_lshl_add_u64 instruction
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 11 12:16:01 PST 2022
rampitec added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/VOP3Instructions.td:689
+ (ThreeOpFrag<shl_0_to_4, add> i64:$src0, i32:$src1, i64:$src2),
+ (V_LSHL_ADD_U64_e64 VSrc_b64:$src0, VSrc_b32:$src1, VSrc_b64:$src2)
+>;
----------------
foad wrote:
> If this is a normal VOP3 then surely src1 can be anything, so why restrict it to immediates 0..4 ? If this is a real restriction on the allowed operands, then you should probably mention that where the instruction is defined.
This is a real limitation:
LSHL_ADD is a 64-bit which left-shifts the result by 0-4 (use “0” to get Add_U64).
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D121401/new/
https://reviews.llvm.org/D121401
More information about the llvm-commits
mailing list