[PATCH] D104236: [AArch64] Add a TableGen pattern to generate uaddlv from uaddlp and addv
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jun 18 03:25:28 PDT 2021
dmgreen added a comment.
> For addp(addlp(x)) ->addv(x), addv genereates i32 output from v4i32 input and i16 output from v4i16 input. However, addp(addlp(x)) generates i64 output from v4i32 input and i32 output from v4i16 input. I think the outputs could not be same with big numbers input between addp(addlp(x)) and addv(x). If I missed something, please let me know.
With types and full opcode names it would be `(i64 addp(v2i64 uaddlp(v4i32 x))) -> (i64 uaddlv(v4i32 x))` or `(i32 addp(v2i32 uaddlp(v4i16 x))) -> (i32 uaddlv(v4i16 x))`
Do you have an example of where you don't think that would work the same?
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D104236/new/
https://reviews.llvm.org/D104236
More information about the llvm-commits
mailing list