[PATCH] D114405: Optimize shift and accumulate pattern in AArch64.
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Nov 23 01:41:43 PST 2021
dmgreen added a comment.
Could this be handled with a tablegen pattern? Something like the or_is_add pattern from X86 (but using haveNoCommonBitsSet). Maybe a PatFrag that accepts an add or an add-like-or?
Also should ssra be treated the same way?
================
Comment at: llvm/test/CodeGen/AArch64/shift-accumulate.ll:12
+ %4 = lshr <8 x i16> %3, <i16 7, i16 7, i16 7, i16 7, i16 7, i16 7, i16 7, i16 7>
+ %5 = or <8 x i16> %4, %3
+ %6 = bitcast <8 x i16> %5 to <16 x i8>
----------------
It's probably a better test to return the %5, not extract a single lane from it.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D114405/new/
https://reviews.llvm.org/D114405
More information about the llvm-commits
mailing list