[PATCH] D67392: [ARM][ParallelDSP] Change smlad insertion order
Sam Parker via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Oct 15 07:30:13 PDT 2019
samparker marked 2 inline comments as done.
samparker added inline comments.
================
Comment at: lib/Target/ARM/ARMParallelDSP.cpp:379
+ for (auto Write : Writes) {
+ for (auto Read : Loads) {
MemoryLocation ReadLoc =
----------------
SjoerdMeijer wrote:
> Just curious, why did you change the iteration order, Loads/Writes vs. Writes/Loads?
We know that Loads isn't empty, but Writes maybe so we can skip the loop if it is.
================
Comment at: test/CodeGen/ARM/ParallelDSP/blocks.ll:139
+; scalars.
+; CHECK-LABEL: num_load_limit
+; CHECK: call i32 @llvm.arm.smlad
----------------
SjoerdMeijer wrote:
> Double checking I understand this test: this test has 16 loads, so is within the limit of 16.
> So why don't we generate the 4th SMLAD?
We're within the limit, but it looks like there's some limitation with the search algorithm... the adds are just ordered differently to 'usual'.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D67392/new/
https://reviews.llvm.org/D67392
More information about the llvm-commits
mailing list