[PATCH] D67392: [ARM][ParallelDSP] Change smlad insertion order

Sam Parker via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Oct 15 07:30:13 PDT 2019


samparker marked 2 inline comments as done.
samparker added inline comments.


================
Comment at: lib/Target/ARM/ARMParallelDSP.cpp:379
+  for (auto Write : Writes) {
+    for (auto Read : Loads) {
       MemoryLocation ReadLoc =
----------------
SjoerdMeijer wrote:
> Just curious, why did you change the iteration order, Loads/Writes vs. Writes/Loads? 
We know that Loads isn't empty, but Writes maybe so we can skip the loop if it is.


================
Comment at: test/CodeGen/ARM/ParallelDSP/blocks.ll:139
+; scalars.
+; CHECK-LABEL: num_load_limit
+; CHECK: call i32 @llvm.arm.smlad
----------------
SjoerdMeijer wrote:
> Double checking I understand this test: this test has 16 loads, so is within the limit of 16.
> So why don't we generate the 4th SMLAD?
We're within the limit, but it looks like there's some limitation with the search algorithm... the adds are just ordered differently to 'usual'.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D67392/new/

https://reviews.llvm.org/D67392





More information about the llvm-commits mailing list