[PATCH] D48128: [ARM] Parallel DSP IR Pass

Eli Friedman via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jul 3 11:11:23 PDT 2018


efriedma added a comment.

I think there's a missing correctness check in this patch.  Specifically, ARMParallelDSP::CreateParallelMACPairs doesn't check that the transformed loads are i16 loads, so a pair of i8 loads gets transformed to an i32 load.  This is causing a miscompile on at least one of our internal tests.

I guess the transform is actually still viable for a pair of i8 loads with a slightly different code (we can generate ldrh+sxtb16).


Repository:
  rL LLVM

https://reviews.llvm.org/D48128





More information about the llvm-commits mailing list