[PATCH] D45098: [AArch64] fix PR32384: bump the number of stores per memset/memcpy/memmov

Eli Friedman via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 30 12:15:01 PDT 2018


efriedma added a comment.

Wait, never mind: it shouldn't matter whether we have NEON. We probably want to inline roughly the same number of instructions either way, and integer and vector registers have roughly equivalent ldp/stp instructions.

Looking at the generated code a bit, it looks like we do a really terrible job lowering memcpy; we never form ldp/stp at all.  We should probably fix that before we mess with the threshold here, since it could substantially change the code-size/performance impact of this change.
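
For illustration, here is a minimal sketch of the kind of small fixed-size copy this affects. The struct and function names (Block, copy_block) are hypothetical and not taken from the patch or from PR32384. A 32-byte copy like this could in principle be done with a single ldp/stp pair on q registers, or two pairs on x registers; whether the backend actually inlines it, and with which instructions, depends on the threshold and lowering under discussion.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical 32-byte payload, small enough that a backend may
 * choose to expand the memcpy inline instead of calling the library. */
struct Block {
    uint64_t w[4];
};

/* Fixed-size copy: the length is a compile-time constant, so the
 * compiler is free to lower this memcpy to plain loads and stores. */
void copy_block(struct Block *dst, const struct Block *src) {
    memcpy(dst, src, sizeof *dst);
}
```

Compiling something like this for an AArch64 target at -O2 and reading the assembly is one way to check whether paired loads/stores are being formed.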


https://reviews.llvm.org/D45098




