[PATCH] D45098: [AArch64] fix PR32384: bump the number of stores per memset/memcpy/memmov

Sebastian Pop via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 30 11:38:42 PDT 2018


sebpop added a comment.

In https://reviews.llvm.org/D45098#1053201, @efriedma wrote:

> Should we check for hasNEON() here?  The generic code doesn't know AArch64 has ldp/stp, so we might want to be a little more aggressive to compensate.


Do you mean something like this, or something else?

  if (Subtarget->hasNEON()) {
    MaxStoresPerMemset = 32;
    MaxStoresPerMemsetOptSize = 8;
    MaxStoresPerMemcpy = 16;
    MaxStoresPerMemcpyOptSize = 4;
    MaxStoresPerMemmove = 16;
    MaxStoresPerMemmoveOptSize = 4;
  } else {
    MaxStoresPerMemset = MaxStoresPerMemsetOptSize = 8;
    MaxStoresPerMemcpy = MaxStoresPerMemcpyOptSize = 4;
    MaxStoresPerMemmove = MaxStoresPerMemmoveOptSize = 4;
  }
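
To make the effect of these limits concrete, here is a rough sketch of how such a threshold decides whether a memset is expanded inline or left as a library call. It is a simplified model, not the LLVM implementation: `MemsetLimits`, `popCount`, `storesNeeded`, and `expandsInline` are hypothetical names, overlapping stores and alignment are ignored, and the store widths (16 bytes with NEON, 8 bytes without) are assumptions made for illustration.

```cpp
#include <cstdint>

// Hypothetical stand-in for the two memset thresholds in the snippet above;
// this is not the LLVM TargetLowering class.
struct MemsetLimits {
  unsigned MaxStores;        // limit when optimizing for speed
  unsigned MaxStoresOptSize; // limit when optimizing for size
};

// Number of set bits in X (recursive so it stays a C++11 constexpr).
constexpr unsigned popCount(uint64_t X) {
  return X == 0 ? 0 : static_cast<unsigned>(X & 1) + popCount(X >> 1);
}

// Simplified model: cover Size bytes with as many MaxWidth-byte stores as
// fit (MaxWidth is assumed to be a power of two), then use one smaller
// power-of-two store per set bit of the remaining tail.
constexpr uint64_t storesNeeded(uint64_t Size, uint64_t MaxWidth) {
  return Size / MaxWidth + popCount(Size % MaxWidth);
}

// True if the memset would be expanded inline under the given limits.
constexpr bool expandsInline(uint64_t Size, uint64_t MaxWidth,
                             MemsetLimits L, bool OptSize) {
  return storesNeeded(Size, MaxWidth) <=
         (OptSize ? L.MaxStoresOptSize : L.MaxStores);
}
```

Under these assumptions, a 256-byte memset needs 16 stores of 16 bytes each, which fits the proposed NEON limit of 32 but not the OptSize limit of 8. Without NEON, a 72-byte memset needs 9 stores of 8 bytes and would exceed the limit of 8.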


https://reviews.llvm.org/D45098




