[PATCH] D131964: [AArch64] Add support to loop vectorization for non temporal loads

Florian Hahn via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Aug 19 11:05:07 PDT 2022


fhahn added inline comments.


================
Comment at: llvm/lib/Target/AArch64/AArch64TargetTransformInfo.h:326
+      unsigned NumElements = DataTypeTy->getNumElements();
+      unsigned EltSize = DataTypeTy->getElementType()->getScalarSizeInBits();
+      return NumElements > 1 && isPowerOf2_64(NumElements) && EltSize >= 8 &&
----------------
Is there much difference between the load and store version? Could they just share the same code?

I think this would also require support for generating `LDNP` for types smaller than 256 bits. @zjaffal  is currently looking into this.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D131964/new/

https://reviews.llvm.org/D131964



More information about the llvm-commits mailing list