[PATCH] D85410: [ARM][MVE] Allow loops containing strides != 1 to be tail predicated with gather/scatters enabled
Anna Welker via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Aug 17 04:21:37 PDT 2020
anwel added inline comments.
================
Comment at: llvm/lib/Target/ARM/ARMTargetTransformInfo.cpp:1620
+ }
+ if (NextStride == 2 && isa<LoadInst>(I)) {
+ LLVM_DEBUG(dbgs() << "Consecutive strides of 2 found, vld2 can't "
----------------
samparker wrote:
> dmgreen wrote:
> > A load or a store can be vld2 or a vst2, neither of which can be tail folded unfortunately.
> else ifs?
Good point.
================
Comment at: llvm/lib/Target/ARM/ARMTargetTransformInfo.cpp:1620
+ }
+ if (NextStride == 2 && isa<LoadInst>(I)) {
+ LLVM_DEBUG(dbgs() << "Consecutive strides of 2 found, vld2 can't "
----------------
anwel wrote:
> samparker wrote:
> > dmgreen wrote:
> > > A load or a store can be vld2 or a vst2, neither of which can be tail folded unfortunately.
> > else ifs?
> Good point.
Right, I forgot about the `vstr2`'s. In that case we can never allow a stride of 2 here, as the only instructions that get us here are loads and stores.
================
Comment at: llvm/lib/Target/ARM/ARMTargetTransformInfo.cpp:1625
+ }
+ if (EnableMaskedGatherScatters) {
+ if (NextStride > 0) {
----------------
samparker wrote:
> I think this would be easier to read if this was organised in stride order and separating the gather/scatter from the consecutive accesses. So when !EnableMaskedGatherScatter, getPtrStride should only ever be 1, right? So I don't think we have to track the 'NextStride' business.
Good point. We may as well get rid of that.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D85410/new/
https://reviews.llvm.org/D85410
More information about the llvm-commits
mailing list