[PATCH] D36036: Supported interleaved byte load-pattern of stride:4 VF(8, 16, 32).

Farhana Aleen via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jul 28 16:43:26 PDT 2017


Farhana created this revision.
Herald added a subscriber: mzolotukhin.

It looks like we can generalize this stride-4 interleaved pattern. The basic idea is to optimize them as a 128 bit vector so that we can use the vpshuf*. Once we have shuffled in all the interleaved elements we can just keep packing them until we build the vectors of desired elements. Similar way, stores can be handled.


https://reviews.llvm.org/D36036

Files:
  include/llvm/Analysis/VectorUtils.h
  lib/Analysis/VectorUtils.cpp
  lib/Target/X86/X86ISelLowering.h
  lib/Target/X86/X86InterleavedAccess.cpp
  test/CodeGen/X86/x86-interleaved-access.ll
  test/Transforms/InterleavedAccess/X86/interleaved-accesses-8bits.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D36036.108741.patch
Type: text/x-patch
Size: 58426 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170728/3db9410a/attachment.bin>


More information about the llvm-commits mailing list