[llvm-dev] Data structure improvement for the SLP vectorizer

Keno Fischer via llvm-dev llvm-dev at lists.llvm.org
Thu Mar 16 05:41:18 PDT 2017


On Thu, Mar 16, 2017 at 1:11 AM, Shahid, Asghar-ahmad
<Asghar-ahmad.Shahid at amd.com> wrote:
>
> Here, %load should be 4 element load instead of 2 and you can still do the required broadcast
> With above shuffle. Why this is important is that with our scheme of having a DAG with masks on
> Edges to a single tree node, generating a vector load of lesser length than the chosen vector factor
> Will be problematic.

Could you elaborate why you think this is? There doesn't seem a
problem to me of having on 2-element bundle and then putting (0,1,0,1)
on the edge to a 4-element, bundle, but I may be missing something.

Thanks,
Keno


More information about the llvm-dev mailing list