[PATCH] Vectorizing Global Structures - Take 2

Thu Feb 14 20:21:55 PST 2013

On Thu, Feb 14, 2013 at 3:13 PM, Arnold Schwaighofer <
aschwaighofer at apple.com> wrote:

>
> On Feb 14, 2013, at 1:55 PM, Daniel Berlin <dberlin at dberlin.org> wrote:
>
>
>
> On Thu, Feb 14, 2013 at 2:49 PM, Arnold Schwaighofer <
> aschwaighofer at apple.com> wrote:
>
>>
>> On Feb 14, 2013, at 12:34 PM, Renato Golin <renato.golin at linaro.org>
>> wrote:
>>
>> On 14 February 2013 17:46, Arnold Schwaighofer <aschwaighofer at apple.com>wrote:
>>
>>> The existing implementation already relies on runtime checks (it has to
>>> make sure that an unknown object and a known object do not overlap). Yes,
>>> AA will conservatively return MayAlias/PartialAlias if it does not know two
>>> objects. You just have to make sure that you actually query it with that
>>> unknown object.
>>>
>>
>> I was expecting this, and I'm planning to be extra conservative to begin
>> with. So, for now, MayAlias and PartialAlias are reasons to stop trying.
>> Once we can get the code to understand basic independent objects inside a
>> structure, we can specialize for the other cases.
>>
>>
>> Okay. But understand that LLVM IR semantics does not give you much leeway
>> in "understanding independent objects inside a structure". That is what you
>> need TBAA for.
>>
>
> TBAA is not a good answer to this general problem.  It falls down pretty
> quickly when you start to ask it about fields inside structures.
> Proper structure aliasing (IE aliasing on pieces of structs), including
> pointer analysis, in languages where things can point to fields inside
> structures (like GEP does) is not only possible to do on LLVM IR, it's
> actually been done before :)
>
> See, e.g, http://www.cs.ucsb.edu/~benh/research/downloads.html (The field
> sensitive version, and the flow sensitive/field sensitive versions).
> Both were done on earlier versions of LLVM, but could be made to work
> today.
>
>
>
>
>
> What I meant by saying "understanding independent objects inside a
> structure*"* to is that just by looking at an llvm type say for example
> struct { int A[100]; int B[100]} you and seeing two gep accesses that index
> into the first and second field of the type you can not necessarily say
> that the access is not overlapping without fully evaluating the address
> computation:
>

Yes, you need to fully evaluate it, but that's not really related to TBAA :)

>
> struct { int A[100]; int B[100]} S;
>
> ptr1 = getelementpr S, 0, 1, x
> ptr2  = getelementpr S, 0, 0, y
>
> ptr1 and ptr2 mayalias for some x and y (for example x = 0 and y = 100,
> the ir makes no guarantee that you cannot run over the length of an array
> within a struct as long as you are within the struct,
> http://llvm.org/docs/GetElementPtr.html#what-happens-if-an-array-index-is-out-of-bounds
> )
>
Yup.

>
> If you can symbolically reason about x and y - the full address
> computation then yes you can make more concise statements.
>
> Correct me if I am wrong but this is how it understand the semantics of a
> gep. I would love to be able to treat a type like a tree :).
>
> I have not had a look at what ben hardekopf is doing. Does he handle the
> case I mention above correctly?
>

Yes.  Hardekopf's work is an extension of work done by David Pearce, who I
worked with to get this right in all the weird edge cases, and implemented
in GCC.  See http://dl.acm.org/citation.cfm?id=1290524

(David made this work, Ben Hardekopf made it work *really fast*)

Even though C/C++ does not legally allow to happen what LLVM IR does, it
was common enough that we had to assume it can anyway.

>
> I know that the existing TBAA can not handle cases with struct fields, but
> an improved one might.
>

TBAA is type based, and language specific.   At least in C/C++/most other
languages i'm aware of, the TBAA tree will always say the above accesses
can alias because they validly point to accesses of the same type.

your example above is the C equivalent of

struct a {
int b[100];
int c[100];
};

struct a foo;
int (*x[100]) = &foo.b;
int (*y[100]) = &foo.c;

TBAA will *never* say that *x and *y are no-alias, because they are both
validly access an int *.
The same is true for

int *z1 = *x;
int *z2 = *y;

TBAA will never say *z1 and *z2 are no-alias, because they both validly
access ints.

So i can't see how you could extend TBAA to account for your example, since
it's only about types, and both of these are the same type, but maybe i'm
missing something?
If you mean you could make TBAA evaluate address computations and calculate
whether these the resulting addresses ever access the same objects in
memory, this isn't TBAA anymore, that is pointer analysis.  At least for
inclusion based versions, it's O(N^3) worst case :P

In any case the point of field sensitive pointer analysis above is
*exactly* to say "these both point to different fields in the structure,
and the bit size of the access guarantees they will never overrun".

In the case of pointer loops that iterate over your example type, it will
properly discover that it overruns into the next field, and properly mark
it as pointing-to that field as well.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20130214/4848b0a5/attachment.html>