[LLVMdev] Parallel Loop Metadata

Fri Feb 8 06:26:32 PST 2013

On Fri, Feb 8, 2013 at 9:16 AM, Pekka Jääskeläinen
<pekka.jaaskelainen at tut.fi> wrote:
> Hi Renato,
>
>
> On 02/08/2013 03:07 PM, Renato Golin wrote:
>>
>> In this case, I'd prefer metadata on the variables that are assumed not
>> to alias, like the restrict keyword.
>
>>
>>
>> It seems to me that having metadata on the loop basic blocks, since they
>> can be invalidated, will not help that much with the vectorizer more
>> than specific annotation on specific values (which are harder to lose).
>> I'm not saying we should annotate *all* memory instructions on a loop,
>> just the ones that make sense, or will help the vectorizer default to a
>> sane value.
>
>
> This is an interesting alternative! Do you mean that we would still add
> the llvm.mem.parallel_loop_access metadata, but only to such mem accesses
> that are assumed to be "hard or impossible to analyze" (to prove to be no
> alias cases)? Then we'd forget about the "parallel loop metadata" as is.
>
> Then we would rely on the regular loop carried dependency analyzer by
> default, but let those (mem) annotations just *help* in the "tricky cases".
>
> The llvm.mem.parallel_loop_access metadata would only communicate "this
> instruction does not alias with any other similarly annotated instruction
> from any other iteration in this loop".
>
> Quickly thinking, this might work and might not loose the
> parallelism info too easily. Anyways, the info still has to be
> connected to a loop to avoid breakup in inlining, multi-level loops, etc.
>
> Summarizing, the new metadata would be:
>
> llvm.loop:
> Just to mark a loop (points to a unique id metadata).
>
> llvm.mem.parallel_loop_access:
> The above mentioned new semantics, connected to the llvm.loop's id metadata.

How does this not require you to mark all the possible alias pairs in practice?

IE
Given memory instructions A, B, C, and D, what do you think makes the
(A,B) hard to analyze (and thus you'd need to mark A and B with this
new metadata) that doesn't also make (A, C) hard to analyze?  Is it
not usually the case that it is *A* itself, that is hard to analyze
(because of some property of the memory access), rather than any
particular pair?

I'd love to see example cases where the pair analysis is the
difficulty, rather than the access analysis of any single memory piece
being the difficulty.