[PATCH] D50433: A New Divergence Analysis for LLVM

Fri Aug 24 10:28:52 PDT 2018

simoll added a comment.

In https://reviews.llvm.org/D50433#1212500, @alex-t wrote:

> >> For given branch all the blocks where PHI nodes must be inserted belongs to the branch parent block dominance frontier.
> > 
> > The problem with DF is that it implicitly assumes that there exists a definition at the function entry (see http://www.cs.utexas.edu/~pingali/CS380C/2010/papers/ssaCytron.pdf, comment below last equation of page 17).
> >  So, we would get imprecise results.
>
> I'm not sure that I understand you correct...
>  I still do not get an idea what do you mean by "imprecise results". The assumption in the paper you have referred looks reasonable.
>  Let's say we have 2 disjoint paths from Entry to block X and you have a use of some value V in X.
>
>         Entry
>        /   \
>       A     E
>     /  \     \
>   B     C     F  [v0 = 1]
>     \  /      |
>      D       /
>       \ _   /
>           X   v1 = PHI (1, F, undef, Entry)
>  
>  
>         Entry__
>        /       \
>     A [v1 = 2]  E
>    /  \         |
>   B     C       F  [v0 = 1]
>    \  /      __/
>     D       /
>       \ _  /
>         X v2 = PHI (1, F, 2, A)
>   
>
> Irrelevant of the definition in Entry, divergent branch in Entry makes the PHI in X divergent.
>
> Each of this 2 paths should contain a definition for V. It does not matter in entry block or in one of it's successors.
>  You either have a normal PHI or a PHI with undef incoming value. How to handle undef depends of your decision.
>  You may conservatively consider any undef as divergent. This make your PHI divergent by data-dependency.

The SDA detects re-convergence points of disjoint divergent paths from a branch.
There aren't any real PHI nodes involved.
The PHI nodes are rather a vehicle to demonstrate the reduction to SSA construction.
The incoming values in the reduction are always uniform (x0 = <SomeConstant>).

Let's pretend we actually generated the assignments and we were really construction SSA form.

     Entry
     /   \
    B     C
   /  \    \
  D    E-->F
   \  /   /
    G--->H

B contains a divergent branch. So, we emit the assigments "x = 0" in D and "x = 1" in E.
SSA construction will generate PHI nodes in F, G and H.
However, there aren't two disjoint path from B to F.
The root cause of the "spurious" phi node in F is the implicit definition "undef" at Entry.
In short, vanilla SSA construction gives imprecise result.

> 
> 
>> Control-dependence/PDT does not give you correct results (see my earlier example in https://reviews.llvm.org/D50433#1205934).
> 
> Oh.. :)   Please forget of the set differences that I mentioned. It was mentioned mistakenly.
>  You was concerned about using non-filtered PDFs since they could produce over-conservative results.
>  You probably meant not PDF but iterated PDF  (PDF+)

Yep, my bad. The notation should be:
DF(X)  := the set of all blocks B that X does not dominate but that have a predecessor that X dominates.
PDF(X) := set set of all blocks B that X does not post-dominate but that have a successor that X post-dominates (aka control dependence).
PDF+   := the transitive closure of the post-dominance frontier.
DF+    := the transitive closure of the dominance frontier.

> I meant just PDF - without closure over all predecessors.
> 
>        Entry
>      /       \
>     A_____    E
>    /       \   \
>   B[v1 = 2] C  F  [v0 = 1]
>    \  _____/ _/
>     D       /
>       \ _  /
>         X v2 = PHI (1, F, 2, A)
>    
> 
> PDF(B) = {A}  DF+(B) = {A, Entry}
>  PDF(F) = DF+(F) = {Entry}
> 
> For PHI in X we have 2 source blocks - B and F so we only have to examine branches in A and Entry
>  If the second definition of V was in C instead of F we'd only look at the branch in A.
> 
> For your example with switch:
> 
> +----A----+
> 
> |  |  |
> |
> 
> C0  C1    D
>  \   |    [..]
> 
>   \  |
>     J
> 
> PDF(C0) = {A}
>  PDF(C1) = {A}
> 
> Let's say in J we have v2 = PHI(v0, A. v1, C0)  we should examine A terminator because PDF(C0) = {A}, PDF(A) = {}

Sorry, i am struggeling to follow.
Do you take the union of the PDF(P) for each immediate predecessor P of X? (where X is a potential join point).
That gives you invalid results.

        A
      /   \
     B     C
   /  \   /  \
  D     E     F
   \  /   \  /
     G     H
     \    /
        I

PDF(G) = {E, B}
PDF(H) = {E, C}

PDF(G) join PDF(H) =  {E, B, C}   (where join is set union).
Yet, there are two disjoint paths from A to I. But A is in none of these sets.

Repository:
  rL LLVM

https://reviews.llvm.org/D50433