[llvm] r194527 - delinearization of arrays

Wed Nov 13 13:05:00 PST 2013

----- Original Message -----
> Hi,
> 
> Hal Finkel wrote:
> > ----- Original Message -----
> > > 
> > > On Tue, Nov 12, 2013 at 2:47 PM, Sebastian Pop <
> > > spop at codeaurora.org
> > > > wrote:
> > > 
> > > 
> > > delinearization of arrays
> > > This seems like a really underwhelming commit log... ;] There is
> > > quite a lot going on here, and it would be helpful to give an
> > > overview for those who haven't followed the entire review of this
> > > work.
> > > 
> 
> The core of the patch adds one function to the SCEV tools:
> delinearize.  That
> allows the delinearization of a SCEV representing a memory access
> function.
> 
> For example: when analyzing the memory access A[i][j][k] in this loop
> nest
> 
> +; void foo(long n, long m, long o, double A[n][m][o]) {
> +;
> +;   for (long i = 0; i < n; i++)
> +;     for (long j = 0; j < m; j++)
> +;       for (long k = 0; k < o; k++)
> +;         A[i][j][k] = 1.0;
> +; }
> 
> the delinearization input is the following AddRec SCEV:
> 
> +; AddRec: {{{%A,+,(8 * %m * %o)}<%for.i>,+,(8 *
> %o)}<%for.j>,+,8}<%for.k>
> 
> From this SCEV, we are able to say that the base offset of the access
> is %A
> because it appears as an offset that does not divide any of the
> strides in the
> loops:
> 
> +; CHECK: Base offset: %A
> 
> and then delinearize determines the size of some of the dimensions of
> the array
> as these are the multiples by which the strides are happening:
> 
> +; CHECK: ArrayDecl[UnknownSize][%m][%o] with elements of
> sizeof(double) bytes.
> 
> Note that the outermost dimension remains of UnknownSize because
> there are no
> strides that would help identifying the size of the last dimension:
> when the
> array has been statically allocated, one could compute the size of
> that
> dimension by dividing the overall size of the array by the size of
> the known
> dimensions: %m * %o * 8.
> 
> Finally delinearize provides the access functions for the array
> reference that
> does correspond to A[i][j][k] of the above C testcase:
> 
> +; CHECK:
> ArrayRef[{0,+,1}<nuw><nsw><%for.i>][{0,+,1}<nuw><nsw><%for.j>][{0,+,1}<nuw><nsw><%for.k>]
> 
> The testcases are checking the output of a function pass:
> DelinearizationPass
> that walks through all loads and stores of a function asking for the
> SCEV of the
> memory access with respect to all enclosing loops, calling
> SCEV->delinearize on
> that and printing the results.
> 
> The patch also calls SCEV->delinearize within Preston's classical
> data
> dependence analysis.  The flag controlling the call to delinearize
> from the data
> dependence analysis "-da-delinearize" is turned off by default for
> now.
> Attached a patch that shows the results of the dependence analysis
> with
> -da-delinearize turned on. There are more MIV tests (Multiple
> Induction
> Variables dependence test) that can be disambiguated as carrying no
> dependence
> (as SCEV->delinearize translates MIV tests into SIV tests (Single IV
> tests)):
> 
> diff --git a/test/Analysis/DependenceAnalysis/Banerjee.ll
> b/test/Analysis/DependenceAnalysis/Banerjee.ll
> index 09e8fd2..65a1f68 100644
> --- a/test/Analysis/DependenceAnalysis/Banerjee.ll
> +++ b/test/Analysis/DependenceAnalysis/Banerjee.ll
> @@ -490,7 +490,7 @@ entry:
>    br label %for.cond1.preheader
>  
>  ; CHECK: 'Dependence Analysis' for function 'banerjee9':
> -; CHECK: da analyze - output [* *]!
> +; CHECK: da analyze - none!
>  ; CHECK: da analyze - flow [<= =|<]!
>  ; CHECK: da analyze - confused!
>  ; CHECK: da analyze - none!
> diff --git a/test/Analysis/DependenceAnalysis/GCD.ll
> b/test/Analysis/DependenceAnalysis/GCD.ll
> index bb31d11..18942aa 100644
> --- a/test/Analysis/DependenceAnalysis/GCD.ll
> +++ b/test/Analysis/DependenceAnalysis/GCD.ll
> @@ -18,7 +18,7 @@ entry:
>  ; CHECK: da analyze - output [* *]!
>  ; CHECK: da analyze - flow [=> *|<]!
>  ; CHECK: da analyze - confused!
> -; CHECK: da analyze - input [* *]!
> +; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
>  ; CHECK: da analyze - none!
>  
> @@ -71,7 +71,7 @@ entry:
>  ; CHECK: da analyze - output [* *]!
>  ; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
> -; CHECK: da analyze - input [* *]!
> +; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
>  ; CHECK: da analyze - none!
>  
> @@ -125,7 +125,7 @@ entry:
>  ; CHECK: da analyze - output [* *]!
>  ; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
> -; CHECK: da analyze - input [* *]!
> +; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
>  ; CHECK: da analyze - none!
>  
> @@ -353,10 +353,10 @@ entry:
>    br i1 %cmp4, label %for.cond1.preheader.preheader, label
>    %for.end12
>  
>  ; CHECK: 'Dependence Analysis' for function 'gcd6'
> -; CHECK: da analyze - output [* *]!
> +; CHECK: da analyze - flow [0 =>|<]!
>  ; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
> -; CHECK: da analyze - input [* *]!
> +; CHECK: da analyze - none!
>  ; CHECK: da analyze - confused!
>  ; CHECK: da analyze - output [* *]!
> 
> 
> I'm not sure why we have this change:
> -; CHECK: da analyze - output [* *]!
> +; CHECK: da analyze - flow [0 =>|<]!
> it looks like an error... although, I have not investigated why this
> happens.
> 
> > 
> > Yea ;)
> > 
> > Also, please comment on the current state of any known correctness
> > issues. In
> > response to one of Arnold's comments about dealing with
> > inner-dimension-overflow cases, you had said, "Right.  It looks
> > like the data
> > dependence analysis needs more code to support the contiguous
> > memory
> > semantics...", and I'd like to have a clear understanding on the
> > resolution to
> > that.
> 
> The data dependence analysis has to bound each array access dimension
> by the
> number of elements that the delinearization has determined: i.e., for
> a
> delinearized access of the form (same C testcase as above)
> 
> > +; AddRec: {{{%A,+,(8 * %m * %o)}<%for.i>,+,(8 *
> > %o)}<%for.j>,+,8}<%for.k>
> > +; CHECK: Base offset: %A
> > +; CHECK: ArrayDecl[UnknownSize][%m][%o] with elements of
> > sizeof(double) bytes.
> > +; CHECK:
> > ArrayRef[{0,+,1}<nuw><nsw><%for.i>][{0,+,1}<nuw><nsw><%for.j>][{0,+,1}<nuw><nsw><%for.k>]
> 
> We will have to add to the data dependence test these two
> constraints:
> "{0,+,1}<nuw><nsw><%for.k> < %o"
> "{0,+,1}<nuw><nsw><%for.j> < %m"
> to avoid having memory accesses overflow from one dimension into
> another.
> This is related to the problem of determining the existence of data
> dependences
> in array accesses using a different number of subscripts: in C one
> can decide to
> access an array A[100][100]; as A[0][9999] or *A[9999], etc.
> I am currently not sure how to add these constraints in the classical
> data
> dependence test in lib/Analysis/DependenceAnalysis.cpp.

Okay; for affine recurrences, checking the starting and ending value is sufficient, and I think something that SCEV can already do. Please place a FIXME in the DA code about this, and file a PR (adding me to the CC list). I think that this will need to be fixed before we can enable delinearization in DA by default.

Thanks again,
Hal

> 
> On the other hand, adding these constraints to Polly is a piece of
> cake: you
> just drop the extra constraints in the bag of constraints that Polly
> uses to
> formulate the dependence test, and then ISL does the rest.  I will
> post a patch
> that integrates SCEV->delinearize in Polly.
> 
> > > (And also, awesome to see all of that work paying off!)
> 
> SCEV->delinearize will be beneficial in the context of data
> dependence tests in
> multiple nested loops. LLVM would need a loop nest optimizer, a
> vectorizer, or a
> parallelizer on top of lib/Analysis/DependenceAnalysis.cpp (not sure
> how long
> that would take).
> 
> On the other hand, Polly would be the first to see the benefits from
> SCEV->delinearize: I hope that Polly will start transforming most of
> the
> test-suite/pollybench.
> 
> Sebastian
> --
> Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum,
> hosted by The Linux Foundation
> 

-- 
Hal Finkel
Assistant Computational Scientist
Leadership Computing Facility
Argonne National Laboratory