[llvm] r215343 - In LVI(Lazy Value Info), originally value on a BB can only be caculated once,

Pete Cooper peter_cooper at apple.com
Fri Nov 21 17:36:55 PST 2014


Attached a new diff.

This checks the result of getBlockValue for undefined.  If we get an undefined value then that must be from a block we’re waiting on the information from.  If we try to get it again then we must have a cycle so at that point we break the cycle by moving to overdefined.

Thanks,
Pete

-------------- next part --------------
A non-text attachment was scrubbed...
Name: lvi.diff
Type: application/applefile
Size: 68 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20141121/c9425d0a/attachment.bin>
-------------- next part --------------

> On Nov 21, 2014, at 5:04 PM, Hal Finkel <hfinkel at anl.gov> wrote:
> 
> [+Jiangning, Philip, Nick]
> 
> ----- Original Message -----
>> From: "Pete Cooper" <peter_cooper at apple.com>
>> To: "Hans Wennborg" <hans at chromium.org>
>> Cc: "llvm-commits at cs.uiuc.edu for LLVM" <llvm-commits at cs.uiuc.edu>
>> Sent: Friday, November 21, 2014 6:09:50 PM
>> Subject: Re: [llvm] r215343 - In LVI(Lazy Value Info),	originally value on a BB can only be caculated once,
>> 
>> So, i’ve worked through this in lldb and ultimately the wrong
>> decision is about the value of %inc2 in %for.cond.
>> 
>> When we process '%i.07 = phi i32 [ 1, %for.body.lr.ph ], [ %inc2,
>> %for.cond ]’, we get 1 from the first value, i.e., a constantrange
>> [1,2)
>> We then ask for the result for %inc2 from %for.cond.  As you pointed
>> out, its in the cache so we return its Result (which is still
>> undefined).  We then (in solveBlockValuePHINode), do
>> 'Result.mergeIn(EdgeResult)’ which ignores the undefined RHS and we
>> think the result from our PHI is the constantrange.
>> 
>> So, there’s a couple of things going wrong here.  Firstly, i’m
>> worried that ‘Result.mergeIn’ returned false, but we didn’t check
>> the return value in solveBlockValuePHINode.  I think this is a bug
>> regardless of my patch, although I have no test case.
>> 
>> The other issue, which I think is a result of my patch, is that
>> getEdgeValue should probably return false if the value is undefined
>> as that means we still need to process that edge.  I’m actually
>> getting more and more tempted to stop using undefined in my patch
>> and make an explicit enum value which means ‘processing
>> predecessors’ or at least ‘unknown’ so we know its value is TBD.
>> 
>> What do you think?
>> 
>> Thanks,
>> Pete
>> 
>> 
>>> On Nov 21, 2014, at 3:34 PM, Pete Cooper <peter_cooper at apple.com>
>>> wrote:
>>> 
>>> 
>>>> On Nov 21, 2014, at 3:30 PM, Hans Wennborg <hans at chromium.org>
>>>> wrote:
>>>> 
>>>> On Fri, Nov 21, 2014 at 3:13 PM, Pete Cooper
>>>> <peter_cooper at apple.com> wrote:
>>>>> Thanks Hans!  I was just trying to reduce it.
>>>>> 
>>>>> I’ll take a look now.  Probably something i’ve done is breaking
>>>>> when there’s loops.
>>>> 
>>>> I think the problem is that when we're resetting BBLV to
>>>> Undefined,
>>>> that value goes in the cache, hasBlockValue() is now returning
>>>> true
>>>> for that value, and others might pick it up - and Undefined is the
>>>> wrong value.
>>> Ah, I didn’t know undefined really meant anything to this
>>> algorithm.  I figured it would mean unknown, but if it actually
>>> means undef and that the optimizer can choose a value then i can
>>> see there being a problem.
>>>> 
>>>> What we'd really want to do is to remove BBLV from the cache until
>>>> we're done computing its value, but then we're not terminating
>>>> cycles
>>>> anymore. Hmm.
>>> I think we’ll still terminate cycles because we’ll always
>>> eventually hit a loop pre-header or the start of the function, but
>>> i haven’t thought about it enough to say what the complexity would
>>> be in that case.
>>>> 
>>>> 
>>>>>> On Nov 21, 2014, at 3:12 PM, Hans Wennborg <hans at chromium.org>
>>>>>> wrote:
>>>>>> 
>>>>>> On Thu, Nov 20, 2014 at 7:51 PM, Pete Cooper
>>>>>> <peter_cooper at apple.com> wrote:
>>>>>>> But something is going wrong. I'm attaching a reduction of the
>>>>>>> problem
>>>>>>> I saw. With your patch applied, using "clang -O2 /tmp/a.cc",
>>>>>>> the
>>>>>>> attached program goes into an infinite loop.
>>>>>>> 
>>>>>>> Thanks!  I’ll take a look.
>>>>>> 
>>>>>> I've reduced it (at least a little..) into an IR test
>>>>>> (attached). With
>>>>>> your patch applied, run through opt like so:
>>>>>> 
>>>>>> $ opt -correlated-propagation /tmp/reduced.ll -S
>>>>>> 
>>>>>> Note how the definition of %i.1 changes from
>>>>>> 
>>>>>> %i.1 = phi i32 [ %add, %for.body ], [ %i.07, %if.then ]
>>>>>> 
>>>>>> to
>>>>>> 
>>>>>> %i.1 = phi i32 [ 2, %for.body ], [ 1, %if.then ]
>>>>>> 
>>>>>> which is not correct.
>>>>>> 
>>>>>> - Hans
>>>>>> <reduced.ll>
>>>>> 
>>> 
>>> 
>>> _______________________________________________
>>> llvm-commits mailing list
>>> llvm-commits at cs.uiuc.edu
>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits
>> 
>> 
>> _______________________________________________
>> llvm-commits mailing list
>> llvm-commits at cs.uiuc.edu
>> http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits
>> 
> 
> -- 
> Hal Finkel
> Assistant Computational Scientist
> Leadership Computing Facility
> Argonne National Laboratory



More information about the llvm-commits mailing list