<div dir="ltr"><br><br><div class="gmail_quote"><div dir="ltr">On Fri, Sep 1, 2017 at 11:34 AM Grang, Mandeep Singh <<a href="mailto:mgrang@codeaurora.org">mgrang@codeaurora.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Thanks David and Diana for your suggestions. Yes, I am looking at<br>
setting up the builder to run after every 10 commits (is 10 a reasonable<br>
number?) and notify the blame list in case of failures.<br></blockquote><div><br>Mostly you probably just end up having the bot run as fast as possible - I think the default configuration maybe has a time based delay so it doesn't fire off on the first commit after a quiet period, but if nothing else comes in for a few minutes, it goes off and runs on a single commit rather than sitting idle.<br><br>I guess you'd find more in the zorg repository.<br><br>But please keep an eye on how reliable/actionable the emails are - if developers aren't responding to/fixing issues, or if the bot is sending fail mail for other uninteresting things (like build failures that other buildbots alerady diagnosed) - please tweak/tune or disable the buildbot. I know there's already a lot of buildbot email spam, but everything we can do to reduce the noise is really important.<br> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
My plan is to enable this once all the current failures in the reverse<br>
builder are fixed (currently there are 4 failures).<br>
<br>
As for the bit-by-bit comparison of forward vs reverse builders is<br>
concerned I am trying to convince my team to dedicate some resources to<br>
this. Not sure how soon I can get this done :)<br>
<br>
Also thanks Diana for your ideas on how to debug/fix reverse iteration<br>
failures. Actually I have been following a similar strategy to fix these<br>
issues so far. Maybe I will update the community on this in a future<br>
email thread.<br>
<br>
--Mandeep<br>
<br>
<br>
On 8/31/2017 3:20 AM, Diana Picus wrote:<br>
> On 30 August 2017 at 18:51, David Blaikie via llvm-dev<br>
> <<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>> wrote:<br>
>><br>
>> On Tue, Aug 29, 2017 at 11:45 AM Grang, Mandeep Singh via cfe-dev<br>
>> <<a href="mailto:cfe-dev@lists.llvm.org" target="_blank">cfe-dev@lists.llvm.org</a>> wrote:<br>
>>> Hi All,<br>
>>><br>
>>> I wanted to share a couple of updates on the effort to uncover<br>
>>> non-determinism in LLVM through reverse iteration.<br>
>>><br>
>>> 1. Reverse iteration has now been enabled for DenseMap<br>
>>> (<a href="https://reviews.llvm.org/D35043" rel="noreferrer" target="_blank">https://reviews.llvm.org/D35043</a>)<br>
>>><br>
>>> 2. We have setup a nightly reverse iteration buildbot<br>
>>> (<a href="http://lab.llvm.org:8011/builders/reverse-iteration" rel="noreferrer" target="_blank">http://lab.llvm.org:8011/builders/reverse-iteration</a>).<br>
>>> This builds all LLVM targets with reverse iteration ON and runs ninja<br>
>>> check-all. Currently there are 14 unit test failures. Please feel free<br>
>>> to fix these.<br>
>>><br>
>>> Also currently, only I receive the nightly email notification for this<br>
>>> buildbot run. My plan is to enable sending the nightly notifications to<br>
>>> llvm-commits once all 14 failures have been resolved.<br>
>>> Please let me know if the community wants the nightly notifications even<br>
>>> with the failures.<br>
>>> As a potential next step, I was thinking about bootstrapping this<br>
>>> reverse iteration LLVM to compile itself. Not sure if it can uncover<br>
>>> more bugs but maybe worth a shot.<br>
>><br>
>> To uncover bugs in this configuration, I believe you'd want/need a<br>
>> stage2/stage3 comparison which might be a bit tricky/expensive*, something<br>
>> like:<br>
>><br>
>> build clang twice (reverse and forward enabled) then build (in one mode,<br>
>> doesn't matter which I think) clang or other release binaries (or even the<br>
>> whole release) from each of those and compare them bit-for-bit, they should<br>
>> be identical.<br>
>><br>
>> * If you want other developers to act on bugs found, the buildbot needs to<br>
>> have a short blame list (this can be done on a slow buildbot by having<br>
>> multiple slaves/builders running in parallel) but preferably also a short<br>
>> cycle time (so failures are reported soon after they're created) - otherwise<br>
>> expect to do a lot of triage yourself (& possibly leave the emails only<br>
>> going to you - because they'll have too large blame lists/revision ranges<br>
>> and people won't find them actionable) & then probably following up on the<br>
>> specific commit you believe introduced the problem and either fixing it<br>
>> yourself or replying on the commits list to report it to the original<br>
>> contributor.<br>
>><br>
> I agree with what David said here, but I just wanted to say that you<br>
> shouldn't feel too discouraged because of it.<br>
><br>
> As someone that occasionally has to bisect 5h+ worth of revisions, I<br>
> can tell you that in time you'll often be able to just look at the<br>
> revisions and spot the culprit, or maybe 2-3 candidates that have<br>
> likely caused the issue. Given that this bot does something very<br>
> specific, you can then probably just inspect the code and see what<br>
> caused the problem (if the revision doesn't touch any containers, then<br>
> it probably didn't cause the issue, right?). It's a lot easier when<br>
> you have a revision range, so it obviously won't take as long to<br>
> identify and fix as the initial failures that you are seeing now.<br>
><br>
> Ultimately, it's up to you to decide how much effort you are willing /<br>
> able to put into this. This kind of failures probably won't even occur<br>
> that often in practice, but when they do I think it's important to<br>
> find them and fix them. The best way to know for sure is to give it a<br>
> try for a while and see how it goes. If you find that it's<br>
> impractical, you can always revert to the current configuration.<br>
><br>
>>><br>
>>> All comments/suggestions welcome.<br>
>>><br>
>>> Thanks,<br>
>>> Mandeep<br>
>>> _______________________________________________<br>
>>> cfe-dev mailing list<br>
>>> <a href="mailto:cfe-dev@lists.llvm.org" target="_blank">cfe-dev@lists.llvm.org</a><br>
>>> <a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev</a><br>
>><br>
>> _______________________________________________<br>
>> LLVM Developers mailing list<br>
>> <a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a><br>
>> <a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br>
>><br>
<br>
</blockquote></div></div>