<div dir="ltr">Is the machine running any jobs in parallel? Would it be worth running lit in the loop rather than the bare script (perhaps lit is doing something interesting)? Or maybe the full test run from ninja, though I appreciate that is expensive.<br><br>Are there other PPC bots? Any idea whether they are experiencing this failure?<br><br>There are also other tests that do similar mkdir/symlink things, I think, yet they are not failing. Maybe they do it in a slightly different manner?</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Sep 3, 2020 at 5:03 AM Nemanja Ivanovic &lt;<a href="mailto:nemanja.i.ibm@gmail.com">nemanja.i.ibm@gmail.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div>Sure.</div><div>I didn't use lit or ninja. I simply copied the script produced by lit (/home/buildbots/ppc64le-clang-multistage-test/clang-ppc64le-multistage/stage1/tools/clang/test/Driver/Output/target-override.c.script) into a temporary directory (along with a deep copy of the build directory). I modified the paths in the script to point to the temporary directory.<br></div><div>Then I ran the script in a loop.</div><div>For running a bunch in parallel, I just produced a wrapper script to invoke that one:</div><div><span style="font-family:monospace">target-override.c.script $LINENO &amp;<br>target-override.c.script $LINENO &amp;<br>target-override.c.script $LINENO &amp;</span></div><div><span style="font-family:monospace">...</span></div><div><span style="font-family:monospace">wait</span></div><div>And ran that in a loop. 
For thousands of iterations...<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Sep 2, 2020 at 3:51 PM David Blaikie &lt;<a href="mailto:dblaikie@gmail.com" target="_blank">dblaikie@gmail.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Thanks for looking into it!<br><br>Could you describe your test process in more detail? Were you running lit from your script? Running the build system (ninja?)? </div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Sep 2, 2020 at 10:47 AM Nemanja Ivanovic &lt;<a href="mailto:nemanja.i.ibm@gmail.com" target="_blank">nemanja.i.ibm@gmail.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div>Well, I am at my wit's end. I have copied over the script and directories for this test case and run it a few million times. First I was running one at a time, then I switched to kicking off 1000 at a time. All the while, the bots continued to run on the same machine. The script never failed even once. I am not sure if this has something to do with Python as part of llvm-lit or what is going on.</div><div>I am thinking that the best course of action for us is to mark this test case UNSUPPORTED for PPC.<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Sep 2, 2020 at 12:41 PM Nemanja Ivanovic via llvm-dev &lt;<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr" style="font-family:Arial,Helvetica,sans-serif;font-size:10pt"><div dir="ltr">Interesting, thanks for bringing this to our attention. 
I just took a quick look through the last 100 builds and this test has failed 13 times. This is certainly something we need to look at. We will investigate and see if we can make any sense of this.</div>
<div dir="ltr"> </div>
<div dir="ltr">Nemanja Ivanovic<br>LLVM PPC Backend Development<br>IBM Toronto Lab<br>Email: <a href="mailto:nemanjai@ca.ibm.com" target="_blank">nemanjai@ca.ibm.com</a><br>Phone: 905-413-3388</div>
<blockquote dir="ltr" style="border-left:2px solid rgb(170,170,170);margin-left:5px;padding-left:5px;direction:ltr;margin-right:0px">----- Original message -----<br>From: David Blaikie &lt;<a href="mailto:dblaikie@gmail.com" target="_blank">dblaikie@gmail.com</a>&gt;<br>To: llvm-dev &lt;<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>&gt;, Nico Weber &lt;<a href="mailto:thakis@chromium.org" target="_blank">thakis@chromium.org</a>&gt;, Serge Pavlov &lt;<a href="mailto:sepavloff@gmail.com" target="_blank">sepavloff@gmail.com</a>&gt;, <a href="mailto:powerllvm@ca.ibm.com" target="_blank">powerllvm@ca.ibm.com</a><br>Cc:<br>Subject: [EXTERNAL] Flakey failure on clang-ppc64le-linux-multistage<br>Date: Tue, Sep 1, 2020 6:10 PM<br>
<div dir="ltr">Seems there were a couple of correlated failures that appear to be flakes on this buildbot recently:<br><br>green: <a href="http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13974" target="_blank">http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13974</a><br>red: <a href="http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13975" target="_blank">http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13975</a> (target-override.c during stage 1, seems to be missing the directory/symlink it just created)<br>red: <a href="http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13976" target="_blank">http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13976</a> (same test failure as the last, but during stage 2, not stage 1)<br>green: <a href="http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13977" target="_blank">http://lab.llvm.org:8011/builders/clang-ppc64le-linux-multistage/builds/13977</a><br><br>Including Nico &amp; Pavlov as the people who wrote/edited the test, but I'm guessing this is something interesting going on with the buildbot itself?<br><br><a href="mailto:powerllvm@ca.ibm.com" target="_blank">powerllvm@ca.ibm.com</a>, whoever you are on the end of that mailing list - could you take a look at this? Possibly by manually running that test in a loop a bunch of times to see whether it fails sometimes, and to help us understand why?</div></blockquote>
<div dir="ltr"> </div></div><br>
</blockquote></div>
</blockquote></div>
</blockquote></div>
</blockquote></div>
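<div dir="ltr">For anyone who wants to repeat the experiment described in the thread, here is a minimal sketch of the same idea: launch a batch of copies of one command at once, wait for all of them, and repeat for many rounds. It is not the wrapper script used on the bot. The names <span style="font-family:monospace">run_once</span>, <span style="font-family:monospace">stress</span> and <span style="font-family:monospace">report</span> are hypothetical, and it assumes Python 3.7 or newer. Like the original wrapper, it runs the bare script outside lit, which per the thread never reproduced the failure.</div>

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_once(cmd):
    """Run cmd once; return (exit code, combined stdout and stderr)."""
    proc = subprocess.run(
        cmd, stdout=subprocess.PIPE, stderr=subprocess.STDOUT, text=True
    )
    return proc.returncode, proc.stdout


def stress(cmd, rounds, parallel):
    """Run `parallel` copies of cmd at once, `rounds` times over.

    Returns a list of (round number, exit code, output) for each failed run.
    """
    failures = []
    with ThreadPoolExecutor(max_workers=parallel) as pool:
        for round_no in range(rounds):
            # One batch per round: start all copies, then wait for all of
            # them, mirroring the `cmd & cmd & ... wait` wrapper.
            results = list(pool.map(run_once, [cmd] * parallel))
            for code, output in results:
                if code != 0:
                    failures.append((round_no, code, output))
    return failures


def report(failures, stream=sys.stderr):
    """Print one entry per failed run so flaky output is not lost."""
    for round_no, code, output in failures:
        print(f"round {round_no}: exit {code}\n{output}", file=stream)


# Hypothetical usage, with the script path adjusted to a local copy:
# report(stress(["./target-override.c.script"], rounds=1000, parallel=8))
```

<div dir="ltr">Capturing the output of every failed run matters here, since the interesting evidence is which directory or symlink the script reports as missing.</div>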