<div dir="ltr">On 27 June 2013 17:05, Tobias Grosser <span dir="ltr"><<a href="mailto:tobias@grosser.es" target="_blank">tobias@grosser.es</a>></span> wrote:<br><div class="gmail_extra"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<div class="HOEnZb"><div class="h5"><span style="color:rgb(34,34,34)">We are looking for a good</span> <span style="color:rgb(34,34,34)">way/value to show the reliability of individual results in the UI. Do you have some experience, what a good measure of the reliability of test results is?</span></div>

</div></blockquote><div></div></div><br></div><div class="gmail_extra">Hi Tobi,</div><div class="gmail_extra"><br></div><div class="gmail_extra">I had a look at this a while ago, but never got around to actually work on it. My idea was to never use point-changes as indication of progress/regressions, unless there was a significant change (2/3 sigma). What we should do is to compare the current moving-average with the past moving averages (of K runs) with both last-avg and the (N-K)th moving-average (to make sure previous values included in the current moving average are not toning it down/up), and keep the biggest difference as the final result.</div>

<div class="gmail_extra"><br></div><div class="gmail_extra">We should also compare the current mov-avg with M non-overlapping mov-avgs before, and calculate if we're monotonically increasing, decreasing or if there is a difference of 2/3 sigma between the current mov-avg (N) and the (N-M)th mov-avg. That would give us an idea on the trends of each test.</div>

<div class="gmail_extra"><br></div><div class="gmail_extra">cheers,</div><div class="gmail_extra">--renato</div></div>