<div dir="ltr">On 5 February 2013 11:03, David Tweed <span dir="ltr">&lt;<a href="mailto:david.tweed@arm.com" target="_blank">david.tweed@arm.com</a>&gt;</span> wrote:<br><div class="gmail_extra"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

since the more instructions there<br>

are the more an out-of-order CPU can put them into otherwise unused slots. I<br>

can't think of a way of figuring out such a mapping other than empirically.<br></blockquote><div><br></div><div><br></div><div style>Given how uncertain these out-of-order (OOO) guesses are, I don't think we can get anything worth trying, even empirically. The noise will always outweigh the signal.</div>

<div style><br></div><div style>You can usually save a few cycles on a static micro-benchmark and think you're in control, but you usually don't measure the impact that the same guess has on other benchmarks or on real code. Since this is auto-vectorization, a direct comparison will be hard, and evaluating the overall impact on the rest of the code will be harder still.</div>

<div style><br></div><div style>--renato</div></div></div></div>