<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><br><div><div>On Apr 17, 2012, at 3:29 PM, Chandler Carruth wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div class="gmail_quote">On Wed, Apr 18, 2012 at 12:22 AM, Jakob Stoklund Olesen <span dir="ltr"><<a href="mailto:stoklund@2pi.dk">stoklund@2pi.dk</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0.8ex; border-left-width: 1px; border-left-color: rgb(204, 204, 204); border-left-style: solid; padding-left: 1ex; position: static; z-index: auto; ">
<div>I tried disabling just the InstCombine transforms that hide shl instructions behind bitmasks. Even though DAGCombine has the same transforms, it causes some pretty bad regressions:</div><div></div></blockquote></div>
<br><div>I wonder about your idea of still combining most of the shifts, but leaving the last bits uncombined... Certainly the combining effects seem particularly important to capture because we'll only look through a finite number of shifts to determine demanded bits, etc.</div></blockquote><div><br></div><div>Yes, the idea is to only preserve shl instructions because of their arithmetic/logical duality. I'll keep folding ashr/lshr.</div><div><br></div><div>These transformations all still run:</div><div><br></div><div> X << C1 >> C2 --> X << (C1-C2) & Mask</div><div><div> X << C1 >> C2 --> X >> (C2-C1) & Mask</div><div><div> X << C1 << C2 --> X << (C1+C2)</div></div><div><div><div> X >> C1 >> C2 --> X >> (C1+C2)</div></div></div><div><br></div><div>I also allow folding of exact right shifts:</div><div><br></div><div> X >>exact C1 <<nuw C2 --> X <<nuw (C2-C1)</div><div><div> X >>exact C1 << C2 --> X >>exact (C1-C2)</div></div><div><br></div><div>Since a bit mask isn't needed, scalar evolution should be able to reason about this.</div><div><br></div><div>Long chains of shifts should still be collapsed this way.</div><div><br></div></div><blockquote type="cite"><div>But this is all theorizing. =] The actually regressions will likely tell more.</div>
</blockquote></div><br><div>Yeah, all of the regressions I posted actually had identical assembly. I'll try measuring again while holding my breath.</div><div><br></div><div>/jakob</div><div><br></div></body></html>