<br><br><div class="gmail_quote">On Tue, Nov 30, 2010 at 2:44 PM, Chris Lattner <span dir="ltr"><<a href="mailto:clattner@apple.com">clattner@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div style="word-wrap:break-word"><div class="im"><br><div><div>On Nov 30, 2010, at 2:36 PM, Xinliang David Li wrote:</div><br><blockquote type="cite"><span style="border-collapse:separate;font-family:Helvetica;font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;font-size:medium"><blockquote class="gmail_quote" style="margin-top:0px;margin-right:0px;margin-bottom:0px;margin-left:0.8ex;border-left-width:1px;border-left-color:rgb(204, 204, 204);border-left-style:solid;padding-left:1ex">
<div style="word-wrap:break-word"><div><div><blockquote type="cite"><span style="border-collapse:separate;font-family:Helvetica;font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:normal;text-indent:0px;text-transform:none;white-space:normal;word-spacing:0px;font-size:medium"><div>
Yes, global growth limit may be good for size control, but is a hack for control icache footprint. However, as I mentioned, the bottom up inline scheme make it impossible to use any heuristics involving 'global limit' which can be more complicated and fancier than the simple growth limit. For instance, there is no restriction that only one global limit can be used --- the compiler can partition the call graph into multiple locality regions, and set icache limit for each region. The inlining order can be done on a region by region basis. For each region, the region limit is applied and the priority queue must be used.</div>
</span></blockquote><br></div></div><div>Yes, I understand that. But why is a global limit useful? What problem does it solve? If it is to cap "runaway inlining" there are better ways to do it. You agree that it isn't for icache optimization, so what value does it serve?</div>
</div></blockquote><div><br></div><div>I am not trying to defend 'global growth limit' here :) -- I am arguing the usefulness/flexibility of using a priority based inline ordering. do we agree here?</div></span></blockquote>
<br></div></div><div>I think I'm missing something important. :)</div><div><br></div><div>Can you explain the difference between priority based inlining and a global threshold? If you end up inlining everything in your priority queue,</div>
</div></blockquote><div><br></div><div>For large applications, this will never happen.</div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div style="word-wrap:break-word"><div> then order doesn't matter. If you only inline the high priority stuff, then you can handle this with a threshold. Can you give a scenario that can't be handled with bottom-up inlining and a sufficiently smart inlining cost evaluation?</div>
</div></blockquote><div><br></div><div>It is doable in bottom up, but can be complicated. I guess you will also need a priority queue as a side data structure, and you probably also need multiple passes to exhaust the limit. </div>
<div><br></div><div>Another LLVM inliner limitation I won't to point out is that since inline analysis and transformation are done together, any inline decisions can not be undone.</div><div><br></div><div>Thanks,</div>
<div><br>David</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;"><div style="word-wrap:break-word"><div><br></div><font color="#888888"><div>-Chris</div>
</font></div></blockquote></div><br>