<div dir="ltr">Ah, this is a file that takes <100ms to run in release mode.<div>Yeah, gonna be hard to beat that.</div><div>You can see what the time is without any use optimization by using -memssa-check-limit=0 (but again, you have to use getClobberingMemoryAccess as you are now to get sane results).</div><div><br></div><div>It looks like all the build time is in use opt.</div><div>Better alias caching would help with this.</div><div><br></div><div>Profiling is hard, it takes so little time instruments on my mac won't profile it :)</div><div><br></div><div><br></div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Aug 22, 2016 at 3:00 PM, Geoff Berry <span dir="ltr"><<a href="mailto:gberry@codeaurora.org" target="_blank">gberry@codeaurora.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div bgcolor="#FFFFFF" text="#000000">
<p>Sure, here is the .bc file for z44.c:</p><div><div class="h5">
<p><br>
</p>
<br>
<div>On 8/22/2016 5:39 PM, Daniel Berlin
wrote:<br>
</div>
<blockquote type="cite">
<div dir="ltr">Can you send me bc files so i can profile?<br>
<div>We haven't really begun to speed anything up yet :)<br>
</div>
<div><br>
</div>
<div>We also could, if it really mattered, turn off use
optimization, which is usually the slow part of building.</div>
<div><br>
</div>
<div>It would require updating defining accesses in the walker
(otherwise, it will slow down every query that asks about a
given load. Caching is a non-starter since you are basically
caching something the size of memoryssa ;)).</div>
<div><br>
</div>
</div>
<div class="gmail_extra"><br>
<div class="gmail_quote">On Mon, Aug 22, 2016 at 2:27 PM, Geoff
Berry <span dir="ltr"><<a href="mailto:gberry@codeaurora.org" target="_blank">gberry@codeaurora.org</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">gberry
added a comment.<br>
<br>
I've collected some compile time stats when enabling
MemorySSA EarlyCSE just for the EarlyCSE pass added at the
beginning of addFunctionSimplificationPasse<wbr>s at O2 and
higher.<br>
There were 8 benchmarks in the llvm test-suite whose compile
time increased by more than 1%. The biggest increase was in
consumer-typeset. Drilling down a bit, the MemorySSA
construction time for compiling the z44.c input to this
benchmark is reported as 2% of runtime.<br>
<br>
<br>
<a href="https://reviews.llvm.org/D19821" rel="noreferrer" target="_blank">https://reviews.llvm.org/D1982<wbr>1</a><br>
<br>
<br>
<br>
</blockquote>
</div>
<br>
</div>
</blockquote>
<br>
</div></div><span class="HOEnZb"><font color="#888888"><pre cols="72">--
Geoff Berry
Employee of Qualcomm Datacenter Technologies, Inc.
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc. Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.</pre>
</font></span></div>
</blockquote></div><br></div>