<div dir="ltr">Hi Hal,<div><br></div><div>I wanted to check whether a conclusion had already been reached about these unrolling methods on the AArch64 target. It seems the answer is no, so it is worth spending more time tuning the parameters before sending out a patch. Thanks for providing some background on this.</div>

<div><br></div><div>Regards,</div><div>Kevin</div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-07-31 22:11 GMT+08:00 Hal Finkel <span dir="ltr"><<a href="mailto:hfinkel@anl.gov" target="_blank">hfinkel@anl.gov</a>></span>:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="">----- Original Message -----<br>

> From: "Kevin Qin" <<a href="mailto:kevinqindev@gmail.com">kevinqindev@gmail.com</a>><br>

> To: "LLVM Developers Mailing List" <<a href="mailto:llvmdev@cs.uiuc.edu">llvmdev@cs.uiuc.edu</a>><br>

> Sent: Thursday, July 31, 2014 3:03:19 AM<br>

> Subject: [LLVMdev] Should we enable Partial unrolling and Runtime unrolling   on AArch64?<br>

><br>

><br>

><br>

><br>

><br>

> Hi all,<br>

><br>

><br>

> Partial unrolling and runtime unrolling are enabled by default in<br>

> GCC for AArch64, which helps performance. In LLVM, however, these two<br>

> methods are enabled for only a few backends: X86, PowerPC and<br>

> R600. I don't know the history of these two kinds of unrolling, or<br>

> why they are not more widely used. I would also like to know: are<br>

> they intentionally disabled for the AArch64 backend?<br>

><br>

> I've run some experiments around this and see that performance is<br>

> indeed affected. Overall, partial unrolling brings a small benefit<br>

> in most benchmark cases, and the regressions are mostly small.<br>

> Runtime unrolling brings a huge improvement in certain cases<br>

> but also a huge regression in others. The proportion of improvements<br>

</div>> and regressions varies across benchmarks. Also, code size<br>

<div class="">> increases with both methods.<br>

><br>

><br>

> I will share more information before anything is changed. For now I just<br>

> want to know more about the background of these two unrolling methods.<br>

<br>

</div>These unrolling methods have been available in LLVM for several years, but the pass-manager setup and TTI hooks that let backends enable them in a target-specific way are relatively new. As you've noticed, per-target tuning is required. Patches are certainly welcome; if you have a modification for AArch64 that provides significant benefits and little downside, please send it to llvm-commits for review.<br>


<br>

Thanks for looking at this.<br>

<br>

 -Hal<br>

<div class=""><br>

><br>

><br>

> --<br>

><br>

> Best Regards,<br>

><br>

><br>

> Kevin Qin<br>

><br>

</div>

<span class="HOEnZb"><font color="#888888"><br>

--<br>

Hal Finkel<br>

Assistant Computational Scientist<br>

Leadership Computing Facility<br>

Argonne National Laboratory<br>

</font></span></blockquote></div><br><br clear="all"><div><br></div>-- <br><div dir="ltr">Best Regards,<div><br></div><div>Kevin Qin</div></div>

</div></div>