<div dir="ltr"><div dir="ltr">On Mon, Apr 1, 2019 at 6:47 PM Eric Christopher <<a href="mailto:echristo@gmail.com">echristo@gmail.com</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hi All,<br>
<br>
Looks like people are generally supportive of the changes to O1 even<br>
if there's some discussion around O1/Og equivalence that I'll take to<br>
a separate set of replies. Mostly so that I can get O1 changes<br>
separate from "new Og pipeline" changes even if I think that's the<br>
direction we should ultimately go :)<br></blockquote><div><br></div><div>One comment I'll make (as I'm somewhat obviously supportive having discussed this before):</div><div><br></div><div>I think the key thing isn't that we *necessarily* leave -Og here. But that if we have a good -O1 pipeline and use that as a starting point for -Og, then we can critically evaluate whether there is something that we would like to do at -Og but would harm the goals of -O1. If so, that becomes a much more clear and precise basis for diverging. Until then, I suspect it would still be a better starting point than -O2.</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>
In general though I wanted to give a heads up as to the direction I<br>
was planning on taking things and seeing if anyone necessarily<br>
disagreed with the general idea. I'll plan on working up a set of pass<br>
pipeline patches (after some cleanup) and send out some reviews -<br>
please let me know (privately) if you'd like to be explicitly cc'd on<br>
any review otherwise I'll probably pick a set of people out of the cc<br>
line here.<br>
<br>
Thanks a lot for the responses and looking forward to getting some<br>
changes going in this direction!<br>
<br>
-eric<br>
<br>
On Thu, Mar 28, 2019 at 7:09 PM Eric Christopher <<a href="mailto:echristo@gmail.com" target="_blank">echristo@gmail.com</a>> wrote:<br>
><br>
> Hi All,<br>
><br>
> I’ve been thinking about both O1 and Og optimization levels and have a<br>
> proposal for an improved O1 that I think overlaps in functionality<br>
> with our desires for Og. The design goal is to rewrite the O1<br>
> optimization and code generation pipeline to include the set of<br>
> optimizations that minimizes build and test time while retaining our<br>
> ability to debug.<br>
><br>
> This isn’t to minimize efforts around optimized debugging or negate O0<br>
> builds, but rather to provide a compromise mode that encompasses some<br>
> of the benefits of both. In effect to create a “build mode for<br>
> everyday development”.<br>
><br>
> This proposal is a first approximation guess on direction. I’ll be<br>
> exploring different options and combinations, but I think this is a<br>
> good place to start for discussion. Unless there are serious<br>
> objections to the general direction I’d like to get started so we can<br>
> explore and look at the code as it comes through review.<br>
><br>
><br>
> Optimization and Code Generation Pipeline<br>
><br>
> The optimization passes chosen fall into a few main categories,<br>
> redundancy elimination and basic optimization/abstraction elimination.<br>
> The idea is that these are going to be the optimizations that a<br>
> programmer would expect to happen without affecting debugging. This<br>
> means not eliminating redundant calls or non-redundant loads as those<br>
> could fail in different ways and locations while executing. These<br>
> optimizations will also reduce the overall amount of code going to the<br>
> code generator helping both linker input size and code generation<br>
> speed.<br>
><br>
> Dead code elimination<br>
><br>
> - Dead code elimination (ADCE, BDCE)<br>
> - Dead store elimination<br>
> - Parts of CFG Simplification<br>
> - Removing branches and dead code paths and not including commoning<br>
> and speculation<br>
><br>
> Basic Scalar Optimizations<br>
><br>
> - Constant propagation including SCCP and IPCP<br>
> - Constant merging<br>
> - Instruction Combining<br>
> - Inlining: always_inline and normal inlining passes<br>
> - Memory to register promotion<br>
> - CSE of “unobservable” operations<br>
> - Reassociation of expressions<br>
> - Global optimizations - try to fold globals to constants<br>
><br>
> Loop Optimizations<br>
><br>
> Loop optimizations have some problems around debuggability and<br>
> observability, but a suggested set of passes would include<br>
> optimizations that remove abstractions and not ones that necessarily<br>
> optimize for performance.<br>
><br>
> - Induction Variable Simplification<br>
> - LICM but not promotion<br>
> - Trivial Unswitching<br>
> - Loop rotation<br>
> - Full loop unrolling<br>
> - Loop deletion<br>
><br>
> Pass Structure<br>
><br>
> Overall pass ordering will look similar to the existing pass layout in<br>
> llvm with passes added or subtracted for O1 rather than a new pass<br>
> ordering. The motivation here is to make the overall proposal easier<br>
> to understand initially upstream while also maintaining existing pass<br>
> pipeline synergies between passes.<br>
><br>
> Instruction selection<br>
><br>
> We will use the fast instruction selector (where it exists) for three reasons:<br>
> - Significantly faster code generation than llvm’s dag based<br>
> instruction selection<br>
> - Better debugability than selection dag - fewer instructions moved around<br>
> - Fast instruction selection has been optimized somewhat and<br>
> shouldn’t be an outrageous penalty on most architectures<br>
><br>
> Register allocation<br>
><br>
> The fast register allocator should be used for compilation speed.<br>
><br>
> Thoughts?<br>
><br>
> Thanks!<br>
><br>
> -eric<br>
</blockquote></div></div>