[LLVMdev] [RFC] Heuristic for complete loop unrolling

Sean Silva chisophugis at gmail.com
Mon Jan 26 05:52:24 PST 2015


On Sat, Jan 24, 2015 at 5:39 PM, Owen Anderson <resistor at mac.com> wrote:

>
>
> > On Jan 24, 2015, at 5:02 AM, Sean Silva <chisophugis at gmail.com> wrote:
> >
> > I actually think the general problem that this thread has been talking
> about (how to determine if a particular optimization is going to enable
> further optimizations) is very interesting in general, but I would take
> this particular example with a grain of salt because is a very specific
> stylized computation.
>
> I see variations of this optimization come up in many contexts in graphics
> programming, not just convolutions.  It's very common in code that operates
> on 4x4 matrices, which comes up frequently in vertex shaders.
>
> Convolutions and stencils are particularly interesting in that there are
> *even more* optimizations you can do for them, typically with the goal of
> improving cache locality.  But what Michael is proposing is useful for
> other classes of programs as well.
>
>
Great! I just wanted raise "gain of salt"-level awareness of the
specificity of this particular example.

-- Sean Silva



> -Owen
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20150126/adfefec9/attachment.html>


More information about the llvm-dev mailing list