[LLVMdev] Enabling the vectorizer for -Os

JF Bastien jfb at google.com
Thu Jun 6 08:45:45 PDT 2013


>
> Will knowing the exact number help us in making a better decision ? I
> don’t think so. On modern processors when running vectorizable loops, the
> code size of the vector instructions is almost never the bottleneck.
>

I'd make a slightly different point: being able to estimate the number of
UOPs will make a big difference if it allows you to fit your loop in the
loop stream detector.

So I'd agree that estimating x86 encoded code size doesn't matter that much
for performance (though I$ pressure is a big issue for may codebases, but I
assume you're talking about tight vectorizable kernels), but estimating
UOPs does matter a great deal.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20130606/3474ae24/attachment.html>


More information about the llvm-dev mailing list