[LLVMdev] Enabling the vectorizer for -Os
Nadav Rotem
nrotem at apple.com
Tue Jun 4 20:26:14 PDT 2013
Hi,
I would like to start a discussion about enabling the loop vectorizer by default for -Os. The loop vectorizer can accelerate many workloads and enabling it for -Os and -O2 has obvious performance benefits. At the same time the loop vectorizer can increase the code size because of two reasons. First, to vectorize some loops we have to keep the original loop around in order to handle the last few iterations. Second, on x86 and possibly other targets, the encoding of vector instructions takes more space.
The loop vectorizer is already aware of the ‘optsize’ attribute and it does not vectorize loops which require that we keep the scalar tail. It also does not unroll loops when optimizing for size. It is not obvious but there are many cases in which this conservative kind of vectorization is profitable. The loop vectorizer does not try to estimate the encoding size of instructions and this is one reason for code growth.
I measured the effects of vectorization on performance and binary size using -Os. I measured the performance on a Sandybridge and compiled our test suite using -mavx -f(no)-vectorize -Os. As you can see in the attached data there are many workloads that benefit from vectorization. Not as much as vectorizing with -O3, but still a good number of programs. At the same time the code growth is minimal. Most workloads are unaffected and the total code growth for the entire test suite is 0.89%. Almost all of the code growth comes from the TSVC test suite which contains a large number of large vectorizable loops. I did not measure the compile time in this batch but I expect to see an increase in compile time in vectorizable loops because of the time we spend in codegen.
I am interested in hearing more opinions and discussing more measurements by other people.
Nadav
.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: VectorizationOsSize.pdf
Type: application/pdf
Size: 67620 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20130604/23345076/attachment.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: VectorizationOsPerf.pdf
Type: application/pdf
Size: 60392 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20130604/23345076/attachment-0001.pdf>
More information about the llvm-dev
mailing list