[PATCH] Improve performance of vector code on A15

Jakob Stoklund Olesen stoklund at 2pi.dk
Thu Mar 14 09:34:47 PDT 2013


On Mar 14, 2013, at 5:24 AM, "Silviu Baranga" <silbar01 at arm.com> wrote:

> I'm attaching a new version of the A15 neon optimization patch
> with the following improvements:
> 
> - we now always check if a register is virtual before
> calling getVRegDef.
> 
> - the elideCopiesAndPHIs method previously had a chance to
> enter in an infinite recursion since it also looks past PHI nodes.
> I've fixed that by using a DFS instead.
> 
> - disabled VMOVS widening on A15 since it was not working well with
> the optimization pass.
> 
> I've also tested the patch using the LNT testsuite and didn't get
> any failures (except the ones that we already know about and were
> already failing without the patch).
> 
> Would this be close to a state where it can get committed?

Looks good, except you should have to run DCE again. This doesn't seem like something that needs to create dead code.

/jakob

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20130314/bf38da76/attachment.html>


More information about the llvm-commits mailing list