[PATCH] Improve performance of vector code on A15
Jakob Stoklund Olesen
stoklund at 2pi.dk
Thu Mar 14 09:34:47 PDT 2013
On Mar 14, 2013, at 5:24 AM, "Silviu Baranga" <silbar01 at arm.com> wrote:
> I'm attaching a new version of the A15 neon optimization patch
> with the following improvements:
>
> - we now always check if a register is virtual before
> calling getVRegDef.
>
> - the elideCopiesAndPHIs method previously had a chance to
> enter in an infinite recursion since it also looks past PHI nodes.
> I've fixed that by using a DFS instead.
>
> - disabled VMOVS widening on A15 since it was not working well with
> the optimization pass.
>
> I've also tested the patch using the LNT testsuite and didn't get
> any failures (except the ones that we already know about and were
> already failing without the patch).
>
> Would this be close to a state where it can get committed?
Looks good, except you should have to run DCE again. This doesn't seem like something that needs to create dead code.
/jakob
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20130314/bf38da76/attachment.html>
More information about the llvm-commits
mailing list