[PATCH] Improve performance of vector code on A15

Quentin Colombet qcolombet at apple.com
Thu Feb 21 10:33:50 PST 2013


Hi James,

Maybe a stupid remark, looking at your patch, unless I miss something, if I choose to disable your pass, I don't get the previous behavior for CortexA15 as the createExecutionDependencyFixPass will be skipped.
Is that intended/desirable?

-Quentin

On Feb 21, 2013, at 10:13 AM, James Molloy <James.Molloy at arm.com> wrote:

> Hi,
> 
> Attached is a patch which improves performance of vector code on
> Cortex-A15.
> 
> Certain code sequences, where an S register is written but read as a D
> register, can execute slowly. This patch defines a pass which detects
> when SPRs are written but read as DPRs and inserts code sequences which,
> although nonobvious, make code run much quicker.
> 
> The second patch, implicit-ops.diff, fixes an unrelated bug with
> implicit defs/uses that happens to impact the A15 pass's efficiency.
> 
> Please review!
> 
> Cheers,
> 
> James
> 
> -- IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium.  Thank you.<a15-sd-optimizer.diff><implicit-ops.diff>_______________________________________________
> llvm-commits mailing list
> llvm-commits at cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20130221/3b7d95d0/attachment.html>


More information about the llvm-commits mailing list