[PATCH] Improve performance of vector code on A15

James Molloy James.Molloy at arm.com
Thu Feb 21 10:13:52 PST 2013


Hi,

Attached is a patch which improves performance of vector code on
Cortex-A15.

Certain code sequences, where an S register is written but read as a D
register, can execute slowly. This patch defines a pass which detects
when SPRs are written but read as DPRs and inserts code sequences which,
although nonobvious, make code run much quicker.

The second patch, implicit-ops.diff, fixes an unrelated bug with
implicit defs/uses that happens to impact the A15 pass's efficiency.

Please review!

Cheers,

James

-- IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium.  Thank you.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: a15-sd-optimizer.diff
Type: text/x-patch
Size: 20684 bytes
Desc: a15-sd-optimizer.diff
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20130221/2b98e865/attachment.bin>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: implicit-ops.diff
Type: text/x-patch
Size: 1197 bytes
Desc: implicit-ops.diff
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20130221/2b98e865/attachment-0001.bin>


More information about the llvm-commits mailing list