[PATCH] Improve performance of vector code on A15

Joey Gouly joey.gouly at arm.com
Thu Feb 21 10:25:40 PST 2013


Minor thing: you should use a triple slash (///) for documentation
comments (especially for A15SDOptimizer::runOnMachineInstruction), and
I think they have to go directly above the function.
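
Roughly what I mean, as a minimal sketch. The struct and function below
are hypothetical stand-ins made up for illustration; they are not taken
from the patch, and the real pass works on LLVM's machine instructions
rather than a toy struct.

```cpp
// Hypothetical stand-in type for illustration only.
struct ToyInstr {
  bool WritesSPR;   // the instruction writes an S register
  bool ReadAsDPR;   // the result is later read as a D register
};

/// Returns true if \p MI writes an S register whose value is later
/// read as a D register.
///
/// The comment uses `///` and sits immediately above the declaration,
/// which is where Doxygen looks for a function's documentation.
constexpr bool needsFixup(ToyInstr MI) {
  return MI.WritesSPR && MI.ReadAsDPR;
}
```

A plain `//` comment in the same position would still compile, but it
would not be picked up as documentation.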

Joey

-----Original Message-----
From: llvm-commits-bounces at cs.uiuc.edu
[mailto:llvm-commits-bounces at cs.uiuc.edu] On Behalf Of James Molloy
Sent: 21 February 2013 18:14
To: llvm-commits
Cc: Silviu Baranga
Subject: [PATCH] Improve performance of vector code on A15

Hi,

Attached is a patch which improves performance of vector code on
Cortex-A15.

Certain code sequences, in which an S register is written but then read
as a D register, can execute slowly. This patch defines a pass which
detects when SPRs are written but read as DPRs and inserts code sequences
which, although non-obvious, make the code run much faster.
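
To make the pattern concrete, here is a minimal sketch of the register
overlap involved. It relies only on the architectural fact that D<n>
overlaps S<2n> and S<2n+1> for n in 0..15. The function names and the
plain integer register numbering are assumptions made for illustration;
this is not the code from the patch.

```cpp
// Toy register model, not LLVM's: S registers are numbered 0..31 and
// D registers 0..15, with D<n> overlapping S<2n> and S<2n+1>.
constexpr unsigned dRegContaining(unsigned SReg) { return SReg / 2; }

// True when a write to S<SReg> and a read of D<DReg> touch the same
// physical register, i.e. the written-as-S, read-as-D pattern.
constexpr bool isSWriteThenDRead(unsigned SReg, unsigned DReg) {
  return SReg < 32 && DReg < 16 && dRegContaining(SReg) == DReg;
}
```

For example, a write to S2 or S3 followed by a read of D1 matches,
whereas a write to S2 followed by a read of D0 does not.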

The second patch, implicit-ops.diff, fixes an unrelated bug with
implicit defs/uses that happens to impact the A15 pass's efficiency.

Please review!

Cheers,

James
