[llvm-commits] [PATCH 1/2] Improve optimization to fold base update into LDM/STM

Fri Aug 31 11:02:41 PDT 2012

Please review the attached patch. Thanks!

The base updated optimization for LDM/STM tries to fold an update of the
base
address register into the instruction itself. For example,

  LDM R0, {R1, R2}
  ADD R0, R0, #8
  ==>
  LDM R0!, {R1, R2}

This commit improves the existing optimization in the following ways:

  1. Try to optimize all LDM/STM instructions -- previously we only
     transformed LDM/STM instructions that were created by grouping multiple
     LDR/STR instructions together in an earlier phase of the optimization.
  2. Try to merge with instruction any distance away -- previously we only
     merged with an instruction directly preceeding or following the
LDM/STM.
  3. Add support for optimizing VLDM/VSTM.

The new support for folding the base update into vector memory ops include
the
following instructions:

  VLDM  == VLDMIA
  VSTM  == VSTMIA

We can fold the base address update into vector loads/stores that access
1 singleword register, 1 doubleword register, or 2 doubleword registers.

In order to enable the optimization for 2 doubleword registers I also added
new
pseudo instructions for VLDM/VSTM with update.

The new pseudo instructions are for the VLDMQ and VSTMQ instructions that
modify the base register. We can use these instructions to generate the VLDM
and VSTM instructions that write back to the base register. For example,

   vldm R0!, {D0, D1}
   vstm R0!, {D0, D1}

-- Qualcomm Innovation Center, Inc. is a member of Code Aurora Forum, hosted
by The Linux Foundation

-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-Improve-optimization-to-fold-base-update-into-LDM-ST.patch
Type: application/octet-stream
Size: 29518 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20120831/cc5d1abf/attachment.obj>