[PATCH] D51780: ARM: align loops to 4 bytes on Cortex-M3 and Cortex-M4.

Wed Sep 12 04:13:18 PDT 2018

t.p.northover updated this revision to Diff 165056.
t.p.northover added a comment.

I think I've implemented the suggested changes, except for the function alignment one.

The benchmarks showed only about a 0.2% difference in cycle count, and (crucially) there is no way to check for OptSize when deciding function alignment, so we would inevitably pessimize some cases.
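
To illustrate the asymmetry described above, here is a minimal sketch. It assumes a per-function decision can see a size-optimization flag, while a target-wide function-alignment preference cannot. All names in it (FunctionInfo, SubtargetInfo, loopAlignmentInBytes, functionAlignmentInBytes, paddingBytes) are hypothetical stand-ins, not LLVM's real classes or the code in this patch, and the policy shown is only an example.

```cpp
#include <cassert>

// Hypothetical stand-in for per-function attributes (not an LLVM class).
struct FunctionInfo {
  bool OptForSize; // models a size-optimization attribute such as optsize
};

// Hypothetical stand-in for target-wide preferences (not an LLVM class).
struct SubtargetInfo {
  unsigned PrefLoopLogAlignment;     // log2 of loop alignment; 2 means 4 bytes
  unsigned PrefFunctionLogAlignment; // log2 of function alignment
};

// A loop-alignment decision made while processing one function can take
// that function's attributes into account.
unsigned loopAlignmentInBytes(const SubtargetInfo &ST, const FunctionInfo &F) {
  if (F.OptForSize)
    return 1; // example policy only: no padding when optimizing for size
  return 1u << ST.PrefLoopLogAlignment;
}

// A target-wide preference has no function to inspect, so the same value
// applies to size-optimized and speed-optimized functions alike.
unsigned functionAlignmentInBytes(const SubtargetInfo &ST) {
  return 1u << ST.PrefFunctionLogAlignment;
}

// Padding bytes needed to move Offset up to the next multiple of Align.
unsigned paddingBytes(unsigned Offset, unsigned Align) {
  assert(Align != 0 && "alignment must be non-zero");
  return (Align - Offset % Align) % Align;
}
```

In this model, a block at offset 6 needs 2 bytes of padding to reach 4-byte alignment. That is the kind of size cost that cannot be avoided selectively if the alignment is chosen without seeing the function's attributes.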

Repository:
  rL LLVM

https://reviews.llvm.org/D51780

Files:
  llvm/include/llvm/CodeGen/TargetLowering.h
  llvm/lib/CodeGen/MachineBlockPlacement.cpp
  llvm/lib/Target/ARM/ARM.td
  llvm/lib/Target/ARM/ARMISelLowering.cpp
  llvm/lib/Target/ARM/ARMISelLowering.h
  llvm/lib/Target/ARM/ARMSubtarget.h
  llvm/test/CodeGen/ARM/loop-align-cortex-m.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D51780.165056.patch
Type: text/x-patch
Size: 7267 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180912/b0f1fdc6/attachment-0001.bin>