Evan Cheng <evan.cheng at apple.com> writes:
> 1. When is this profitable? We can model size of loop buffer. But this
> is also dependent on loop alignment. We may have to align small loops
> on 32 byte boundary to get this right.
We almost always want to do this anyway on the newest processors.
-Dave