[llvm-commits] [llvm] r85697 - in /llvm/trunk: lib/Target/ARM/ARMInstrNEON.td test/CodeGen/ARM/fmacs.ll test/CodeGen/ARM/fnmacs.ll test/CodeGen/Thumb2/cross-rc-coalescing-2.ll

David Conrad lessen42 at gmail.com
Wed Nov 4 01:29:27 PST 2009


On Nov 3, 2009, at 5:07 PM, Jim Grosbach wrote:

>
> On Nov 2, 2009, at 1:53 AM, David Conrad wrote:
>
>> Thus even without modeling the special behaviour of vmla it's always
>> better to use it: it'll always be at least as fast as a separate vmul
>> +vadd. This applies to the integer versions as well.
>
>
> Hi David,
>
> Unfortunately, this turns out not to be the case. The NEON unit will  
> stall adjacent instructions in the presence of vmla to preserve in- 
> order retirement. If a RAW hazard is present, the stall is 8  
> (possibly 7) cycles, otherwise it is 4 cycles.

You're correct, sorry for the noise and wrong information.



More information about the llvm-commits mailing list