[llvm-commits] [llvm] r85697 - in /llvm/trunk: lib/Target/ARM/ARMInstrNEON.td test/CodeGen/ARM/fmacs.ll test/CodeGen/ARM/fnmacs.ll test/CodeGen/Thumb2/cross-rc-coalescing-2.ll
Evan Cheng
evan.cheng at apple.com
Sun Nov 1 10:31:21 PST 2009
On Nov 1, 2009, at 10:22 AM, Anton Korobeynikov wrote:
> Hello, Evan
>
>> On the other hand, a vmla.32 followed by another vmla.32 is just
>> fine. And
>> it is faster than vmul + vadd. I agree we should try to solve it
>> better.
>> Perhaps expanding it before or during schedule2.
> Right, NEON scheduling is tricky, it seems that our instruction
> itineraries are not expressible enough for such complex pipelines.
I think we should be able to handle at least the true dependency
cases. Instruction latency is a function of both defining instruction
and the use. cc'ing David for his comments.
Evan
>
> --
> With best regards, Anton Korobeynikov
> Faculty of Mathematics and Mechanics, Saint Petersburg State
> University
More information about the llvm-commits
mailing list