[llvm] r371321 - [aarch64] Add combine patterns for fp16 fmla

Tue Sep 24 08:29:59 PDT 2019

On Mon, Sep 23, 2019 at 6:03 AM Tim Northover <t.p.northover at gmail.com>
wrote:

> Hi Sebastian,
>
> > define <8 x half> @test_FMLSv8f16_OP1(<8 x half> %a, <8 x half> %b, <8 x
> half> %c) {
> > ; CHECK-LABEL: test_FMLSv8f16_OP1:
> > ; CHECK: fmls    {{v[0-9]+}}.8h, {{v[0-9]+}}.8h, {{v[0-9]+}}.8h
> > entry:
> >   %mul = fmul fast <8 x half> %c, %b
> >   %sub = fsub fast <8 x half> %mul, %a
> >   ret <8 x half> %sub
> > }
>
> This doesn't look right to me. The exact instruction produced is "fmls
> v0.8h, v2.8h, v1.8h", which I think calculates "v0 - v2*v1", but the
> IR is calculating "v2*v1-v0". The equivalent <4 x float> code also
> doesn't emit an fmls.
>
>
You are right, I will send a patch to fix this.

Sebastian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20190924/99836862/attachment.html>