[PATCH] D41772: [AArch64] optimise v4f16 FCMPs to utilise vector instructions

Carey Williams via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jan 5 09:22:35 PST 2018


carwil created this revision.
carwil added a reviewer: SjoerdMeijer.
Herald added subscribers: kristof.beyls, javed.absar, rengolin, aemerson.

Improves the code generation for v4f16 FCMP instructions when FullFP16 is not supported by generating FCTVL(s) rather than a longer series of FCVTs.


https://reviews.llvm.org/D41772

Files:
  lib/Target/AArch64/AArch64ISelLowering.cpp
  test/CodeGen/AArch64/fp16-v4-instructions.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D41772.128760.patch
Type: text/x-patch
Size: 10667 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180105/8bf8b1db/attachment.bin>


More information about the llvm-commits mailing list