[PATCH] D50326: [ARM] FP16: vector VMUL variants
Sjoerd Meijer via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Aug 8 02:23:24 PDT 2018
SjoerdMeijer added inline comments.
================
Comment at: test/CodeGen/ARM/armv8.2a-fp16-vector-intrinsics.ll:996
+ ret <8 x half> %mul
+}
----------------
SjoerdMeijer wrote:
> samparker wrote:
> > Should there not also be a test where both inputs are v8f16?
> ah, I now see what you mean. This is the test and IR for ACLE intrinsic:
>
> float16x8_t vmulq_lane_f16 (float16x8_t a, float16x4_t v, const int lane)
>
> but yes, the pattern would also match for a pattern where the 2nd operand is a v8f16.
>
The shufflevector is creating a 8 x half vector here, with half the elements undef because we pass in a 4 x half, so it actually looks all okay here?
https://reviews.llvm.org/D50326
More information about the llvm-commits
mailing list