[PATCH] D56118: [ARM]: Add optimized NEON uint64x2_t multiply routine.

Sun Jan 6 12:51:19 PST 2019

RKSimon added inline comments.

================
Comment at: test/Analysis/CostModel/ARM/mult.ll:1
+; RUN: opt < %s  -cost-model -analyze -mtriple=thumbv7-apple-ios6.0.0 -mcpu=cortex-a9 | FileCheck %s
+
----------------
You might find the utils\update_analyze_test_checks.py script useful to make this more maintainable - see X86\arith.ll for examples.

================
Comment at: test/CodeGen/ARM/vmul.ll:69
+;CHECK: vld1.64         {d20, d21}, [r0]
+;CHECK; vand            q8, q10, q8
+;CHECK: vmovn.i64       d18, q9
----------------
easyaspi314 wrote:
> craig.topper wrote:
> > efriedma wrote:
> > > DAGCombine should be able to catch the redundant AND... but it looks like DAGCombiner::visitTRUNCATE doesn't try to handle demanded bits for vectors.  (I guess it didn't get updated when other operations got support for vector operands?)
> > PR39689 mentions this is disabled for vectors. Maybe @rksimon or @spatel are working on it?
> That would explain why it was choking on this when X86 does not.
> PR39689 mentions this is disabled for vectors. Maybe @RKSimon or @spatel are working on it?

I've been putting it off as there's a load of yak shaving to be done for it - but I will look again.

================
Comment at: test/CodeGen/ARM/vmul.ll:40
+
+define <2 x i64> @vmuli64(<2 x i64>* %A, <2 x i64>* %B) nounwind {
+;CHECK-LABEL: vmuli64
----------------
Please add these new tests to trunk with current codegen now then rebase this patch so it shows the changes to codegen.

Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D56118/new/

https://reviews.llvm.org/D56118