[PATCH] D77872: [AArch32] Armv8.6-a Matrix Mult Assembly + Intrinsics

Luke Geeson via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Thu Apr 16 06:10:24 PDT 2020


LukeGeeson marked 3 inline comments as done.
LukeGeeson added inline comments.


================
Comment at: clang/test/CodeGen/arm-v8.6a-neon-intrinsics.c:3
+// RUN: -fallow-half-arguments-and-returns -S -disable-O0-optnone -emit-llvm -o - %s \
+// RUN: | opt -S -mem2reg \
+// RUN: | FileCheck %s
----------------
miyuki wrote:
> Can you try -sroa after -mem2reg? I think it should eliminate some more useless stores and loads.
added, test updated


================
Comment at: llvm/lib/Target/ARM/ARMInstrNEON.td:4839
   let Constraints = "$dst = $Vd";
+  let hasNoSchedulingInfo = 1;
 }
----------------
miyuki wrote:
> Why was this added?
Added a vusdot class to handle this


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D77872/new/

https://reviews.llvm.org/D77872





More information about the cfe-commits mailing list