[llvm] d8ba9e5 - [ARM] Cortex-M55 Scheduling Model

David Green via llvm-commits llvm-commits at lists.llvm.org
Sat Jan 21 10:03:30 PST 2023


Author: David Green
Date: 2023-01-21T18:03:24Z
New Revision: d8ba9e505ac3a57211fade270532519adde374c2

URL: https://github.com/llvm/llvm-project/commit/d8ba9e505ac3a57211fade270532519adde374c2
DIFF: https://github.com/llvm/llvm-project/commit/d8ba9e505ac3a57211fade270532519adde374c2.diff

LOG: [ARM] Cortex-M55 Scheduling Model

This adds an Arm Cortex-M55 scheduling model, using the information from
https://developer.arm.com/documentation/102692/latest/

Differential Revision: https://reviews.llvm.org/D141523

Added: 
    llvm/lib/Target/ARM/ARMScheduleM55.td
    llvm/test/tools/llvm-mca/ARM/m55-fp.s
    llvm/test/tools/llvm-mca/ARM/m55-int.s
    llvm/test/tools/llvm-mca/ARM/m55-mve-fp.s
    llvm/test/tools/llvm-mca/ARM/m55-mve-int.s
    llvm/test/tools/llvm-mca/ARM/m55-mve-ldst.s
    llvm/test/tools/llvm-mca/ARM/m55-mve-pred.s
    llvm/test/tools/llvm-mca/ARM/m55-storefwd.s

Modified: 
    llvm/lib/Target/ARM/ARM.td
    llvm/test/CodeGen/Thumb2/LowOverheadLoops/spillingmove.ll
    llvm/test/CodeGen/Thumb2/aligned-nonfallthrough.ll
    llvm/test/CodeGen/Thumb2/mve-pipelineloops.ll

Removed: 
    


################################################################################
diff  --git a/llvm/lib/Target/ARM/ARM.td b/llvm/lib/Target/ARM/ARM.td
index ec631b73f973d..5ccc603f6b426 100644
--- a/llvm/lib/Target/ARM/ARM.td
+++ b/llvm/lib/Target/ARM/ARM.td
@@ -1222,6 +1222,7 @@ include "ARMScheduleSwift.td"
 include "ARMScheduleR52.td"
 include "ARMScheduleA57.td"
 include "ARMScheduleM4.td"
+include "ARMScheduleM55.td"
 include "ARMScheduleM7.td"
 
 //===----------------------------------------------------------------------===//
@@ -1497,7 +1498,7 @@ def : ProcessorModel<"cortex-m35p", CortexM4Model,      [ARMv8mMainline,
                                                          FeatureHasNoBranchPredictor,
                                                          FeatureFixCMSE_CVE_2021_35465]>;
 
-def : ProcessorModel<"cortex-m55", CortexM4Model,      [ARMv81mMainline,
+def : ProcessorModel<"cortex-m55", CortexM55Model,      [ARMv81mMainline,
                                                          FeatureDSP,
                                                          FeatureFPARMv8_D16,
                                                          FeatureUseMISched,

diff  --git a/llvm/lib/Target/ARM/ARMScheduleM55.td b/llvm/lib/Target/ARM/ARMScheduleM55.td
new file mode 100644
index 0000000000000..f24f97b26f0aa
--- /dev/null
+++ b/llvm/lib/Target/ARM/ARMScheduleM55.td
@@ -0,0 +1,478 @@
+//==- ARMScheduleM55.td - Arm Cortex-M55 Scheduling Definitions -*- tablegen -*-=//
+//
+// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
+// See https://llvm.org/LICENSE.txt for license information.
+// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+//
+//===----------------------------------------------------------------------===//
+//
+// This file defines the scheduling model for the Arm Cortex-M55 processors.
+//
+//===----------------------------------------------------------------------===//
+
+// ===---------------------------------------------------------------------===//
+// Cortex-M55 is a lot like the M4/M33 in terms of scheduling. It technically
+// has an extra pipeline stage but that is unimportant for scheduling, just
+// starting our model a stage later. The main points of interest over an
+// Cortex-M4 are MVE instructions and the ability to dual issue thumb1
+// instructions.
+//
+//
+// MVE
+//
+// The EPU pipelines now include both MVE and FP instructions. It has four
+// pipelines across 4 stages (E1-E4). These pipelines are "control",
+// "load/store", "integer" and "float/mul". We start the schedule at E2 to line
+// up with the rest of the pipeline we model, and take the latency as the time
+// between reading registers (almost always in E2) and register write (or
+// forward, if it allows it). This mean that a lot of instructions (including
+// loads) actually take 1 cycle (amazingly).
+//
+// Each MVE instruction needs to take 2 beats, each performing 64bits of the
+// 128bit vector operation. So long as the beats are to 
diff erent pipelines,
+// the execution of the first-beat-of-the-second-instruction can overlap with
+// the second-beat-of-the-first. For example a sequence of VLDR;VADD;VMUL;VSTR
+// can look like this is a pipeline:
+//          1    2    3    4    5
+// LD/ST  : VLDR VLDR      VSTR VSTR
+// INTEGER:      VADD VADD
+// FP/MUL :           VMUL VMUL
+//
+// But a sequence of VLDR;VLDRB;VADD;VSTR because the loads cannot overlap,
+// looks like:
+//          1     2     3     4     5    6
+// LD/ST  : VLDR  VLDR  VLDRB VLDRB VSTR VSTR
+// INTEGER:                   VADD  VADD
+//
+// For this schedule, we currently model latencies and pipelines well for each
+// instruction. MVE instruction take two beats, modelled using
+// ResourceCycles=[2].
+//
+//
+// Dual Issue
+//
+// Cortex-M55 can dual issue two 16-bit T1 instructions providing one is one of
+// NOPs, ITs, Brs, ADDri/SUBri, UXTB/H, SXTB/H and MOVri's. NOPs and IT's are
+// not relevant (they will not appear when scheduling), Brs are only at the end
+// of the block. The others are more useful, and where the problems arise.
+//
+// The first problem comes from the fact that we will only be seeing Thumb2
+// instructions at the point in the pipeline where we do the scheduling. The
+// Thumb2SizeReductionPass has not been run yet. Especially pre-ra scheduling
+// (where the scheduler has the most freedom) we can only really guess at which
+// instructions will become thumb1 instructions. We are quite optimistic, and
+// may get some things wrong as a result.
+//
+// The other problem is one of telling llvm what to do exactly. The way we
+// attempt to meld this is:
+//  Set IssueWidth to 2 to allow 2 instructions per cycle.
+//  All instructions we cannot dual issue are "SingleIssue=1" (MVE/FP and T2
+//    instructions)
+//  We guess at another set of instructions that will become T1 instruction.
+//    These become the primary instruction in a dual issue pair (the normal
+//    one). These use normal resources and latencies, but set SingleIssue = 0.
+//  We guess at another set of instructions that will be shrank down into T1 DI
+//    instructions (add, sub, mov's, etc), which become the secondary. These
+//    don't use a resource, and set SingleIssue = 0.
+//
+// So our guessing is a bit rough. It may be possible to improve this by moving
+// T2SizeReduction pass earlier in the pipeline, for example, so that at least
+// Post-RA scheduling sees what is T1/T2. It may also be possible to write a
+// custom instruction matcher for more accurately guess at T1 instructions.
+
+
+def CortexM55Model : SchedMachineModel {
+  let MicroOpBufferSize = 0;      // Explicitly set to zero since M55 is in-order.
+  let IssueWidth = 2;             // There is some dual-issue support in M55.
+  let MispredictPenalty = 3;      // Default is 10
+  let LoadLatency = 4;            // Default is 4
+  let PostRAScheduler = 1;
+  let FullInstRWOverlapCheck = 1;
+
+  let CompleteModel = 0;
+  let UnsupportedFeatures = [IsARM, HasNEON, HasDotProd, HasMatMulInt8, HasZCZ,
+                             IsNotMClass, HasV8, HasV8_3a, HasTrustZone, HasDFB,
+                             IsWindows];
+}
+
+
+let SchedModel = CortexM55Model in {
+
+//===----------------------------------------------------------------------===//
+// Define each kind of processor resource and number available.
+
+// Modeling each pipeline as a ProcResource using the BufferSize = 0 since
+// M55 is in-order.
+def M55UnitALU : ProcResource<1> { let BufferSize = 0; } // Int ALU
+def M55UnitVecALU : ProcResource<1> { let BufferSize = 0; } // MVE integer pipe
+def M55UnitVecFPALU : ProcResource<1> { let BufferSize = 0; } // MVE float pipe
+def M55UnitLoadStore : ProcResource<1> { let BufferSize = 0; } // MVE load/store pipe
+def M55UnitVecSys : ProcResource<1> { let BufferSize = 0; } // MVE control/sys pipe
+
+// Some VMOV's can go down either pipeline. FIXME: This M55Write2IntFPE2 is
+// intended to model the VMOV taking either Int or FP for 2 cycles. It is not
+// clear if the llvm scheduler is using it like we want though.
+def M55UnitVecIntFP: ProcResGroup<[M55UnitVecALU, M55UnitVecFPALU]>;
+
+
+//===----------------------------------------------------------------------===//
+// Subtarget-specific SchedWrite types which both map the ProcResources and
+// set the latency.
+
+//=====//
+// ALU //
+//=====//
+
+// Generic writes for Flags, GRPs and other extra operands (eg post-inc, vadc flags, vaddlv etc)
+def M55WriteLat0  : SchedWriteRes<[]>  { let Latency = 0; let NumMicroOps = 0; }
+def M55WriteLat1  : SchedWriteRes<[]>  { let Latency = 1; let NumMicroOps = 0; }
+def M55WriteLat2  : SchedWriteRes<[]>  { let Latency = 2; let NumMicroOps = 0; }
+
+// DX instructions are ALU instructions that take a single cycle. The
+// instructions that may be shrank to T1 (and can be dual issued) are
+// SingleIssue = 0. The others are SingleIssue = 1.
+let SingleIssue = 0, Latency = 1 in {
+    def : WriteRes<WriteALU, [M55UnitALU]>;
+    def : WriteRes<WriteCMP, [M55UnitALU]>;
+    def : WriteRes<WriteBr, [M55UnitALU]>;
+    def : WriteRes<WriteBrL, [M55UnitALU]>;
+    def : WriteRes<WriteBrTbl, [M55UnitALU]>;
+    def : WriteRes<WriteST, [M55UnitALU]>;
+    def M55WriteDX_DI : SchedWriteRes<[M55UnitALU]>;
+}
+let SingleIssue = 1, Latency = 1 in {
+    def : WriteRes<WritePreLd, [M55UnitALU]>;
+    def M55WriteDX_SI : SchedWriteRes<[M55UnitALU]>;
+}
+
+def : InstRW<[M55WriteDX_SI], (instregex "t2BF[CI]", "t2CPS", "t2DBG",
+          "t2MRS", "t2MSR", "t2SEL", "t2SG", "t2TT")>;
+def : InstRW<[M55WriteDX_SI], (instregex "t2SUBS_PC_LR", "COPY")>;
+def : InstRW<[M55WriteDX_SI], (instregex "t2CS(EL|INC|INV|NEG)")>;
+// Thumb 2 instructions that could be reduced to a thumb 1 instruction and can
+// be dual issued with one of the above. This list is optimistic.
+def : InstRW<[M55WriteDX_DI], (instregex "t2ADDC?rr$", "t2ADDrr$",
+           "t2ADDSrr$", "t2ANDrr$", "t2ASRr[ir]$", "t2BICrr$", "t2CMNzrr$",
+           "t2CMPr[ir]$", "t2EORrr$", "t2LSLr[ir]$", "t2LSRr[ir]$", "t2MVNr$",
+           "t2ORRrr$", "t2REV(16|SH)?$", "t2RORrr$", "t2RSBr[ir]$", "t2RSBSri$",
+           "t2SBCrr$", "t2SUBS?rr$", "t2TEQrr$", "t2TSTrr$", "t2STRi12$",
+           "t2STRs$", "t2STRBi12$", "t2STRBs$", "t2STRHi12$", "t2STRHs$",
+           "t2STR_POST$", "t2STMIA$", "t2STMIA_UPD$", "t2STMDB$", "t2STMDB_UPD$")>;
+def : InstRW<[M55WriteDX_DI], (instregex "t2SETPAN$", "tADC$", "tADDhirr$",
+           "tADDrSP$", "tADDrSPi$", "tADDrr$", "tADDspi$", "tADDspr$", "tADR$",
+           "tAND$", "tASRri$", "tASRrr$", "tBIC$", "tBKPT$", "tCBNZ$", "tCBZ$",
+           "tCMNz$", "tCMPhir$", "tCMPi8$", "tCMPr$", "tCPS$", "tEOR$", "tHINT$",
+           "tHLT$", "tLSLri$", "tLSLrr$", "tLSRri$", "tLSRrr$", "tMOVSr$",
+           "tMUL$", "tMVN$", "tORR$", "tPICADD$", "tPOP$", "tPUSH$", "tREV$",
+           "tREV16$", "tREVSH$", "tROR$", "tRSB$", "tSBC$", "tSETEND$",
+           "tSTMIA_UPD$", "tSTRBi$", "tSTRBr$", "tSTRHi$", "tSTRHr$", "tSTRi$",
+           "tSTRr$", "tSTRspi$", "tSUBrr$", "tSUBspi$", "tSVC$", "tTRAP$",
+           "tTST$", "tUDF$")>;
+def : InstRW<[M55WriteDX_DI], (instregex "tB$", "tBLXNSr$", "tBLXr$", "tBX$",
+           "tBXNS$", "tBcc$")>;
+
+
+// CX instructions take 2 (or more) cycles. Again T1 instructions may be dual
+// issues (SingleIssue = 0)
+let SingleIssue = 0, Latency = 2 in {
+    def : WriteRes<WriteLd, [M55UnitALU]>;
+    def M55WriteCX_DI  : SchedWriteRes<[M55UnitALU]>;
+}
+let SingleIssue = 1, Latency = 2 in {
+    def : WriteRes<WriteALUsi, [M55UnitALU]>;
+    def : WriteRes<WriteALUsr, [M55UnitALU]>;
+    def : WriteRes<WriteALUSsr, [M55UnitALU]>;
+    def : WriteRes<WriteCMPsi, [M55UnitALU]>;
+    def : WriteRes<WriteCMPsr, [M55UnitALU]>;
+    def : WriteRes<WriteDIV, [M55UnitALU]>;
+    def M55WriteCX_SI  : SchedWriteRes<[M55UnitALU]>;
+}
+
+def : SchedAlias<WriteMUL16, M55WriteCX_SI>;
+def : SchedAlias<WriteMUL32, M55WriteCX_SI>;
+def : SchedAlias<WriteMUL64Lo, M55WriteCX_SI>;
+def : WriteRes<WriteMUL64Hi, []> { let Latency = 2; }
+def : SchedAlias<WriteMAC16, M55WriteCX_SI>;
+def : SchedAlias<WriteMAC32, M55WriteCX_SI>;
+def : SchedAlias<WriteMAC64Lo, M55WriteCX_SI>;
+def : WriteRes<WriteMAC64Hi, []> { let Latency = 2; }
+
+def : InstRW<[M55WriteCX_SI], (instregex "t2CDP", "t2CLREX", "t2[DI][MS]B",
+           "t2MCR", "t2MOVSs[ir]", "t2MRC", "t2MUL", "t2STC")>;
+def : InstRW<[M55WriteCX_SI], (instregex "t2Q", "t2[SU](ADD|ASX|BFX|DIV)",
+           "t2[SU]H(ADD|ASX|SUB|SAX)", "t2SM[LM]", "t2S(SAT|SUB|SAX)", "t2UQ",
+           "t2USA", "t2USUB", "t2UXTA[BH]")>;
+def : InstRW<[M55WriteCX_SI], (instregex "t2LD[AC]", "t2STL", "t2STRD")>;
+def : InstRW<[M55WriteCX_SI], (instregex "MVE_[SU]Q?R?SH[LR]$")>;
+def : InstRW<[M55WriteCX_SI, M55WriteLat2], (instregex "MVE_ASRL", "MVE_LSLL",
+            "MVE_LSRL", "MVE_[SU]Q?R?SH[LR]L")>;
+// This may be higher in practice, but that likely doesn't make a 
diff erence
+// for scheduling
+def : InstRW<[M55WriteCX_SI], (instregex "t2CLRM")>;
+
+def : InstRW<[M55WriteCX_DI], (instregex "t2LDR[BH]?i12$", "t2LDRS?[BH]?s$",
+           "t2LDM")>;
+def : InstRW<[M55WriteCX_DI], (instregex "tLDM", "tLDRBi$", "tLDRBr$",
+           "tLDRHi$", "tLDRHr$", "tLDRSB$", "tLDRSH$", "tLDRi$", "tLDRpci$",
+           "tLDRr$", "tLDRspi$")>;
+
+// Dual Issue instructions
+let Latency = 1, SingleIssue = 0 in {
+    def : WriteRes<WriteNoop, []>;
+    def M55WriteDI : SchedWriteRes<[]>;
+}
+
+def : InstRW<[M55WriteDI], (instregex "tADDi[38]$", "tSUBi[38]$", "tMOVi8$",
+           "tMOVr$", "tUXT[BH]$", "tSXT[BH]$")>;
+// Thumb 2 instructions that could be reduced to a dual issuable Thumb 1
+// instruction above.
+def : InstRW<[M55WriteDI], (instregex "t2ADDS?ri$", "t2MOV[ir]$", "t2MOVi16$",
+           "t2MOVr$", "t2SUBS?ri$", "t2[US]XT[BH]$")>;
+def : InstRW<[M55WriteDI], (instregex "t2IT", "IT")>;
+
+
+def : InstRW<[M55WriteLat0], (instregex "t2LoopDec")>;
+
+// Forwarding
+
+// No forwarding in the ALU normally
+def : ReadAdvance<ReadALU, 0>;
+def : ReadAdvance<ReadALUsr, 0>;
+def : ReadAdvance<ReadMUL, 0>;
+def : ReadAdvance<ReadMAC, 0>;
+
+//=============//
+// MVE and VFP //
+//=============//
+
+// The Writes that take ResourceCycles=[2] are MVE instruction, the others VFP.
+
+let SingleIssue = 1, Latency = 1 in {
+  def M55WriteLSE2 : SchedWriteRes<[M55UnitLoadStore]>;
+  def M55WriteIntE2 : SchedWriteRes<[M55UnitVecALU]>;
+  def M55WriteFloatE2 : SchedWriteRes<[M55UnitVecFPALU]>;
+  def M55WriteSysE2 : SchedWriteRes<[M55UnitVecSys]>;
+
+  def M55Write2LSE2 : SchedWriteRes<[M55UnitLoadStore]> { let ResourceCycles=[2]; }
+  def M55Write2IntE2 : SchedWriteRes<[M55UnitVecALU]> { let ResourceCycles=[2]; }
+  def M55Write2FloatE2 : SchedWriteRes<[M55UnitVecFPALU]> { let ResourceCycles=[2]; }
+  def M55Write2IntFPE2 : SchedWriteRes<[M55UnitVecIntFP]> { let ResourceCycles=[2]; }
+}
+
+let SingleIssue = 1, Latency = 2 in {
+  def M55WriteLSE3 : SchedWriteRes<[M55UnitLoadStore]>;
+  def M55WriteIntE3 : SchedWriteRes<[M55UnitVecALU]>;
+  def M55WriteFloatE3 : SchedWriteRes<[M55UnitVecFPALU]>;
+
+  def M55Write2LSE3 : SchedWriteRes<[M55UnitLoadStore]> { let ResourceCycles=[2]; }
+  def M55Write2IntE3 : SchedWriteRes<[M55UnitVecALU]> { let ResourceCycles=[2]; }
+  def M55Write2FloatE3 : SchedWriteRes<[M55UnitVecFPALU]> { let ResourceCycles=[2]; }
+}
+
+let SingleIssue = 1, Latency = 3 in {
+  def M55Write2IntE3Plus1 : SchedWriteRes<[M55UnitVecALU]> { let ResourceCycles=[2]; }
+
+  // Same as M55Write2IntE3/M55Write2FloatE3 above, but longer latency and no forwarding into stores
+  def M55Write2IntE4NoFwd : SchedWriteRes<[M55UnitVecALU]> { let ResourceCycles=[2]; }
+  def M55Write2FloatE4NoFwd : SchedWriteRes<[M55UnitVecFPALU]> { let ResourceCycles=[2]; }
+}
+let SingleIssue = 1, Latency = 4 in {
+  def M55Write2IntE3Plus2 : SchedWriteRes<[M55UnitVecALU]> { let ResourceCycles=[2]; }
+  def M55WriteFloatE3Plus2 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+let SingleIssue = 1, Latency = 9 in {
+  def M55WriteFloatE3Plus7 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+let SingleIssue = 1, Latency = 15 in {
+  def M55WriteFloatE3Plus13 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+let SingleIssue = 1, Latency = 16 in {
+  def M55WriteFloatE3Plus14 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+let SingleIssue = 1, Latency = 21 in {
+  def M55WriteFloatE3Plus19 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+// VMUL (Double precision) + VADD (Double precision)
+let SingleIssue = 1, Latency = 24 in {
+  def M55WriteFloatE3Plus22 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+let SingleIssue = 1, Latency = 30 in {
+  def M55WriteFloatE3Plus28 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+let SingleIssue = 1, Latency = 36 in {
+  def M55WriteFloatE3Plus34 : SchedWriteRes<[M55UnitVecFPALU]>;
+}
+
+def M55Read0 : SchedReadAdvance<0>;
+def M55Read1 : SchedReadAdvance<1, [M55Write2LSE3, M55Write2IntE3, M55Write2FloatE3]>;
+def M55GatherQRead : SchedReadAdvance<-4>;
+
+// MVE instructions
+
+// Loads and Stores of 
diff erent kinds
+
+// Normal loads
+def : InstRW<[M55Write2LSE2], (instregex "MVE_VLDR(B|H|W)(S|U)(8|16|32)$")>;
+// Pre/post inc loads
+def : InstRW<[M55WriteLat1, M55Write2LSE2], (instregex "MVE_VLDR(B|H|W)(S|U)(8|16|32)_(post|pre)$")>;
+// Gather loads
+def : InstRW<[M55Write2LSE3, M55Read0, M55GatherQRead], (instregex "MVE_VLDR(B|H|W|D)(S|U)(8|16|32|64)_rq")>;
+def : InstRW<[M55Write2LSE3, M55GatherQRead], (instregex "MVE_VLDR(B|H|W|D)(S|U)(8|16|32|64)_qi$")>;
+def : InstRW<[M55WriteLat1, M55Write2LSE3, M55GatherQRead], (instregex "MVE_VLDR(W|D)U(32|64)_qi_pre$")>;
+// Interleaving loads
+def : InstRW<[M55Write2LSE2], (instregex "MVE_VLD[24][0-3]_(8|16|32)$")>;
+// Interleaving loads with wb
+def : InstRW<[M55Write2LSE2, M55WriteLat1], (instregex "MVE_VLD[24][0-3]_(8|16|32)_wb$")>;
+
+// Normal stores
+def : InstRW<[M55Write2LSE2, M55Read1], (instregex "MVE_VSTR(B|H|W)U?(8|16|32)$")>;
+// Pre/post inc stores
+def : InstRW<[M55Write2LSE2, M55Read1], (instregex "MVE_VSTR(B|H|W)U?(8|16|32)_(post|pre)$")>;
+// Scatter stores
+def : InstRW<[M55Write2LSE2, M55Read0, M55Read0, M55GatherQRead], (instregex "MVE_VSTR(B|H|W|D)(8|16|32|64)_rq")>;
+def : InstRW<[M55Write2LSE2, M55Read0, M55GatherQRead], (instregex "MVE_VSTR(B|H|W|D)(8|16|32|64)_qi")>;
+// Interleaving stores
+def : InstRW<[M55Write2LSE2], (instregex "MVE_VST(2|4)")>;
+
+// Integer pipe operations
+
+def : InstRW<[M55Write2IntE3Plus1], (instregex "MVE_VABAV")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VABD(u|s)")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VABS(u|s)")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_VADC")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VADD(_qr_)?i")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VAND")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VBIC")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VBRSR")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VCADDi")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VCLS")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VCLZ")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_V(D|I)?W?DUP")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VEOR")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VHADD")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VHCADD")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VHSUB")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_V(MAX|MIN)A?(s|u)")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_V(MAX|MIN)A?V(s|u)8")>;
+def : InstRW<[M55Write2IntE3Plus1], (instregex "MVE_V(MAX|MIN)A?V(s|u)16")>;
+def : InstRW<[M55Write2IntE3Plus2], (instregex "MVE_V(MAX|MIN)A?V(s|u)32")>;
+def : InstRW<[M55Write2IntE4NoFwd], (instregex "MVE_VMOVN")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VMOVL")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_VMULL[BT]p")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VMVN")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VNEG(u|s)")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VORN")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VORR")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VPSEL")>;
+def : InstRW<[M55Write2IntE2], (instregex "MQPRCopy")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VQABS")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VQADD")>;
+def : InstRW<[M55Write2IntE4NoFwd], (instregex "MVE_VQMOV")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VQNEG")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VSHL")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_V[QR]SHL")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_VQRSHL")>;
+def : InstRW<[M55Write2IntE4NoFwd], (instregex "MVE_VQ?R?SHRU?N")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VSHR_")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_VRSHR_")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VQSUB")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VREV")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VRHADD")>;
+def : InstRW<[M55Write2IntE3], (instregex "MVE_VSBC")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VSLI")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VSRI")>;
+def : InstRW<[M55Write2IntE2], (instregex "MVE_VSUB(_qr_)?i")>;
+
+// FP/Mul pipe operations.
+
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VABDf")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VABSf")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VADDf")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VADD_qr_f")>;
+def : InstRW<[M55Write2FloatE3, M55WriteLat1], (instregex "MVE_VADDLV")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VADDV")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VCADDf")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCMLA")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCMUL")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VCMP(i|s|u)", "MVE_VPTv(4|8|16)(i|s|u)")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VCMPf", "MVE_VPTv(4|8)f")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCVTf16(u|s)16")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCVTf32(u|s)32")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCVT(u|s)16f16")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCVT(u|s)32f32")>;
+def : InstRW<[M55Write2FloatE4NoFwd], (instregex "MVE_VCVTf16f32")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VCVTf32f16")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VFM(A|S)")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_V(MIN|MAX)NM")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VMOV_from_lane")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VMOV_rr_q")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VMOVi")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VMUL(_qr_)?[if]")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VQ?R?D?MULH")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VQ?D?MULL[TB]?[su]")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VQDMULL_qr_")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VQ?R?D?ML(A|S)[^L]")>;
+def : InstRW<[M55Write2FloatE3, M55WriteLat1], (instregex "MVE_VR?ML(A|S)L")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VNEGf")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VRINTf")>;
+def : InstRW<[M55Write2FloatE2], (instregex "MVE_VSUBf")>;
+def : InstRW<[M55Write2FloatE3], (instregex "MVE_VSUB_qr_f")>;
+
+// Some VMOV's can go down either pipeline.
+def : InstRW<[M55Write2IntFPE2], (instregex "MVE_VMOV_to_lane", "MVE_VMOV_q_rr")>;
+
+def : InstRW<[M55WriteSysE2], (instregex "MVE_VCTP")>;
+def : InstRW<[M55WriteSysE2], (instregex "MVE_VPNOT")>;
+def : InstRW<[M55WriteSysE2], (instregex "MVE_VPST")>;
+
+
+// VFP instructions
+
+def : SchedAlias<WriteFPCVT, M55WriteFloatE3>;
+def : SchedAlias<WriteFPMOV, M55WriteFloatE3>;
+def : SchedAlias<WriteFPALU32, M55WriteFloatE3>;
+def : SchedAlias<WriteFPALU64, M55WriteFloatE3Plus13>;
+def : SchedAlias<WriteFPMUL32, M55WriteFloatE3>;
+def : SchedAlias<WriteFPMUL64, M55WriteFloatE3Plus19>;
+def : SchedAlias<WriteFPMAC32, M55WriteFloatE3Plus2>;
+def : SchedAlias<WriteFPMAC64, M55WriteFloatE3Plus34>;
+def : SchedAlias<WriteFPDIV32, M55WriteFloatE3Plus14>;
+def : SchedAlias<WriteFPDIV64, M55WriteFloatE3Plus28>;
+def : SchedAlias<WriteFPSQRT32, M55WriteFloatE3Plus14>;
+def : SchedAlias<WriteFPSQRT64, M55WriteFloatE3Plus28>;
+def : ReadAdvance<ReadFPMUL, 0>;
+def : ReadAdvance<ReadFPMAC, 0>;
+
+def : InstRW<[M55WriteLSE3], (instregex "VLD")>;
+def : InstRW<[M55WriteLSE2], (instregex "VST")>;
+def : InstRW<[M55WriteLSE3], (instregex "VLLD", "VLST")>;
+
+def : InstRW<[M55WriteFloatE3], (instregex "VABS(H|S|D)")>;
+def : InstRW<[M55WriteFloatE3], (instregex "VCVT(A|M|N|P|R|X|Z)(S|U)(H|S|D)")>;
+def : InstRW<[M55WriteFloatE3], (instregex "VCVT(B|T)(DH|HD)")>;
+def : InstRW<[M55WriteFloatE2], (instregex "VCMPZ?(E|H|S|D)")>;
+def : InstRW<[M55WriteFloatE3Plus7], (instregex "VDIVH")>;
+def : InstRW<[M55WriteFloatE3], (instregex "VFN?M(A|S)(H|S)")>; // VFMA
+def : InstRW<[M55WriteFloatE3Plus22], (instregex "VFN?M(A|S)D")>; // VFMA
+def : InstRW<[M55WriteFloatE3], (instregex "VFP_V(MAX|MIN)NM")>;
+def : InstRW<[M55WriteFloatE3], (instregex "VINSH$", "VMOVH$", "VMOVHR$", "VMOVSR$", "VMOVDRR$")>; // VINS, VMOVX, to-FP reg movs
+def : InstRW<[M55WriteFloatE2], (instregex "VMOVD$", "VMOVS$", "VMOVR")>; // Other VMOV's
+def : InstRW<[M55WriteFloatE2], (instregex "FCONSTH", "FCONSTS", "FCONSTD")>;
+def : InstRW<[M55WriteFloatE2], (instregex "VGETLNi32", "VSETLNi32")>;
+def : InstRW<[M55WriteFloatE2], (instregex "VMSR", "VMRS")>;
+def : InstRW<[M55WriteFloatE3Plus2], (instregex "VN?ML(A|S)H")>; // VMLA
+def : InstRW<[M55WriteFloatE3], (instregex "VNEG(H|S|D)")>;
+def : InstRW<[M55WriteFloatE3], (instregex "VRINT(A|M|N|P|R|X|Z)(H|S|D)")>;
+def : InstRW<[M55WriteFloatE3], (instregex "VSEL..(H|S|D)")>;
+def : InstRW<[M55WriteFloatE3Plus7], (instregex "VSQRTH")>;
+
+def : WriteRes<WriteVLD1, []>;
+def : WriteRes<WriteVLD2, []>;
+def : WriteRes<WriteVLD3, []>;
+def : WriteRes<WriteVLD4, []>;
+def : WriteRes<WriteVST1, []>;
+def : WriteRes<WriteVST2, []>;
+def : WriteRes<WriteVST3, []>;
+def : WriteRes<WriteVST4, []>;
+
+}

diff  --git a/llvm/test/CodeGen/Thumb2/LowOverheadLoops/spillingmove.ll b/llvm/test/CodeGen/Thumb2/LowOverheadLoops/spillingmove.ll
index 597502c3596f4..a687eac32dfce 100644
--- a/llvm/test/CodeGen/Thumb2/LowOverheadLoops/spillingmove.ll
+++ b/llvm/test/CodeGen/Thumb2/LowOverheadLoops/spillingmove.ll
@@ -191,41 +191,43 @@ define void @__arm_2d_impl_rgb16_colour_filling_with_alpha_sched(i16* noalias no
 ; CHECK-NEXT:    push {r4, r5, r6, r7, lr}
 ; CHECK-NEXT:    sub sp, #4
 ; CHECK-NEXT:    vpush {d8, d9, d10, d11, d12, d13, d14, d15}
-; CHECK-NEXT:    sub sp, #64
+; CHECK-NEXT:    sub sp, #80
 ; CHECK-NEXT:    ldrsh.w r12, [r2, #2]
 ; CHECK-NEXT:    cmp.w r12, #1
 ; CHECK-NEXT:    blt.w .LBB1_6
 ; CHECK-NEXT:  @ %bb.1: @ %for.cond3.preheader.lr.ph
 ; CHECK-NEXT:    ldrsh.w r2, [r2]
 ; CHECK-NEXT:    cmp r2, #1
-; CHECK-NEXT:    blt.w .LBB1_6
+; CHECK-NEXT:    blt .LBB1_6
 ; CHECK-NEXT:  @ %bb.2: @ %for.cond3.preheader.us.preheader
-; CHECK-NEXT:    ldr r7, [sp, #152]
-; CHECK-NEXT:    movs r4, #252
-; CHECK-NEXT:    lsls r6, r3, #3
-; CHECK-NEXT:    and.w r4, r4, r3, lsr #3
-; CHECK-NEXT:    uxtb r6, r6
+; CHECK-NEXT:    ldr r7, [sp, #168]
 ; CHECK-NEXT:    movs r5, #120
-; CHECK-NEXT:    mul lr, r4, r7
+; CHECK-NEXT:    lsls r6, r3, #3
+; CHECK-NEXT:    movs r4, #252
 ; CHECK-NEXT:    and.w r5, r5, r3, lsr #9
+; CHECK-NEXT:    uxtb r6, r6
+; CHECK-NEXT:    and.w r3, r4, r3, lsr #3
 ; CHECK-NEXT:    muls r6, r7, r6
-; CHECK-NEXT:    vmov.i16 q0, #0x78
-; CHECK-NEXT:    rsb.w r3, r7, #256
+; CHECK-NEXT:    mul lr, r3, r7
+; CHECK-NEXT:    vdup.16 q0, r6
+; CHECK-NEXT:    vstrw.32 q0, [sp, #64] @ 16-byte Spill
+; CHECK-NEXT:    vdup.16 q0, lr
 ; CHECK-NEXT:    muls r5, r7, r5
-; CHECK-NEXT:    lsls r7, r1, #1
 ; CHECK-NEXT:    vstrw.32 q0, [sp, #48] @ 16-byte Spill
-; CHECK-NEXT:    vdup.16 q4, r6
+; CHECK-NEXT:    vmov.i16 q0, #0xfc
 ; CHECK-NEXT:    mov.w r6, #2016
-; CHECK-NEXT:    vdup.16 q0, lr
-; CHECK-NEXT:    movs r4, #0
-; CHECK-NEXT:    vmov.i16 q2, #0xf8
-; CHECK-NEXT:    vmov.i16 q5, #0xfc
 ; CHECK-NEXT:    vstrw.32 q0, [sp, #32] @ 16-byte Spill
 ; CHECK-NEXT:    vdup.16 q0, r5
-; CHECK-NEXT:    vdup.16 q6, r6
-; CHECK-NEXT:    vmov.i16 q3, #0xf800
+; CHECK-NEXT:    rsb.w r3, r7, #256
+; CHECK-NEXT:    lsls r7, r1, #1
 ; CHECK-NEXT:    vstrw.32 q0, [sp, #16] @ 16-byte Spill
-; CHECK-NEXT:    vstrw.32 q3, [sp] @ 16-byte Spill
+; CHECK-NEXT:    vdup.16 q0, r6
+; CHECK-NEXT:    vmov.i16 q2, #0xf8
+; CHECK-NEXT:    vmov.i16 q5, #0x78
+; CHECK-NEXT:    vstrw.32 q0, [sp] @ 16-byte Spill
+; CHECK-NEXT:    vmov.i16 q6, #0xf800
+; CHECK-NEXT:    movs r4, #0
+; CHECK-NEXT:    vldrw.u32 q7, [sp] @ 16-byte Reload
 ; CHECK-NEXT:    .p2align 2
 ; CHECK-NEXT:  .LBB1_3: @ %vector.ph
 ; CHECK-NEXT:    @ =>This Loop Header: Depth=1
@@ -237,48 +239,39 @@ define void @__arm_2d_impl_rgb16_colour_filling_with_alpha_sched(i16* noalias no
 ; CHECK-NEXT:    @ Parent Loop BB1_3 Depth=1
 ; CHECK-NEXT:    @ => This Inner Loop Header: Depth=2
 ; CHECK-NEXT:    vldrh.u16 q0, [r5]
-; CHECK-NEXT:    vmov.f64 d6, d4
-; CHECK-NEXT:    vmov.f64 d7, d5
 ; CHECK-NEXT:    vshl.i16 q1, q0, #3
+; CHECK-NEXT:    vldrw.u32 q4, [sp, #64] @ 16-byte Reload
 ; CHECK-NEXT:    vand q1, q1, q2
-; CHECK-NEXT:    vmov q2, q4
-; CHECK-NEXT:    vmla.i16 q2, q1, r3
-; CHECK-NEXT:    vshr.u16 q1, q0, #3
-; CHECK-NEXT:    vand q1, q1, q5
-; CHECK-NEXT:    vmov.f64 d14, d10
-; CHECK-NEXT:    vmov.f64 d15, d11
-; CHECK-NEXT:    vmov.f64 d10, d8
-; CHECK-NEXT:    vmov.f64 d11, d9
-; CHECK-NEXT:    vldrw.u32 q4, [sp, #32] @ 16-byte Reload
-; CHECK-NEXT:    vshr.u16 q0, q0, #9
 ; CHECK-NEXT:    vmla.i16 q4, q1, r3
-; CHECK-NEXT:    vldrw.u32 q1, [sp, #48] @ 16-byte Reload
+; CHECK-NEXT:    vmov.f64 d6, d4
+; CHECK-NEXT:    vmov.f64 d7, d5
+; CHECK-NEXT:    vldrw.u32 q1, [sp, #32] @ 16-byte Reload
+; CHECK-NEXT:    vshr.u16 q2, q0, #9
+; CHECK-NEXT:    vshr.u16 q0, q0, #3
 ; CHECK-NEXT:    vand q0, q0, q1
-; CHECK-NEXT:    vldrw.u32 q1, [sp, #16] @ 16-byte Reload
+; CHECK-NEXT:    vldrw.u32 q1, [sp, #48] @ 16-byte Reload
 ; CHECK-NEXT:    vmla.i16 q1, q0, r3
-; CHECK-NEXT:    vshr.u16 q0, q2, #11
-; CHECK-NEXT:    vshr.u16 q2, q4, #5
-; CHECK-NEXT:    vand q2, q2, q6
-; CHECK-NEXT:    vorr q0, q2, q0
-; CHECK-NEXT:    vmov.f64 d4, d6
-; CHECK-NEXT:    vmov.f64 d5, d7
-; CHECK-NEXT:    vldrw.u32 q3, [sp] @ 16-byte Reload
-; CHECK-NEXT:    vmov.f64 d8, d10
-; CHECK-NEXT:    vmov.f64 d9, d11
-; CHECK-NEXT:    vand q1, q1, q3
+; CHECK-NEXT:    vand q2, q2, q5
+; CHECK-NEXT:    vshr.u16 q0, q4, #11
+; CHECK-NEXT:    vldrw.u32 q4, [sp, #16] @ 16-byte Reload
+; CHECK-NEXT:    vshr.u16 q1, q1, #5
+; CHECK-NEXT:    vmla.i16 q4, q2, r3
+; CHECK-NEXT:    vand q1, q1, q7
+; CHECK-NEXT:    vorr q0, q1, q0
+; CHECK-NEXT:    vand q1, q4, q6
 ; CHECK-NEXT:    vorr q0, q0, q1
-; CHECK-NEXT:    vmov.f64 d10, d14
-; CHECK-NEXT:    vmov.f64 d11, d15
 ; CHECK-NEXT:    vstrh.16 q0, [r5], #16
+; CHECK-NEXT:    vmov.f64 d4, d6
+; CHECK-NEXT:    vmov.f64 d5, d7
 ; CHECK-NEXT:    letp lr, .LBB1_4
 ; CHECK-NEXT:  @ %bb.5: @ %for.cond3.for.cond.cleanup7_crit_edge.us
 ; CHECK-NEXT:    @ in Loop: Header=BB1_3 Depth=1
 ; CHECK-NEXT:    adds r4, #1
-; CHECK-NEXT:    cmp r4, r12
 ; CHECK-NEXT:    add r0, r7
+; CHECK-NEXT:    cmp r4, r12
 ; CHECK-NEXT:    bne .LBB1_3
 ; CHECK-NEXT:  .LBB1_6: @ %for.cond.cleanup
-; CHECK-NEXT:    add sp, #64
+; CHECK-NEXT:    add sp, #80
 ; CHECK-NEXT:    vpop {d8, d9, d10, d11, d12, d13, d14, d15}
 ; CHECK-NEXT:    add sp, #4
 ; CHECK-NEXT:    pop {r4, r5, r6, r7, pc}

diff  --git a/llvm/test/CodeGen/Thumb2/aligned-nonfallthrough.ll b/llvm/test/CodeGen/Thumb2/aligned-nonfallthrough.ll
index 7a9b3a5990c60..767b7028a967c 100644
--- a/llvm/test/CodeGen/Thumb2/aligned-nonfallthrough.ll
+++ b/llvm/test/CodeGen/Thumb2/aligned-nonfallthrough.ll
@@ -7,15 +7,15 @@ define i32 @loop(ptr nocapture readonly %x) {
 ; CHECK-NEXT:    .save {r7, lr}
 ; CHECK-NEXT:    push {r7, lr}
 ; CHECK-NEXT:    mov.w lr, #500
-; CHECK-NEXT:    movs r1, #0
+; CHECK-NEXT:    mov r1, r0
+; CHECK-NEXT:    movs r0, #0
 ; CHECK-NEXT:    .p2align 2
 ; CHECK-NEXT:  .LBB0_1: @ %for.body
 ; CHECK-NEXT:    @ =>This Inner Loop Header: Depth=1
-; CHECK-NEXT:    ldr r2, [r0], #4
-; CHECK-NEXT:    add r1, r2
+; CHECK-NEXT:    ldr r2, [r1], #4
+; CHECK-NEXT:    add r0, r2
 ; CHECK-NEXT:    le lr, .LBB0_1
 ; CHECK-NEXT:  @ %bb.2: @ %for.cond.cleanup
-; CHECK-NEXT:    mov r0, r1
 ; CHECK-NEXT:    pop {r7, pc}
 entry:
   br label %for.body
@@ -43,8 +43,8 @@ define i64 @loopif(ptr nocapture readonly %x, i32 %y, i32 %n) {
 ; CHECK-NEXT:    blt .LBB1_4
 ; CHECK-NEXT:  @ %bb.1: @ %for.body.lr.ph
 ; CHECK-NEXT:    mov lr, r2
-; CHECK-NEXT:    dls lr, r2
 ; CHECK-NEXT:    mov r12, r0
+; CHECK-NEXT:    dls lr, r2
 ; CHECK-NEXT:    movs r0, #0
 ; CHECK-NEXT:    movs r3, #0
 ; CHECK-NEXT:    .p2align 2

diff  --git a/llvm/test/CodeGen/Thumb2/mve-pipelineloops.ll b/llvm/test/CodeGen/Thumb2/mve-pipelineloops.ll
index 450aa9b815208..70957ca950d71 100644
--- a/llvm/test/CodeGen/Thumb2/mve-pipelineloops.ll
+++ b/llvm/test/CodeGen/Thumb2/mve-pipelineloops.ll
@@ -4,69 +4,64 @@
 define void @arm_cmplx_dot_prod_q15(ptr noundef %pSrcA, ptr noundef %pSrcB, i32 noundef %numSamples, ptr nocapture noundef writeonly %realResult, ptr nocapture noundef writeonly %imagResult) {
 ; CHECK-LABEL: arm_cmplx_dot_prod_q15:
 ; CHECK:       @ %bb.0: @ %entry
-; CHECK-NEXT:    .save {r4, r5, r6, r7, lr}
-; CHECK-NEXT:    push {r4, r5, r6, r7, lr}
-; CHECK-NEXT:    .pad #4
-; CHECK-NEXT:    sub sp, #4
-; CHECK-NEXT:    .vsave {d8, d9, d10, d11}
-; CHECK-NEXT:    vpush {d8, d9, d10, d11}
-; CHECK-NEXT:    ldr.w r12, [sp, #56]
+; CHECK-NEXT:    .save {r4, r5, r6, r7, r8, lr}
+; CHECK-NEXT:    push.w {r4, r5, r6, r7, r8, lr}
+; CHECK-NEXT:    ldr.w r12, [sp, #24]
 ; CHECK-NEXT:    cmp r2, #16
 ; CHECK-NEXT:    blo .LBB0_5
 ; CHECK-NEXT:  @ %bb.1: @ %while.body.preheader
-; CHECK-NEXT:    lsrs r7, r2, #3
 ; CHECK-NEXT:    movs r6, #2
+; CHECK-NEXT:    lsrs r7, r2, #3
 ; CHECK-NEXT:    rsb r6, r6, r2, lsr #3
 ; CHECK-NEXT:    movs r5, #0
 ; CHECK-NEXT:    cmp r7, #2
 ; CHECK-NEXT:    csel r7, r6, r5, hs
 ; CHECK-NEXT:    add.w lr, r7, #1
-; CHECK-NEXT:    vldrh.u16 q4, [r0], #32
-; CHECK-NEXT:    vldrh.u16 q5, [r1], #32
 ; CHECK-NEXT:    mov r4, r5
+; CHECK-NEXT:    vldrh.u16 q0, [r0], #32
 ; CHECK-NEXT:    movs r7, #0
+; CHECK-NEXT:    mov r8, r5
+; CHECK-NEXT:    vldrh.u16 q1, [r1], #32
+; CHECK-NEXT:    vmlsldava.s16 r4, r7, q0, q1
 ; CHECK-NEXT:    vldrh.u16 q2, [r0, #-16]
-; CHECK-NEXT:    mov r6, r5
-; CHECK-NEXT:    sub.w lr, lr, #1
+; CHECK-NEXT:    vmlaldavax.s16 r8, r5, q0, q1
 ; CHECK-NEXT:    vldrh.u16 q3, [r1, #-16]
-; CHECK-NEXT:    vldrh.u16 q1, [r1], #32
-; CHECK-NEXT:    vldrh.u16 q0, [r0], #32
-; CHECK-NEXT:    vmlsldava.s16 r4, r7, q4, q5
+; CHECK-NEXT:    vmlsldava.s16 r4, r7, q2, q3
+; CHECK-NEXT:    vldrh.u16 q0, [r1], #32
+; CHECK-NEXT:    sub.w lr, lr, #1
 ; CHECK-NEXT:    cmp.w lr, #0
-; CHECK-NEXT:    vmlaldavax.s16 r6, r5, q4, q5
+; CHECK-NEXT:    vldrh.u16 q1, [r0], #32
 ; CHECK-NEXT:    beq .LBB0_3
 ; CHECK-NEXT:    .p2align 2
 ; CHECK-NEXT:  .LBB0_2: @ %while.body
 ; CHECK-NEXT:    @ =>This Inner Loop Header: Depth=1
-; CHECK-NEXT:    vmlaldavax.s16 r6, r5, q2, q3
-; CHECK-NEXT:    vmlsldava.s16 r4, r7, q2, q3
-; CHECK-NEXT:    vldrh.u16 q2, [r0, #-16]
-; CHECK-NEXT:    vmlaldavax.s16 r6, r5, q0, q1
-; CHECK-NEXT:    vmlsldava.s16 r4, r7, q0, q1
-; CHECK-NEXT:    vldrh.u16 q0, [r0], #32
+; CHECK-NEXT:    vmlaldavax.s16 r8, r5, q2, q3
 ; CHECK-NEXT:    vldrh.u16 q3, [r1, #-16]
-; CHECK-NEXT:    vldrh.u16 q1, [r1], #32
+; CHECK-NEXT:    vmlsldava.s16 r4, r7, q1, q0
+; CHECK-NEXT:    vldrh.u16 q2, [r0, #-16]
+; CHECK-NEXT:    vmlaldavax.s16 r8, r5, q1, q0
+; CHECK-NEXT:    vldrh.u16 q1, [r0], #32
+; CHECK-NEXT:    vmlsldava.s16 r4, r7, q2, q3
+; CHECK-NEXT:    vldrh.u16 q0, [r1], #32
 ; CHECK-NEXT:    le lr, .LBB0_2
 ; CHECK-NEXT:  .LBB0_3:
-; CHECK-NEXT:    mov.w lr, #14
-; CHECK-NEXT:    vmlsldava.s16 r4, r7, q2, q3
-; CHECK-NEXT:    vmlaldavax.s16 r6, r5, q2, q3
-; CHECK-NEXT:    and.w r2, lr, r2, lsl #1
-; CHECK-NEXT:    vmlaldavax.s16 r6, r5, q0, q1
+; CHECK-NEXT:    vmlaldavax.s16 r8, r5, q2, q3
+; CHECK-NEXT:    movs r6, #14
+; CHECK-NEXT:    and.w r2, r6, r2, lsl #1
+; CHECK-NEXT:    vmlaldavax.s16 r8, r5, q1, q0
 ; CHECK-NEXT:    vldrh.u16 q2, [r0, #-16]
-; CHECK-NEXT:    vmlsldava.s16 r4, r7, q0, q1
+; CHECK-NEXT:    vmlsldava.s16 r4, r7, q1, q0
 ; CHECK-NEXT:    vldrh.u16 q0, [r1, #-16]
+; CHECK-NEXT:    vmlaldavax.s16 r8, r5, q2, q0
 ; CHECK-NEXT:    vctp.16 r2
-; CHECK-NEXT:    vpstt
-; CHECK-NEXT:    vldrht.u16 q1, [r0]
-; CHECK-NEXT:    vldrht.u16 q3, [r1]
-; CHECK-NEXT:    vmlaldavax.s16 r6, r5, q2, q0
 ; CHECK-NEXT:    vmlsldava.s16 r4, r7, q2, q0
 ; CHECK-NEXT:    vpst
-; CHECK-NEXT:    vmlsldavat.s16 r4, r7, q1, q3
+; CHECK-NEXT:    vldrht.u16 q1, [r0]
 ; CHECK-NEXT:    cmp r2, #9
-; CHECK-NEXT:    vpst
-; CHECK-NEXT:    vmlaldavaxt.s16 r6, r5, q1, q3
+; CHECK-NEXT:    vpsttt
+; CHECK-NEXT:    vldrht.u16 q0, [r1]
+; CHECK-NEXT:    vmlsldavat.s16 r4, r7, q1, q0
+; CHECK-NEXT:    vmlaldavaxt.s16 r8, r5, q1, q0
 ; CHECK-NEXT:    blo .LBB0_10
 ; CHECK-NEXT:  @ %bb.4: @ %do.body.1
 ; CHECK-NEXT:    subs r2, #8
@@ -75,7 +70,7 @@ define void @arm_cmplx_dot_prod_q15(ptr noundef %pSrcA, ptr noundef %pSrcB, i32
 ; CHECK-NEXT:    vldrht.u16 q0, [r0, #16]
 ; CHECK-NEXT:    vldrht.u16 q1, [r1, #16]
 ; CHECK-NEXT:    vmlsldavat.s16 r4, r7, q0, q1
-; CHECK-NEXT:    vmlaldavaxt.s16 r6, r5, q0, q1
+; CHECK-NEXT:    vmlaldavaxt.s16 r8, r5, q0, q1
 ; CHECK-NEXT:    b .LBB0_10
 ; CHECK-NEXT:    .p2align 2
 ; CHECK-NEXT:  .LBB0_5: @ %if.else
@@ -96,22 +91,20 @@ define void @arm_cmplx_dot_prod_q15(ptr noundef %pSrcA, ptr noundef %pSrcB, i32
 ; CHECK-NEXT:    vmlaldavax.s16 r4, r5, q0, q1
 ; CHECK-NEXT:    letp lr, .LBB0_7
 ; CHECK-NEXT:  @ %bb.8: @ %if.end.loopexit177
-; CHECK-NEXT:    mov r6, r4
+; CHECK-NEXT:    mov r8, r4
 ; CHECK-NEXT:    mov r4, r2
 ; CHECK-NEXT:    b .LBB0_10
 ; CHECK-NEXT:    .p2align 2
 ; CHECK-NEXT:  .LBB0_9:
 ; CHECK-NEXT:    mov r7, r4
-; CHECK-NEXT:    movs r6, #0
+; CHECK-NEXT:    mov.w r8, #0
 ; CHECK-NEXT:    mov r5, r4
 ; CHECK-NEXT:  .LBB0_10: @ %if.end
 ; CHECK-NEXT:    asrl r4, r7, #6
-; CHECK-NEXT:    asrl r6, r5, #6
+; CHECK-NEXT:    asrl r8, r5, #6
 ; CHECK-NEXT:    str r4, [r3]
-; CHECK-NEXT:    str.w r6, [r12]
-; CHECK-NEXT:    vpop {d8, d9, d10, d11}
-; CHECK-NEXT:    add sp, #4
-; CHECK-NEXT:    pop {r4, r5, r6, r7, pc}
+; CHECK-NEXT:    str.w r8, [r12]
+; CHECK-NEXT:    pop.w {r4, r5, r6, r7, r8, pc}
 entry:
   %cmp = icmp ugt i32 %numSamples, 15
   br i1 %cmp, label %while.body.preheader, label %if.else

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-fp.s b/llvm/test/tools/llvm-mca/ARM/m55-fp.s
new file mode 100644
index 0000000000000..6318cfa9d6e9c
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-fp.s
@@ -0,0 +1,575 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -instruction-tables < %s | FileCheck %s
+
+vabs.f16 s0, s2
+vabs.f32 s0, s2
+vabs.f64 d0, d2
+vadd.f16 s0, s2, s1
+vadd.f32 s0, s2, s1
+vadd.f64 d0, d2, d1
+vcmp.f16 s1, s2
+vcmp.f32 s1, s2
+vcmp.f64 d1, d2
+vcmp.f16 s1, #0.0
+vcmp.f32 s1, #0.0
+vcmp.f64 d1, #0.0
+vcmpe.f16 s1, s2
+vcmpe.f32 s1, s2
+vcmpe.f64 d1, d2
+vcmpe.f16 s1, #0.0
+vcmpe.f32 s1, #0.0
+vcmpe.f64 d1, #0.0
+vcvt.f32.f64 s1, d2
+vcvt.f64.f32 d1, s1
+vcvt.f16.u16 s1, s2, #8
+vcvt.f16.s16 s1, s2, #8
+vcvt.f16.u32 s1, s2, #8
+vcvt.f16.s32 s1, s2, #8
+vcvt.u16.f16 s1, s2, #8
+vcvt.s16.f16 s1, s2, #8
+vcvt.u32.f16 s1, s2, #8
+vcvt.s32.f16 s1, s2, #8
+vcvt.f32.u16 s1, s2, #8
+vcvt.f32.s16 s1, s2, #8
+vcvt.f32.u32 s1, s2, #8
+vcvt.f32.s32 s1, s2, #8
+vcvt.u16.f32 s1, s2, #8
+vcvt.s16.f32 s1, s2, #8
+vcvt.u32.f32 s1, s2, #8
+vcvt.s32.f32 s1, s2, #8
+vcvt.f64.u16 d1, d2, #8
+vcvt.f64.s16 d1, d2, #8
+vcvt.f64.u32 d1, d2, #8
+vcvt.f64.s32 d1, d2, #8
+vcvt.u16.f64 d1, d2, #8
+vcvt.s16.f64 d1, d2, #8
+vcvt.u32.f64 d1, d2, #8
+vcvt.s32.f64 d1, d2, #8
+vcvt.u32.f16 s1, s2
+vcvt.s32.f16 s1, s2
+vcvt.u32.f32 s1, s2
+vcvt.s32.f32 s1, s2
+vcvt.u32.f64 s1, d2
+vcvt.s32.f64 s1, d2
+vcvt.f16.u32 s1, s2
+vcvt.f16.s32 s1, s2
+vcvt.f32.u32 s1, s2
+vcvt.f32.s32 s1, s2
+vcvt.f64.u32 d1, s2
+vcvt.f64.s32 d1, s2
+vcvta.u32.f16 s1, s2
+vcvta.s32.f16 s1, s2
+vcvta.u32.f32 s1, s2
+vcvta.s32.f32 s1, s2
+vcvta.u32.f64 s1, d2
+vcvta.s32.f64 s1, d2
+vcvtm.u32.f16 s1, s2
+vcvtm.s32.f16 s1, s2
+vcvtm.u32.f32 s1, s2
+vcvtm.s32.f32 s1, s2
+vcvtm.u32.f64 s1, d2
+vcvtm.s32.f64 s1, d2
+vcvtn.u32.f16 s1, s2
+vcvtn.s32.f16 s1, s2
+vcvtn.u32.f32 s1, s2
+vcvtn.s32.f32 s1, s2
+vcvtn.u32.f64 s1, d2
+vcvtn.s32.f64 s1, d2
+vcvtp.u32.f16 s1, s2
+vcvtp.s32.f16 s1, s2
+vcvtp.u32.f32 s1, s2
+vcvtp.s32.f32 s1, s2
+vcvtp.u32.f64 s1, d2
+vcvtp.s32.f64 s1, d2
+vcvtb.f16.f32 s1, s2
+vcvtb.f16.f64 s1, d2
+vcvtb.f32.f16 s1, s2
+vcvtb.f64.f16 d1, s2
+vcvtr.u32.f16 s1, s2
+vcvtr.s32.f16 s1, s2
+vcvtr.u32.f32 s1, s2
+vcvtr.s32.f32 s1, s2
+vcvtr.u32.f64 s1, d2
+vcvtr.s32.f64 s1, d2
+vcvtt.f16.f32 s1, s2
+vcvtt.f16.f64 s1, d2
+vcvtt.f32.f16 s1, s2
+vcvtt.f64.f16 d1, s2
+vdiv.f16 s0, s2, s1
+vdiv.f32 s0, s2, s1
+vdiv.f64 d0, d2, d1
+vfma.f16 s0, s2, s1
+vfma.f32 s0, s2, s1
+vfma.f64 d0, d2, d1
+vfms.f16 s0, s2, s1
+vfms.f32 s0, s2, s1
+vfms.f64 d0, d2, d1
+vfnma.f16 s0, s2, s1
+vfnma.f32 s0, s2, s1
+vfnma.f64 d0, d2, d1
+vfnms.f16 s0, s2, s1
+vfnms.f32 s0, s2, s1
+vfnms.f64 d0, d2, d1
+vins.f16 s0, s1
+vmaxnm.f16 s0, s2, s1
+vmaxnm.f32 s0, s2, s1
+vmaxnm.f64 d0, d2, d1
+vminnm.f16 s0, s2, s1
+vminnm.f32 s0, s2, s1
+vminnm.f64 d0, d2, d1
+vmla.f16 s0, s2, s1
+vmla.f32 s0, s2, s1
+vmla.f64 d0, d2, d1
+vmls.f16 s0, s2, s1
+vmls.f32 s0, s2, s1
+vmls.f64 d0, d2, d1
+vmov.f16 s0, r1
+vmov.f16 r0, s1
+vmov.f32 s0, r1
+vmov.f32 r0, s1
+vmov.f64 d0, r1, r2
+vmov.f64 r0, r1, d1
+vmov s0, s1, r0, r1
+vmov r0, r1, s0, s1
+vmov.f16 s0, #1.0
+vmov.f32 s0, #1.0
+vmov.f64 d0, #1.0
+vmov.f32 s0, s1
+vmov.f64 d0, d1
+vmovx.f16 s0, s1
+vmul.f16 s0, s2, s1
+vmul.f32 s0, s2, s1
+vmul.f64 d0, d2, d1
+vneg.f16 s0, s2
+vneg.f32 s0, s2
+vneg.f64 d0, d2
+vnmla.f16 s0, s2, s1
+vnmla.f32 s0, s2, s1
+vnmla.f64 d0, d2, d1
+vnmls.f16 s0, s2, s1
+vnmls.f32 s0, s2, s1
+vnmls.f64 d0, d2, d1
+vnmul.f16 s0, s2, s1
+vnmul.f32 s0, s2, s1
+vnmul.f64 d0, d2, d1
+vrinta.f16 s0, s2
+vrinta.f32.f32 s0, s2
+vrinta.f64.f64 d0, d2
+vrintm.f16 s0, s2
+vrintm.f32.f32 s0, s2
+vrintm.f64.f64 d0, d2
+vrintn.f16 s0, s2
+vrintn.f32.f32 s0, s2
+vrintn.f64.f64 d0, d2
+vrintp.f16 s0, s2
+vrintp.f32.f32 s0, s2
+vrintp.f64.f64 d0, d2
+vrintr.f16.f16 s0, s2
+vrintr.f32.f32 s0, s2
+vrintr.f64.f64 d0, d2
+vrintz.f16.f16 s0, s2
+vrintz.f32.f32 s0, s2
+vrintz.f64.f64 d0, d2
+vrintx.f16.f16 s0, s2
+vrintx.f32.f32 s0, s2
+vrintx.f64.f64 d0, d2
+vseleq.f16 s0, s2, s1
+vseleq.f32 s0, s2, s1
+vseleq.f64 d0, d2, d1
+vsqrt.f16 s0, s2
+vsqrt.f32 s0, s2
+vsqrt.f64 d0, d2
+vsub.f16 s0, s2, s1
+vsub.f32 s0, s2, s1
+vsub.f64 d0, d2, d1
+
+#vldr pc
+#vldr [rn + value]
+#vstr pc
+#vstr [rn + value]
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      2     1.00                        vabs.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vabs.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vabs.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vadd.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vadd.f32	s0, s2, s1
+# CHECK-NEXT:  1      15    1.00                        vadd.f64	d0, d2, d1
+# CHECK-NEXT:  1      1     1.00                        vcmp.f16	s1, s2
+# CHECK-NEXT:  1      1     1.00                        vcmp.f32	s1, s2
+# CHECK-NEXT:  1      1     1.00                        vcmp.f64	d1, d2
+# CHECK-NEXT:  1      1     1.00                        vcmp.f16	s1, #0
+# CHECK-NEXT:  1      1     1.00                        vcmp.f32	s1, #0
+# CHECK-NEXT:  1      1     1.00                        vcmp.f64	d1, #0
+# CHECK-NEXT:  1      1     1.00                        vcmpe.f16	s1, s2
+# CHECK-NEXT:  1      1     1.00                        vcmpe.f32	s1, s2
+# CHECK-NEXT:  1      1     1.00                        vcmpe.f64	d1, d2
+# CHECK-NEXT:  1      1     1.00                        vcmpe.f16	s1, #0
+# CHECK-NEXT:  1      1     1.00                        vcmpe.f32	s1, #0
+# CHECK-NEXT:  1      1     1.00                        vcmpe.f64	d1, #0
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.f32	d1, s1
+# CHECK-NEXT:  1      2     1.00                        vcvt.f16.u16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f16.s16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f16.u32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f16.s32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u16.f16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.s16.f16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u32.f16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.s32.f16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.u16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.s16	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.u32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.s32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u16.f32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.s16.f32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u32.f32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.s32.f32	s1, s1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.u16	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.s16	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.u32	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.s32	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u16.f64	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.s16.f64	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u32.f64	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.s32.f64	d1, d1, #8
+# CHECK-NEXT:  1      2     1.00                        vcvt.u32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.s32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.u32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.s32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.u32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvt.s32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f16.u32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f16.s32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.u32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f32.s32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.u32	d1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvt.f64.s32	d1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvta.u32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvta.s32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvta.u32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvta.s32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvta.u32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvta.s32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtm.u32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtm.s32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtm.u32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtm.s32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtm.u32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtm.s32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtn.u32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtn.s32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtn.u32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtn.s32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtn.u32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtn.s32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtp.u32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtp.s32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtp.u32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtp.s32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtp.u32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtp.s32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtb.f16.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtb.f16.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtb.f32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtb.f64.f16	d1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtr.u32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtr.s32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtr.u32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtr.s32.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtr.u32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtr.s32.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtt.f16.f32	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtt.f16.f64	s1, d2
+# CHECK-NEXT:  1      2     1.00                        vcvtt.f32.f16	s1, s2
+# CHECK-NEXT:  1      2     1.00                        vcvtt.f64.f16	d1, s2
+# CHECK-NEXT:  1      9     1.00                        vdiv.f16	s0, s2, s1
+# CHECK-NEXT:  1      16    1.00                        vdiv.f32	s0, s2, s1
+# CHECK-NEXT:  1      30    1.00                        vdiv.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vfma.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vfma.f32	s0, s2, s1
+# CHECK-NEXT:  1      24    1.00                        vfma.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vfms.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vfms.f32	s0, s2, s1
+# CHECK-NEXT:  1      24    1.00                        vfms.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vfnma.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vfnma.f32	s0, s2, s1
+# CHECK-NEXT:  1      24    1.00                        vfnma.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vfnms.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vfnms.f32	s0, s2, s1
+# CHECK-NEXT:  1      24    1.00                        vfnms.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vins.f16	s0, s1
+# CHECK-NEXT:  1      2     1.00                        vmaxnm.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vmaxnm.f32	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vmaxnm.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vminnm.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vminnm.f32	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vminnm.f64	d0, d2, d1
+# CHECK-NEXT:  1      4     1.00                        vmla.f16	s0, s2, s1
+# CHECK-NEXT:  1      4     1.00                        vmla.f32	s0, s2, s1
+# CHECK-NEXT:  1      36    1.00                        vmla.f64	d0, d2, d1
+# CHECK-NEXT:  1      4     1.00                        vmls.f16	s0, s2, s1
+# CHECK-NEXT:  1      4     1.00                        vmls.f32	s0, s2, s1
+# CHECK-NEXT:  1      36    1.00                        vmls.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vmov.f16	s0, r1
+# CHECK-NEXT:  1      1     1.00                        vmov.f16	r0, s1
+# CHECK-NEXT:  1      2     1.00                        vmov	s0, r1
+# CHECK-NEXT:  1      1     1.00                        vmov	r0, s1
+# CHECK-NEXT:  1      2     1.00                        vmov	d0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        vmov	r0, r1, d1
+# CHECK-NEXT:  1      2     1.00                        vmov	s0, s1, r0, r1
+# CHECK-NEXT:  1      1     1.00                        vmov	r0, r1, s0, s1
+# CHECK-NEXT:  1      1     1.00                        vmov.f16	s0, #1.000000e+00
+# CHECK-NEXT:  1      1     1.00                        vmov.f32	s0, #1.000000e+00
+# CHECK-NEXT:  1      1     1.00                        vmov.f64	d0, #1.000000e+00
+# CHECK-NEXT:  1      1     1.00                        vmov.f32	s0, s1
+# CHECK-NEXT:  1      1     1.00                        vmov.f64	d0, d1
+# CHECK-NEXT:  1      2     1.00                        vmovx.f16	s0, s1
+# CHECK-NEXT:  1      2     1.00                        vmul.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vmul.f32	s0, s2, s1
+# CHECK-NEXT:  1      21    1.00                        vmul.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vneg.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vneg.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vneg.f64	d0, d2
+# CHECK-NEXT:  1      4     1.00                        vnmla.f16	s0, s2, s1
+# CHECK-NEXT:  1      4     1.00                        vnmla.f32	s0, s2, s1
+# CHECK-NEXT:  1      36    1.00                        vnmla.f64	d0, d2, d1
+# CHECK-NEXT:  1      4     1.00                        vnmls.f16	s0, s2, s1
+# CHECK-NEXT:  1      4     1.00                        vnmls.f32	s0, s2, s1
+# CHECK-NEXT:  1      36    1.00                        vnmls.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vnmul.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vnmul.f32	s0, s2, s1
+# CHECK-NEXT:  1      21    1.00                        vnmul.f64	d0, d2, d1
+# CHECK-NEXT:  1      2     1.00                        vrinta.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrinta.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrinta.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vrintm.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintm.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintm.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vrintn.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintn.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintn.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vrintp.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintp.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintp.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vrintr.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintr.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintr.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vrintz.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintz.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintz.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vrintx.f16	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintx.f32	s0, s2
+# CHECK-NEXT:  1      2     1.00                        vrintx.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vseleq.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vseleq.f32	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vseleq.f64	d0, d2, d1
+# CHECK-NEXT:  1      9     1.00                        vsqrt.f16	s0, s2
+# CHECK-NEXT:  1      16    1.00                        vsqrt.f32	s0, s2
+# CHECK-NEXT:  1      30    1.00                        vsqrt.f64	d0, d2
+# CHECK-NEXT:  1      2     1.00                        vsub.f16	s0, s2, s1
+# CHECK-NEXT:  1      2     1.00                        vsub.f32	s0, s2, s1
+# CHECK-NEXT:  1      15    1.00                        vsub.f64	d0, d2, d1
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT:  -      -      -     181.00  -
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT:  -      -      -     1.00    -     vabs.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vabs.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vabs.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vadd.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vadd.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vadd.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vcmp.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcmp.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcmp.f64	d1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcmp.f16	s1, #0
+# CHECK-NEXT:  -      -      -     1.00    -     vcmp.f32	s1, #0
+# CHECK-NEXT:  -      -      -     1.00    -     vcmp.f64	d1, #0
+# CHECK-NEXT:  -      -      -     1.00    -     vcmpe.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcmpe.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcmpe.f64	d1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcmpe.f16	s1, #0
+# CHECK-NEXT:  -      -      -     1.00    -     vcmpe.f32	s1, #0
+# CHECK-NEXT:  -      -      -     1.00    -     vcmpe.f64	d1, #0
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.f32	d1, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f16.u16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f16.s16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f16.u32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f16.s32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u16.f16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s16.f16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u32.f16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s32.f16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.u16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.s16	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.u32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.s32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u16.f32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s16.f32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u32.f32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s32.f32	s1, s1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.u16	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.s16	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.u32	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.s32	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u16.f64	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s16.f64	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u32.f64	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s32.f64	d1, d1, #8
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.u32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.s32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f16.u32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f16.s32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.u32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f32.s32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.u32	d1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvt.f64.s32	d1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvta.u32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvta.s32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvta.u32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvta.s32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvta.u32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvta.s32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtm.u32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtm.s32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtm.u32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtm.s32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtm.u32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtm.s32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtn.u32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtn.s32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtn.u32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtn.s32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtn.u32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtn.s32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtp.u32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtp.s32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtp.u32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtp.s32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtp.u32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtp.s32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtb.f16.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtb.f16.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtb.f32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtb.f64.f16	d1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtr.u32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtr.s32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtr.u32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtr.s32.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtr.u32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtr.s32.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtt.f16.f32	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtt.f16.f64	s1, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtt.f32.f16	s1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vcvtt.f64.f16	d1, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vdiv.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vdiv.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vdiv.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vfma.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfma.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfma.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vfms.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfms.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfms.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vfnma.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfnma.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfnma.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vfnms.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfnms.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vfnms.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vins.f16	s0, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmaxnm.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmaxnm.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmaxnm.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vminnm.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vminnm.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vminnm.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vmla.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmla.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmla.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vmls.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmls.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmls.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f16	s0, r1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f16	r0, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov	s0, r1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov	r0, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov	d0, r1, r2
+# CHECK-NEXT:  -      -      -     1.00    -     vmov	r0, r1, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov	s0, s1, r0, r1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov	r0, r1, s0, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f16	s0, #1.000000e+00
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f32	s0, #1.000000e+00
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f64	d0, #1.000000e+00
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f32	s0, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmov.f64	d0, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vmovx.f16	s0, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmul.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmul.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vmul.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vneg.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vneg.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vneg.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vnmla.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmla.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmla.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmls.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmls.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmls.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmul.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmul.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vnmul.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vrinta.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrinta.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrinta.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintm.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintm.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintm.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintn.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintn.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintn.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintp.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintp.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintp.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintr.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintr.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintr.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintz.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintz.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintz.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintx.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintx.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vrintx.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vseleq.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vseleq.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vseleq.f64	d0, d2, d1
+# CHECK-NEXT:  -      -      -     1.00    -     vsqrt.f16	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vsqrt.f32	s0, s2
+# CHECK-NEXT:  -      -      -     1.00    -     vsqrt.f64	d0, d2
+# CHECK-NEXT:  -      -      -     1.00    -     vsub.f16	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vsub.f32	s0, s2, s1
+# CHECK-NEXT:  -      -      -     1.00    -     vsub.f64	d0, d2, d1

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-int.s b/llvm/test/tools/llvm-mca/ARM/m55-int.s
new file mode 100644
index 0000000000000..9347aa1a09a20
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-int.s
@@ -0,0 +1,1425 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -mattr=+mve.fp -instruction-tables < %s | FileCheck %s
+
+adc r0, r1, #0
+adcs r0, r1, #0
+adcs r0, r1
+adc.w r0, r1, r2
+adcs.w r0, r1, r2
+adc.w r0, r1, r2, LSL #1
+adcs.w r0, r1, r2, LSL #1
+add r0, sp, #1
+add sp, #1
+add.w r0, sp, #1
+adds.w r0, sp, #1
+addw r0, sp, #1
+add r0, sp, r0
+add sp, r1
+add.w r0, sp, r1
+adds.w r0, sp, r1
+add.w r0, sp, r1, LSL #1
+adds.w r0, sp, r1, LSL #1
+adds r0, r1, #1
+adds r0, #42
+add.w r0, r1, #1
+adds.w r0, r1, #1
+addw r0, r1, #1
+adds r0, r1, r2
+add r0, r1
+add.w r0, r1, r2
+adds.w r0, r1, r2
+add.w r0, r1, r2, LSL #1
+adds.w r0, r1, r2, LSL #1
+adr r0, #-6
+adr r8, #-6
+adr.w r0, #-6
+and r0, r1, #1
+ands r0, r1, #1
+ands r1, r0
+and.w r0, r1, r2
+ands.w r0, r1, r2
+and.w r0, r1, r2, LSL #1
+ands.w r0, r1, r2, LSL #1
+asrs r0, r1, #1
+asr.w r0, r1, #1
+asrs.w r0, r1, #1
+asrs r0, r1
+asr.w r0, r1, r2
+asrs.w r0, r1, r2
+asrl r0, r1, #1
+asrl r0, r1, r2
+bfc r0, #1, #2
+bfi r0, r1, #1, #2
+bic r0, r1, #1
+bics r0, r1, #1
+bics r0, r1
+bic.w r0, r1, r2
+bics.w r0, r1, r2
+bic.w r0, r1, r2, LSL #1
+bics.w r0, r1, r2, LSL #1
+bkpt #1
+clrex
+clrm {r1, r2}
+clz r0, r1
+cmn r0, #1
+cmn r0, r1
+cmn.w r0, r1
+cmn.w r0, r1, LSL #1
+cmp r0, #1
+cmp.w r0, #1
+cmp r0, r1
+cmp r0, r10
+cmp.w r0, r1
+cmp.w r0, r1, LSL #1
+#cpsdb 1
+#cpsie if
+csel r1, r2, r3, eq
+csinc r1, r2, r3, eq
+csinv r1, r2, r3, eq
+csneg r1, r2, r3, eq
+#dbg #1
+dmb
+dsb
+eor r0, r1, #1
+eors r0, r1, #1
+eors r0, r1
+eor.w r0, r1, r2
+eors.w r0, r1, r2
+eor.w r0, r1, r2, LSL #1
+eors.w r0, r1, r2, LSL #1
+isb
+lda r0, [r1]
+ldab r0, [r1]
+ldaex r0, [r1]
+ldaexb r0, [r1]
+ldaexh r0, [r1]
+ldah r0, [r1]
+ldm r0!, {r1}
+ldm r0, {r1}
+ldm.w r0, {r1}
+ldm.w r0!, {r1}
+ldmdb r0, {r1}
+ldmdb r0!, {r1}
+ldr r0, [r1, #4]
+ldr r0, [sp, #4]
+ldr.w r0, [r1, #4]
+ldr r0, [r1, #-1]
+ldr r0, [r1], #1
+ldr r0, [r1, #1]!
+ldr r0, #4
+ldr.w r0, #4
+ldr r0, next
+ldr.w r0, next
+ldr r0, [r1, r2]
+ldr.w r0, [r1, r2]
+ldr.w r0, [r1, r2, LSL #1]
+ldrb r0, [r1, #1]
+ldrb.w r0, [r1, #1]
+ldrb r0, [r1, #-1]
+ldrb r0, [r1], #1
+ldrb r0, [r1, #1]!
+ldrb r0, #4
+ldrb r0, next
+ldrb r0, [r1, r2]
+ldrb.w r0, [r1, r2]
+ldrb.w r0, [r1, r2, LSL #1]
+ldrbt r0, [r1, #1]
+ldrd r0, r2, [r1]
+ldrd r0, r2, [r1, #-4]
+ldrd r0, r2, [r1], #4
+ldrd r0, r2, [r1, #4]!
+ldrd r0, r2, next
+ldrex r0, [r1]
+ldrex r0, [r1, #4]
+ldrexb r0, [r1]
+ldrexh r0, [r1]
+ldrh r0, [r1, #2]
+ldrh.w r0, [r1, #1]
+ldrh r0, [r1, #-1]
+ldrh r0, [r1], #1
+ldrh r0, [r1, #1]!
+ldrh r0, #4
+ldrh r0, next
+ldrh r0, [r1, r2]
+ldrh.w r0, [r1, r2]
+ldrh.w r0, [r1, r2, LSL #1]
+ldrht r0, [r1, #1]
+ldrsb r0, [r1, #1]
+ldrsb r0, [r1, #-1]
+ldrsb r0, [r1], #1
+ldrsb r0, [r1, #1]!
+ldrsb r0, #4
+ldrsb r0, next
+ldrsb r0, [r1, r2]
+ldrsb.w r0, [r1, r2]
+ldrsb.w r0, [r1, r2, LSL #1]
+ldrsbt r0, [r1, #1]
+ldrsh r0, [r1, #2]
+ldrsh r0, [r1, #-1]
+ldrsh r0, [r1], #1
+ldrsh r0, [r1, #1]!
+ldrsh r0, #4
+ldrsh r0, next
+ldrsh r0, [r1, r2]
+ldrsh.w r0, [r1, r2]
+ldrsh.w r0, [r1, r2, LSL #1]
+ldrsht r0, [r1, #1]
+ldrt r0, [r1, #1]
+lsls r0, r1, #1
+lsl.w r0, r1, #1
+lsls.w r0, r1, #1
+lsls r0, r1
+lsl.w r0, r1, r2
+lsls.w r0, r1, r2
+lsll r0, r1, #2
+lsll r0, r1, r2
+lsrs r0, r1, #1
+lsr.w r0, r1, #1
+lsrs.w r0, r1, #1
+lsrs r0, r1
+lsr.w r0, r1, r2
+lsrs.w r0, r1, r2
+lsrl r0, r1, #2
+mla r0, r1, r2, r3
+mls r0, r1, r2, r3
+movs r0, #1
+mov.w r0, #1
+movs.w r0, #1
+movw r0, #1
+mov r0, r1
+#movs r0, r1
+mov.w r0, r1
+movs.w r0, r1
+movt r0, #1
+mrs r0, apsr
+msr apsr, r0
+muls r1, r2, r1
+mul r0, r1, r2
+mvn r0, #1
+mvns r0, #1
+mvns r0, r1
+mvn.w r0, r1
+mvns.w r0, r1
+mvn.w r0, r1, LSL #1
+mvns.w r0, r1, LSL #1
+nop
+orn r0, r1, #1
+orns r0, r1, #1
+orn r0, r1, r2
+orns r0, r1, r2
+orn r0, r1, r2, LSL #1
+orns r0, r1, r2, LSL #1
+orr r0, r1, #1
+orrs r0, r1, #1
+orrs r0, r1
+orr r0, r1, r2
+orrs r0, r1, r2
+orr r0, r1, r2, LSL #1
+orrs r0, r1, r2, LSL #1
+pkhbt r0, r1, r2
+pkhbt r0, r1, r2, LSL #1
+pkhtb r0, r1, r2
+pkhtb r0, r1, r2, ASR #1
+pop { r0 }
+pop.w { r0, r1 }
+pop.w { r0 }
+pssbb
+push { r0 }
+push.w { r0, r1 }
+push.w { r0 }
+qadd r0, r1, r2
+qadd16 r0, r1, r2
+qadd8 r0, r1, r2
+qasx r0, r1, r2
+qdadd r0, r1, r2
+qdsub r0, r1, r2
+qsax r0, r1, r2
+qsub r0, r1, r2
+qsub16 r0, r1, r2
+qsub8 r0, r1, r2
+rbit r0, r1
+rev r0, r1
+rev.w r0, r1
+rev16 r0, r1
+rev16.w r0, r1
+revsh r0, r1
+revsh.w r0, r1
+ror r0, r1, #1
+rors r0, r1, #1
+rors r0, r1
+ror.w r0, r1, r2
+rors.w r0, r1, r2
+rrx r0, r1
+rrxs r0, r1
+rsbs r0, r1, #0
+rsb.w r0, r1, #1
+rsbs.w r0, r1, #1
+rsb r0, r1, r2
+rsbs r0, r1, r2
+rsb r0, r1, r2, LSL #1
+rsbs r0, r1, r2, LSL #1
+sadd16 r0, r1, r2
+sadd8 r0, r1, r2
+sasx r0, r1, r2
+sbc r0, r1, #1
+sbcs r0, r1, #1
+sbcs r0, r1
+sbc r0, r1, r2
+sbcs r0, r1, r2
+sbc r0, r1, r2, LSL #1
+sbcs r0, r1, r2, LSL #1
+sbfx r0, r1, #1, #2
+sdiv r0, r1, r2
+sel r0, r1, r2
+sev
+#sg
+shadd16 r0, r1, r2
+shadd8 r0, r1, r2
+shasx r0, r1, r2
+shsax r0, r1, r2
+shsub16 r0, r1, r2
+shsub8 r0, r1, r2
+smlabb r0, r1, r2, r3
+smlabt r0, r1, r2, r3
+smlatb r0, r1, r2, r3
+smlatt r0, r1, r2, r3
+smlad r0, r1, r2, r3
+smladx r0, r1, r2, r3
+smlal r0, r1, r2, r3
+smlalbb r0, r1, r2, r3
+smlalbt r0, r1, r2, r3
+smlaltb r0, r1, r2, r3
+smlaltt r0, r1, r2, r3
+smlald r0, r1, r2, r3
+smlaldx r0, r1, r2, r3
+smlawb r0, r1, r2, r3
+smlawt r0, r1, r2, r3
+smlsd r0, r1, r2, r3
+smlsdx r0, r1, r2, r3
+smlsld r0, r1, r2, r3
+smlsldx r0, r1, r2, r3
+smmla r0, r1, r2, r3
+smmlar r0, r1, r2, r3
+smmls r0, r1, r2, r3
+smmlsr r0, r1, r2, r3
+smmul r0, r1, r2
+smmulr r0, r1, r2
+smuad r0, r1, r2
+smuadx r0, r1, r2
+smulbb r0, r1, r2
+smulbt r0, r1, r2
+smultb r0, r1, r2
+smultt r0, r1, r2
+smull r0, r1, r2, r3
+smulwb r0, r1, r2
+smulwt r0, r1, r2
+smusd r0, r1, r2
+smusdx r0, r1, r2
+sqrshr r0, r1
+sqrshrl r0, r1, #48, r2
+sqshl r0, #7
+sqshll r0, r1, #7
+srshr r0, #7
+srshrl r0, r1, #7
+ssat r0, #1, r2
+ssat r0, #1, r2, LSL #1
+ssat16 r0, #1, r1
+ssax r0, r1, r2
+ssbb
+ssub16 r0, r1, r2
+ssub8 r0, r1, r2
+stl r0, [r1]
+stlb r0, [r1]
+stlex r0, r1, [r2]
+stlexb r0, r1, [r2]
+stlexh r0, r1, [r2]
+stlh r0, [r1]
+stm r0!, { r1 }
+stm.w r0, { r1 }
+stm.w r0!, { r1 }
+stmdb r0, { r1 }
+stmdb r0!, { r1 }
+str r0, [ r1 ]
+str r0, [ r1, #4 ]
+str r0, [ sp, #4 ]
+str.w r0, [ r1, #1 ]
+str r0, [ r1, #-1 ]
+str r0, [ r1 ], #1
+#str r0, [ r1, #1 ]!
+str r0, [ r1, r2 ]
+str.w r0, [ r1, r2 ]
+str.w r0, [ r1, r2, LSL #1 ]
+strb r0, [ r1 ]
+strb r0, [ r1, #1 ]
+strb.w r0, [ r1, #1 ]
+strb r0, [ r1, #-1 ]
+strb r0, [ r1 ], #1
+strb r0, [ r1, #1 ]!
+strb r0, [ r1, r2 ]
+strb.w r0, [ r1, r2 ]
+strb.w r0, [ r1, r2, LSL #1 ]
+strbt r0, [ r1, #1 ]
+strd r0, r1, [ r2, #4 ]
+strd r0, r1, [ r2 ], #4
+strd r0, r1, [ r2, #4 ]!
+strex r0, r1, [ r2 ]
+strex r0, r1, [ r2, #4 ]
+strexb r0, r1, [ r2 ]
+strexh r0, r1, [ r2 ]
+strh r0, [ r1 ]
+strh r0, [ r1, #2 ]
+strh.w r0, [ r1, #2 ]
+strh r0, [ r1, #-1 ]
+strh r0, [ r1 ], #1
+strh r0, [ r1, #1 ]!
+strh r0, [ r1, r2 ]
+strh.w r0, [ r1, r2 ]
+strh.w r0, [ r1, r2, LSL #1 ]
+strht r0, [r1, #1 ]
+strt r0, [r1, #1 ]
+sub sp, sp, #4
+sub.w r0, sp, #1
+subs.w r0, sp, #1
+subw r0, sp, #1
+sub r0, sp, r1
+subs r0, sp, r1
+sub r0, sp, r1, LSL #1
+subs r0, sp, r1, LSL #1
+subs r0, r1, #1
+subs r0, #1
+sub.w r0, r1, #1
+subs.w r0, r1, #1
+subw r0, r1, #1
+subs r0, r1, r2
+sub.w r0, r1, r2
+subs.w r0, r1, r2
+sub.w r0, r1, r2, LSL #1
+subs.w r0, r1, r2, LSL #1
+#svc #1		; treated as a call
+sxtab r0, r1, r2
+sxtab r0, r1, r2, ROR #8
+sxtab16 r0, r1, r2
+sxtab16 r0, r1, r2, ROR #8
+sxtah r0, r1, r2
+sxtah r0, r1, r2, ROR #8
+sxtb r0, r1
+sxtb.w r0, r1
+sxtb.w r0, r1, ROR #8
+sxtb16 r0, r1
+sxtb16 r0, r1, ROR #8
+sxth r0, r1
+sxth.w r0, r1
+sxth.w r0, r1, ROR #8
+tbb [r0, r1]
+tbh [r0, r1, LSL #1]
+teq r0, #1
+teq r0, r1
+teq r0, r1, LSL #1
+tst r0, #1
+tst r0, r1
+tst.w r0, r1
+tst.w r0, r1, LSL #1
+#tt r0, r1
+#ttt r0, r1
+#tta r0, r1
+#ttat r0, r1
+uadd16 r0, r1, r2
+uadd8 r0, r1, r2
+uasx r0, r1, r2
+ubfx r0, r1, #1, #2
+#udf #1
+udiv r0, r1, r2
+uhadd16 r0, r1, r2
+uhadd8 r0, r1, r2
+uhasx r0, r1, r2
+uhsax r0, r1, r2
+uhsub16 r0, r1, r2
+uhsub8 r0, r1, r2
+umaal r0, r1, r2, r3
+umlal r0, r1, r2, r3
+umull r0, r1, r2, r3
+uqadd16 r0, r1, r2
+uqadd8 r0, r1, r2
+uqasx r0, r1, r2
+uqrshl r0, r1
+uqrshll r0, r1, #48, r2
+uqsax r0, r1, r2
+uqshl r0, #1
+uqshll r0, r1, #1
+uqsub16 r0, r1, r2
+uqsub8 r0, r1, r2
+urshr r0, #1
+urshrl r0, r1, #1
+usad8 r0, r1, r2
+usada8 r0, r1, r2, r3
+usat r0, #1, r1
+usat r0, #1, r1, LSL #1
+usat16 r0, #1, r1
+usax r0, r1, r2
+usub16 r0, r1, r2
+usub8 r0, r1, r2
+uxtab r0, r1, r2
+uxtab r0, r1, r2, ROR #8
+uxtab16 r0, r1, r2
+uxtab16 r0, r1, r2, ROR #8
+uxtah r0, r1, r2
+uxtah r0, r1, r2, ROR #8
+uxtb r0, r1
+uxtb.w r0, r1
+uxtb.w r0, r1, ROR #8
+uxtb16 r0, r1
+uxtb16 r0, r1, ROR #8
+uxth r0, r1
+uxth.w r0, r1
+uxth.w r0, r1, ROR #8
+wfe
+wfi
+yield
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      1     1.00                        adc	r0, r1, #0
+# CHECK-NEXT:  1      1     1.00                        adcs	r0, r1, #0
+# CHECK-NEXT:  1      1     1.00                  U     adcs	r0, r1
+# CHECK-NEXT:  1      1     1.00                        adc.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        adcs.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        adc.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        adcs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      1     0.50                        add.w	r0, sp, #1
+# CHECK-NEXT:  1      1     1.00                  U     add.w	sp, sp, #1
+# CHECK-NEXT:  1      1     0.50                        add.w	r0, sp, #1
+# CHECK-NEXT:  1      1     0.50                        adds.w	r0, sp, #1
+# CHECK-NEXT:  1      1     1.00                        addw	r0, sp, #1
+# CHECK-NEXT:  1      1     1.00                  U     add	r0, sp, r0
+# CHECK-NEXT:  1      1     1.00                  U     add	sp, r1
+# CHECK-NEXT:  1      1     1.00                        add.w	r0, sp, r1
+# CHECK-NEXT:  1      1     1.00                        adds.w	r0, sp, r1
+# CHECK-NEXT:  1      2     1.00                        add.w	r0, sp, r1, lsl #1
+# CHECK-NEXT:  1      2     1.00                        adds.w	r0, sp, r1, lsl #1
+# CHECK-NEXT:  1      1     0.50                        adds	r0, r1, #1
+# CHECK-NEXT:  1      1     0.50                        adds	r0, #42
+# CHECK-NEXT:  1      1     0.50                        add.w	r0, r1, #1
+# CHECK-NEXT:  1      1     0.50                        adds.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        addw	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        adds	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        add	r0, r1
+# CHECK-NEXT:  1      1     1.00                        add.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        adds.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        add.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        adds.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      1     1.00                  U     adr.w	r0, #-6
+# CHECK-NEXT:  1      1     1.00                  U     adr.w	r8, #-6
+# CHECK-NEXT:  1      1     1.00                  U     adr.w	r0, #-6
+# CHECK-NEXT:  1      1     1.00                        and	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        ands	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        ands	r1, r0
+# CHECK-NEXT:  1      1     1.00                        and.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        ands.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        and.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        ands.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      1     1.00                        asrs	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        asr.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        asrs.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        asrs	r0, r1
+# CHECK-NEXT:  1      1     1.00                        asr.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        asrs.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        asrl	r0, r1, #1
+# CHECK-NEXT:  1      2     1.00                        asrl	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        bfc	r0, #1, #2
+# CHECK-NEXT:  1      1     1.00                        bfi	r0, r1, #1, #2
+# CHECK-NEXT:  1      1     1.00                        bic	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        bics	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        bics	r0, r1
+# CHECK-NEXT:  1      1     1.00                        bic.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        bics.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        bic.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        bics.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      1     1.00                  U     bkpt	#1
+# CHECK-NEXT:  1      2     1.00    *      *      U     clrex
+# CHECK-NEXT:  1      2     1.00                  U     clrm	{r1, r2}
+# CHECK-NEXT:  1      1     1.00                        clz	r0, r1
+# CHECK-NEXT:  1      1     1.00                        cmn.w	r0, #1
+# CHECK-NEXT:  1      1     1.00                        cmn	r0, r1
+# CHECK-NEXT:  1      1     1.00                        cmn.w	r0, r1
+# CHECK-NEXT:  1      2     1.00                        cmn.w	r0, r1, lsl #1
+# CHECK-NEXT:  1      1     1.00                        cmp	r0, #1
+# CHECK-NEXT:  1      1     1.00                        cmp.w	r0, #1
+# CHECK-NEXT:  1      1     1.00                        cmp	r0, r1
+# CHECK-NEXT:  1      1     1.00                  U     cmp	r0, r10
+# CHECK-NEXT:  1      1     1.00                        cmp.w	r0, r1
+# CHECK-NEXT:  1      2     1.00                        cmp.w	r0, r1, lsl #1
+# CHECK-NEXT:  1      1     1.00                        csel	r1, r2, r3, eq
+# CHECK-NEXT:  1      1     1.00                        csinc	r1, r2, r3, eq
+# CHECK-NEXT:  1      1     1.00                        csinv	r1, r2, r3, eq
+# CHECK-NEXT:  1      1     1.00                        csneg	r1, r2, r3, eq
+# CHECK-NEXT:  1      2     1.00    *      *      U     dmb	sy
+# CHECK-NEXT:  1      2     1.00    *      *      U     dsb	sy
+# CHECK-NEXT:  1      1     1.00                        eor	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        eors	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        eors	r0, r1
+# CHECK-NEXT:  1      1     1.00                        eor.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        eors.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        eor.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        eors.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00    *      *      U     isb	sy
+# CHECK-NEXT:  1      2     1.00    *                   lda	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *                   ldab	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldaex	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldaexb	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldaexh	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *                   ldah	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *                   ldm	r0!, {r1}
+# CHECK-NEXT:  1      2     1.00    *                   ldm.w	r0, {r1}
+# CHECK-NEXT:  1      2     1.00    *                   ldm.w	r0, {r1}
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r1, [r0], #4
+# CHECK-NEXT:  1      2     1.00    *                   ldmdb	r0, {r1}
+# CHECK-NEXT:  1      2     1.00    *                   ldmdb	r0!, {r1}
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [r1, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [sp, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldr.w	r0, [r1, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [r1, #-1]
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [r1], #1
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [r1, #1]!
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [pc, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldr.w	r0, [pc, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, next
+# CHECK-NEXT:  1      2     1.00    *                   ldr.w	r0, next
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldr.w	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldr.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb.w	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb	r0, [r1, #-1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb	r0, [r1], #1
+# CHECK-NEXT:  1      2     1.00    *                   ldrb	r0, [r1, #1]!
+# CHECK-NEXT:  1      2     1.00    *                   ldrb.w	r0, [pc, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb.w	r0, next
+# CHECK-NEXT:  1      2     1.00    *                   ldrb	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb.w	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrb.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      2     1.00                  U     ldrbt	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrd	r0, r2, [r1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrd	r0, r2, [r1, #-4]
+# CHECK-NEXT:  1      2     1.00    *                   ldrd	r0, r2, [r1], #4
+# CHECK-NEXT:  1      2     1.00    *                   ldrd	r0, r2, [r1, #4]!
+# CHECK-NEXT:  1      2     1.00    *                   ldrd	r0, r2, next
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldrex	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldrex	r0, [r1, #4]
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldrexb	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *      *      U     ldrexh	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh	r0, [r1, #2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh.w	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh	r0, [r1, #-1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh	r0, [r1], #1
+# CHECK-NEXT:  1      2     1.00    *                   ldrh	r0, [r1, #1]!
+# CHECK-NEXT:  1      2     1.00    *                   ldrh.w	r0, [pc, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh.w	r0, next
+# CHECK-NEXT:  1      2     1.00    *                   ldrh	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh.w	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrh.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      2     1.00                  U     ldrht	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb.w	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb	r0, [r1, #-1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb	r0, [r1], #1
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb	r0, [r1, #1]!
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb.w	r0, [pc, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb.w	r0, next
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb.w	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsb.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      2     1.00                  U     ldrsbt	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh.w	r0, [r1, #2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh	r0, [r1, #-1]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh	r0, [r1], #1
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh	r0, [r1, #1]!
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh.w	r0, [pc, #4]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh.w	r0, next
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh.w	r0, [r1, r2]
+# CHECK-NEXT:  1      2     1.00    *                   ldrsh.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      2     1.00                  U     ldrsht	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00                  U     ldrt	r0, [r1, #1]
+# CHECK-NEXT:  1      1     1.00                        lsls	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        lsl.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        lsls.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        lsls	r0, r1
+# CHECK-NEXT:  1      1     1.00                        lsl.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        lsls.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        lsll	r0, r1, #2
+# CHECK-NEXT:  1      2     1.00                        lsll	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        lsrs	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        lsr.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        lsrs.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        lsrs	r0, r1
+# CHECK-NEXT:  1      1     1.00                        lsr.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        lsrs.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        lsrl	r0, r1, #2
+# CHECK-NEXT:  1      2     1.00                        mla	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        mls	r0, r1, r2, r3
+# CHECK-NEXT:  1      1     0.50                        movs	r0, #1
+# CHECK-NEXT:  1      1     0.50                        mov.w	r0, #1
+# CHECK-NEXT:  1      1     0.50                        movs.w	r0, #1
+# CHECK-NEXT:  1      1     0.50                        movw	r0, #1
+# CHECK-NEXT:  1      1     0.50                        mov	r0, r1
+# CHECK-NEXT:  1      1     0.50                        mov.w	r0, r1
+# CHECK-NEXT:  1      1     0.50                        movs.w	r0, r1
+# CHECK-NEXT:  1      1     1.00                        movt	r0, #1
+# CHECK-NEXT:  1      1     1.00                  U     mrs	r0, apsr
+# CHECK-NEXT:  1      1     1.00                  U     msr	apsr_nzcvq, r0
+# CHECK-NEXT:  1      1     1.00                        muls	r1, r2, r1
+# CHECK-NEXT:  1      2     1.00                        mul	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        mvn	r0, #1
+# CHECK-NEXT:  1      1     1.00                        mvns	r0, #1
+# CHECK-NEXT:  1      1     1.00                        mvns	r0, r1
+# CHECK-NEXT:  1      1     1.00                        mvn.w	r0, r1
+# CHECK-NEXT:  1      1     1.00                        mvns.w	r0, r1
+# CHECK-NEXT:  1      1     1.00                        mvn.w	r0, r1, lsl #1
+# CHECK-NEXT:  1      1     1.00                        mvns.w	r0, r1, lsl #1
+# CHECK-NEXT:  1      1     1.00    *      *      U     nop
+# CHECK-NEXT:  1      1     1.00                        orn	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        orns	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        orn	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        orns	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        orn	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        orns	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      1     1.00                        orr	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        orrs	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        orrs	r0, r1
+# CHECK-NEXT:  1      1     1.00                        orr.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        orrs.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        orr.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        orrs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        pkhbt	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        pkhbt	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        pkhbt	r0, r2, r1
+# CHECK-NEXT:  1      2     1.00                        pkhtb	r0, r1, r2, asr #1
+# CHECK-NEXT:  1      1     1.00    *             U     pop	{r0}
+# CHECK-NEXT:  1      2     1.00    *                   pop.w	{r0, r1}
+# CHECK-NEXT:  1      2     1.00    *                   ldr	r0, [sp], #4
+# CHECK-NEXT:  1      2     1.00    *      *      U     pssbb
+# CHECK-NEXT:  1      1     1.00           *      U     push	{r0}
+# CHECK-NEXT:  1      1     1.00           *            push.w	{r0, r1}
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [sp, #-4]!
+# CHECK-NEXT:  1      2     1.00                        qadd	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qadd16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qadd8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qasx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qdadd	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qdsub	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qsax	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qsub	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qsub16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        qsub8	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        rbit	r0, r1
+# CHECK-NEXT:  1      1     1.00                        rev	r0, r1
+# CHECK-NEXT:  1      1     1.00                        rev.w	r0, r1
+# CHECK-NEXT:  1      1     1.00                        rev16	r0, r1
+# CHECK-NEXT:  1      1     1.00                        rev16.w	r0, r1
+# CHECK-NEXT:  1      1     1.00                        revsh	r0, r1
+# CHECK-NEXT:  1      1     1.00                        revsh.w	r0, r1
+# CHECK-NEXT:  1      1     1.00                        ror.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        rors.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        rors	r0, r1
+# CHECK-NEXT:  1      1     1.00                        ror.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        rors.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        rrx	r0, r1
+# CHECK-NEXT:  1      1     1.00                        rrxs	r0, r1
+# CHECK-NEXT:  1      1     1.00                        rsbs	r0, r1, #0
+# CHECK-NEXT:  1      1     1.00                        rsb.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        rsbs.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                  U     rsb	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                  U     rsbs	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        rsb	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        rsbs	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00    *      *      U     sadd16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     sadd8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     sasx	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        sbc	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        sbcs	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                  U     sbcs	r0, r1
+# CHECK-NEXT:  1      1     1.00                        sbc.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        sbcs.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        sbc.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        sbcs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        sbfx	r0, r1, #1, #2
+# CHECK-NEXT:  1      2     1.00                        sdiv	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00    *                   sel	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00    *      *      U     sev
+# CHECK-NEXT:  1      2     1.00                        shadd16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        shadd8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        shasx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        shsax	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        shsub16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        shsub8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smlabb	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlabt	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlatb	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlatt	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlad	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smladx	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlal	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlalbb	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlalbt	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlaltb	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlaltt	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlald	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlaldx	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlawb	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlawt	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlsd	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlsdx	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlsld	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smlsldx	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smmla	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smmlar	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                  U     smmls	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smmlsr	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smmul	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smmulr	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smuad	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smuadx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smulbb	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smulbt	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smultb	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smultt	r0, r1, r2
+# CHECK-NEXT:  2      2     1.00                        smull	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        smulwb	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smulwt	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smusd	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        smusdx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        sqrshr	r0, r1
+# CHECK-NEXT:  1      2     1.00                        sqrshrl	r0, r1, #48, r2
+# CHECK-NEXT:  1      2     1.00                        sqshl	r0, #7
+# CHECK-NEXT:  1      2     1.00                        sqshll	r0, r1, #7
+# CHECK-NEXT:  1      2     1.00                        srshr	r0, #7
+# CHECK-NEXT:  1      2     1.00                        srshrl	r0, r1, #7
+# CHECK-NEXT:  1      2     1.00                        ssat	r0, #1, r2
+# CHECK-NEXT:  1      2     1.00                        ssat	r0, #1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        ssat16	r0, #1, r1
+# CHECK-NEXT:  1      2     1.00    *      *      U     ssax	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     ssbb
+# CHECK-NEXT:  1      2     1.00    *      *      U     ssub16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     ssub8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00           *            stl	r0, [r1]
+# CHECK-NEXT:  1      2     1.00           *            stlb	r0, [r1]
+# CHECK-NEXT:  1      2     1.00    *      *      U     stlex	r0, r1, [r2]
+# CHECK-NEXT:  1      2     1.00    *      *      U     stlexb	r0, r1, [r2]
+# CHECK-NEXT:  1      2     1.00    *      *      U     stlexh	r0, r1, [r2]
+# CHECK-NEXT:  1      2     1.00           *            stlh	r0, [r1]
+# CHECK-NEXT:  1      1     1.00           *            stm	r0!, {r1}
+# CHECK-NEXT:  1      1     1.00           *            stm.w	r0, {r1}
+# CHECK-NEXT:  1      1     1.00           *            stm.w	r0!, {r1}
+# CHECK-NEXT:  1      1     1.00           *            stmdb	r0, {r1}
+# CHECK-NEXT:  1      1     1.00           *            str	r1, [r0, #-4]!
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [r1]
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [r1, #4]
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [sp, #4]
+# CHECK-NEXT:  1      1     1.00           *            str.w	r0, [r1, #1]
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [r1, #-1]
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [r1], #1
+# CHECK-NEXT:  1      1     1.00           *            str	r0, [r1, r2]
+# CHECK-NEXT:  1      1     1.00           *            str.w	r0, [r1, r2]
+# CHECK-NEXT:  1      1     1.00           *            str.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      1     1.00           *            strb	r0, [r1]
+# CHECK-NEXT:  1      1     1.00           *            strb	r0, [r1, #1]
+# CHECK-NEXT:  1      1     1.00           *            strb.w	r0, [r1, #1]
+# CHECK-NEXT:  1      1     1.00           *            strb	r0, [r1, #-1]
+# CHECK-NEXT:  1      1     1.00           *            strb	r0, [r1], #1
+# CHECK-NEXT:  1      1     1.00           *            strb	r0, [r1, #1]!
+# CHECK-NEXT:  1      1     1.00           *            strb	r0, [r1, r2]
+# CHECK-NEXT:  1      1     1.00           *            strb.w	r0, [r1, r2]
+# CHECK-NEXT:  1      1     1.00           *            strb.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      1     1.00                  U     strbt	r0, [r1, #1]
+# CHECK-NEXT:  1      2     1.00           *            strd	r0, r1, [r2, #4]
+# CHECK-NEXT:  1      2     1.00           *            strd	r0, r1, [r2], #4
+# CHECK-NEXT:  1      2     1.00           *            strd	r0, r1, [r2, #4]!
+# CHECK-NEXT:  1      1     1.00    *      *      U     strex	r0, r1, [r2]
+# CHECK-NEXT:  1      1     1.00    *      *      U     strex	r0, r1, [r2, #4]
+# CHECK-NEXT:  1      1     1.00    *      *      U     strexb	r0, r1, [r2]
+# CHECK-NEXT:  1      1     1.00    *      *      U     strexh	r0, r1, [r2]
+# CHECK-NEXT:  1      1     1.00           *            strh	r0, [r1]
+# CHECK-NEXT:  1      1     1.00           *            strh	r0, [r1, #2]
+# CHECK-NEXT:  1      1     1.00           *            strh.w	r0, [r1, #2]
+# CHECK-NEXT:  1      1     1.00           *            strh	r0, [r1, #-1]
+# CHECK-NEXT:  1      1     1.00           *            strh	r0, [r1], #1
+# CHECK-NEXT:  1      1     1.00           *            strh	r0, [r1, #1]!
+# CHECK-NEXT:  1      1     1.00           *            strh	r0, [r1, r2]
+# CHECK-NEXT:  1      1     1.00           *            strh.w	r0, [r1, r2]
+# CHECK-NEXT:  1      1     1.00           *            strh.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT:  1      1     1.00                  U     strht	r0, [r1, #1]
+# CHECK-NEXT:  1      1     1.00                  U     strt	r0, [r1, #1]
+# CHECK-NEXT:  1      1     1.00                  U     sub	sp, #4
+# CHECK-NEXT:  1      1     0.50                        sub.w	r0, sp, #1
+# CHECK-NEXT:  1      1     0.50                        subs.w	r0, sp, #1
+# CHECK-NEXT:  1      1     1.00                        subw	r0, sp, #1
+# CHECK-NEXT:  1      1     1.00                        sub.w	r0, sp, r1
+# CHECK-NEXT:  1      1     1.00                        subs.w	r0, sp, r1
+# CHECK-NEXT:  1      2     1.00                        sub.w	r0, sp, r1, lsl #1
+# CHECK-NEXT:  1      2     1.00                        subs.w	r0, sp, r1, lsl #1
+# CHECK-NEXT:  1      1     0.50                        subs	r0, r1, #1
+# CHECK-NEXT:  1      1     0.50                        subs	r0, #1
+# CHECK-NEXT:  1      1     0.50                        sub.w	r0, r1, #1
+# CHECK-NEXT:  1      1     0.50                        subs.w	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        subw	r0, r1, #1
+# CHECK-NEXT:  1      1     1.00                        subs	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        sub.w	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        subs.w	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        sub.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      2     1.00                        subs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  1      1     1.00                        sxtab	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        sxtab	r0, r1, r2, ror #8
+# CHECK-NEXT:  1      1     1.00                        sxtab16	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        sxtab16	r0, r1, r2, ror #8
+# CHECK-NEXT:  1      1     1.00                        sxtah	r0, r1, r2
+# CHECK-NEXT:  1      1     1.00                        sxtah	r0, r1, r2, ror #8
+# CHECK-NEXT:  1      1     0.50                        sxtb	r0, r1
+# CHECK-NEXT:  1      1     0.50                        sxtb.w	r0, r1
+# CHECK-NEXT:  1      1     0.50                        sxtb.w	r0, r1, ror #8
+# CHECK-NEXT:  1      1     1.00                        sxtb16	r0, r1
+# CHECK-NEXT:  1      1     1.00                        sxtb16	r0, r1, ror #8
+# CHECK-NEXT:  1      1     0.50                        sxth	r0, r1
+# CHECK-NEXT:  1      1     0.50                        sxth.w	r0, r1
+# CHECK-NEXT:  1      1     0.50                        sxth.w	r0, r1, ror #8
+# CHECK-NEXT:  1      1     1.00                  U     tbb	[r0, r1]
+# CHECK-NEXT:  1      1     1.00                  U     tbh	[r0, r1, lsl #1]
+# CHECK-NEXT:  1      1     1.00                        teq.w	r0, #1
+# CHECK-NEXT:  1      1     1.00                        teq.w	r0, r1
+# CHECK-NEXT:  1      2     1.00                        teq.w	r0, r1, lsl #1
+# CHECK-NEXT:  1      1     1.00                        tst.w	r0, #1
+# CHECK-NEXT:  1      1     1.00                        tst	r0, r1
+# CHECK-NEXT:  1      1     1.00                        tst.w	r0, r1
+# CHECK-NEXT:  1      2     1.00                        tst.w	r0, r1, lsl #1
+# CHECK-NEXT:  1      2     1.00    *      *      U     uadd16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     uadd8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     uasx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        ubfx	r0, r1, #1, #2
+# CHECK-NEXT:  1      2     1.00                        udiv	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uhadd16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uhadd8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uhasx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uhsax	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uhsub16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uhsub8	r0, r1, r2
+# CHECK-NEXT:  2      2     1.00                        umaal	r0, r1, r2, r3
+# CHECK-NEXT:  2      2     1.00                        umlal	r0, r1, r2, r3
+# CHECK-NEXT:  2      2     1.00                        umull	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        uqadd16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uqadd8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uqasx	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uqrshl	r0, r1
+# CHECK-NEXT:  1      2     1.00                        uqrshll	r0, r1, #48, r2
+# CHECK-NEXT:  1      2     1.00                        uqsax	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uqshl	r0, #1
+# CHECK-NEXT:  1      2     1.00                        uqshll	r0, r1, #1
+# CHECK-NEXT:  1      2     1.00                        uqsub16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uqsub8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        urshr	r0, #1
+# CHECK-NEXT:  1      2     1.00                        urshrl	r0, r1, #1
+# CHECK-NEXT:  1      2     1.00                        usad8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        usada8	r0, r1, r2, r3
+# CHECK-NEXT:  1      2     1.00                        usat	r0, #1, r1
+# CHECK-NEXT:  1      2     1.00                        usat	r0, #1, r1, lsl #1
+# CHECK-NEXT:  1      2     1.00                        usat16	r0, #1, r1
+# CHECK-NEXT:  1      2     1.00    *      *      U     usax	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     usub16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00    *      *      U     usub8	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uxtab	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uxtab	r0, r1, r2, ror #8
+# CHECK-NEXT:  1      2     1.00                        uxtab16	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uxtab16	r0, r1, r2, ror #8
+# CHECK-NEXT:  1      2     1.00                        uxtah	r0, r1, r2
+# CHECK-NEXT:  1      2     1.00                        uxtah	r0, r1, r2, ror #8
+# CHECK-NEXT:  1      1     0.50                        uxtb	r0, r1
+# CHECK-NEXT:  1      1     0.50                        uxtb.w	r0, r1
+# CHECK-NEXT:  1      1     0.50                        uxtb.w	r0, r1, ror #8
+# CHECK-NEXT:  1      1     1.00                        uxtb16	r0, r1
+# CHECK-NEXT:  1      1     1.00                        uxtb16	r0, r1, ror #8
+# CHECK-NEXT:  1      1     0.50                        uxth	r0, r1
+# CHECK-NEXT:  1      1     0.50                        uxth.w	r0, r1
+# CHECK-NEXT:  1      1     0.50                        uxth.w	r0, r1, ror #8
+# CHECK-NEXT:  1      1     1.00    *      *      U     wfe
+# CHECK-NEXT:  1      1     1.00    *      *      U     wfi
+# CHECK-NEXT:  1      1     1.00    *      *      U     yield
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT: 430.00  -      -      -      -
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT: 1.00    -      -      -      -     adc	r0, r1, #0
+# CHECK-NEXT: 1.00    -      -      -      -     adcs	r0, r1, #0
+# CHECK-NEXT: 1.00    -      -      -      -     adcs	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     adc.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     adcs.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     adc.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     adcs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT:  -      -      -      -      -     add.w	r0, sp, #1
+# CHECK-NEXT: 1.00    -      -      -      -     add.w	sp, sp, #1
+# CHECK-NEXT:  -      -      -      -      -     add.w	r0, sp, #1
+# CHECK-NEXT:  -      -      -      -      -     adds.w	r0, sp, #1
+# CHECK-NEXT: 1.00    -      -      -      -     addw	r0, sp, #1
+# CHECK-NEXT: 1.00    -      -      -      -     add	r0, sp, r0
+# CHECK-NEXT: 1.00    -      -      -      -     add	sp, r1
+# CHECK-NEXT: 1.00    -      -      -      -     add.w	r0, sp, r1
+# CHECK-NEXT: 1.00    -      -      -      -     adds.w	r0, sp, r1
+# CHECK-NEXT: 1.00    -      -      -      -     add.w	r0, sp, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     adds.w	r0, sp, r1, lsl #1
+# CHECK-NEXT:  -      -      -      -      -     adds	r0, r1, #1
+# CHECK-NEXT:  -      -      -      -      -     adds	r0, #42
+# CHECK-NEXT:  -      -      -      -      -     add.w	r0, r1, #1
+# CHECK-NEXT:  -      -      -      -      -     adds.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     addw	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     adds	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     add	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     add.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     adds.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     add.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     adds.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     adr.w	r0, #-6
+# CHECK-NEXT: 1.00    -      -      -      -     adr.w	r8, #-6
+# CHECK-NEXT: 1.00    -      -      -      -     adr.w	r0, #-6
+# CHECK-NEXT: 1.00    -      -      -      -     and	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     ands	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     ands	r1, r0
+# CHECK-NEXT: 1.00    -      -      -      -     and.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     ands.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     and.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     ands.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     asrs	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     asr.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     asrs.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     asrs	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     asr.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     asrs.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     asrl	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     asrl	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     bfc	r0, #1, #2
+# CHECK-NEXT: 1.00    -      -      -      -     bfi	r0, r1, #1, #2
+# CHECK-NEXT: 1.00    -      -      -      -     bic	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     bics	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     bics	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     bic.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     bics.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     bic.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     bics.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     bkpt	#1
+# CHECK-NEXT: 1.00    -      -      -      -     clrex
+# CHECK-NEXT: 1.00    -      -      -      -     clrm	{r1, r2}
+# CHECK-NEXT: 1.00    -      -      -      -     clz	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     cmn.w	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     cmn	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     cmn.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     cmn.w	r0, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     cmp	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     cmp.w	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     cmp	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     cmp	r0, r10
+# CHECK-NEXT: 1.00    -      -      -      -     cmp.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     cmp.w	r0, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     csel	r1, r2, r3, eq
+# CHECK-NEXT: 1.00    -      -      -      -     csinc	r1, r2, r3, eq
+# CHECK-NEXT: 1.00    -      -      -      -     csinv	r1, r2, r3, eq
+# CHECK-NEXT: 1.00    -      -      -      -     csneg	r1, r2, r3, eq
+# CHECK-NEXT: 1.00    -      -      -      -     dmb	sy
+# CHECK-NEXT: 1.00    -      -      -      -     dsb	sy
+# CHECK-NEXT: 1.00    -      -      -      -     eor	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     eors	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     eors	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     eor.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     eors.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     eor.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     eors.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     isb	sy
+# CHECK-NEXT: 1.00    -      -      -      -     lda	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldab	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldaex	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldaexb	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldaexh	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldah	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldm	r0!, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     ldm.w	r0, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     ldm.w	r0, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r1, [r0], #4
+# CHECK-NEXT: 1.00    -      -      -      -     ldmdb	r0, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     ldmdb	r0!, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [r1, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [sp, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr.w	r0, [r1, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [pc, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr.w	r0, [pc, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldr.w	r0, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldr.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb.w	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb.w	r0, [pc, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb.w	r0, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrb.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrbt	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrd	r0, r2, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrd	r0, r2, [r1, #-4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrd	r0, r2, [r1], #4
+# CHECK-NEXT: 1.00    -      -      -      -     ldrd	r0, r2, [r1, #4]!
+# CHECK-NEXT: 1.00    -      -      -      -     ldrd	r0, r2, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldrex	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrex	r0, [r1, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrexb	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrexh	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh	r0, [r1, #2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh.w	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh.w	r0, [pc, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh.w	r0, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrh.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrht	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb.w	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb.w	r0, [pc, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb.w	r0, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsb.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsbt	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh.w	r0, [r1, #2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh.w	r0, [pc, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh.w	r0, next
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsh.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrsht	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     ldrt	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     lsls	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     lsl.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     lsls.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     lsls	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     lsl.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     lsls.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     lsll	r0, r1, #2
+# CHECK-NEXT: 1.00    -      -      -      -     lsll	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     lsrs	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     lsr.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     lsrs.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     lsrs	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     lsr.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     lsrs.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     lsrl	r0, r1, #2
+# CHECK-NEXT: 1.00    -      -      -      -     mla	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     mls	r0, r1, r2, r3
+# CHECK-NEXT:  -      -      -      -      -     movs	r0, #1
+# CHECK-NEXT:  -      -      -      -      -     mov.w	r0, #1
+# CHECK-NEXT:  -      -      -      -      -     movs.w	r0, #1
+# CHECK-NEXT:  -      -      -      -      -     movw	r0, #1
+# CHECK-NEXT:  -      -      -      -      -     mov	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     mov.w	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     movs.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     movt	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     mrs	r0, apsr
+# CHECK-NEXT: 1.00    -      -      -      -     msr	apsr_nzcvq, r0
+# CHECK-NEXT: 1.00    -      -      -      -     muls	r1, r2, r1
+# CHECK-NEXT: 1.00    -      -      -      -     mul	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     mvn	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     mvns	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     mvns	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     mvn.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     mvns.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     mvn.w	r0, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     mvns.w	r0, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     nop
+# CHECK-NEXT: 1.00    -      -      -      -     orn	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     orns	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     orn	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     orns	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     orn	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     orns	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     orr	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     orrs	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     orrs	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     orr.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     orrs.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     orr.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     orrs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     pkhbt	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     pkhbt	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     pkhbt	r0, r2, r1
+# CHECK-NEXT: 1.00    -      -      -      -     pkhtb	r0, r1, r2, asr #1
+# CHECK-NEXT: 1.00    -      -      -      -     pop	{r0}
+# CHECK-NEXT: 1.00    -      -      -      -     pop.w	{r0, r1}
+# CHECK-NEXT: 1.00    -      -      -      -     ldr	r0, [sp], #4
+# CHECK-NEXT: 1.00    -      -      -      -     pssbb
+# CHECK-NEXT: 1.00    -      -      -      -     push	{r0}
+# CHECK-NEXT: 1.00    -      -      -      -     push.w	{r0, r1}
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [sp, #-4]!
+# CHECK-NEXT: 1.00    -      -      -      -     qadd	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qadd16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qadd8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qasx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qdadd	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qdsub	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qsax	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qsub	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qsub16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     qsub8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     rbit	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     rev	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     rev.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     rev16	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     rev16.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     revsh	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     revsh.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     ror.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     rors.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     rors	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     ror.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     rors.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     rrx	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     rrxs	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     rsbs	r0, r1, #0
+# CHECK-NEXT: 1.00    -      -      -      -     rsb.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     rsbs.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     rsb	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     rsbs	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     rsb	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     rsbs	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     sadd16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sadd8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sasx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sbc	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     sbcs	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     sbcs	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     sbc.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sbcs.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sbc.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     sbcs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     sbfx	r0, r1, #1, #2
+# CHECK-NEXT: 1.00    -      -      -      -     sdiv	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sel	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sev
+# CHECK-NEXT: 1.00    -      -      -      -     shadd16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     shadd8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     shasx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     shsax	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     shsub16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     shsub8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smlabb	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlabt	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlatb	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlatt	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlad	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smladx	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlal	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlalbb	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlalbt	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlaltb	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlaltt	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlald	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlaldx	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlawb	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlawt	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlsd	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlsdx	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlsld	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smlsldx	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smmla	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smmlar	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smmls	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smmlsr	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smmul	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smmulr	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smuad	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smuadx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smulbb	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smulbt	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smultb	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smultt	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smull	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     smulwb	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smulwt	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smusd	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     smusdx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sqrshr	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     sqrshrl	r0, r1, #48, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sqshl	r0, #7
+# CHECK-NEXT: 1.00    -      -      -      -     sqshll	r0, r1, #7
+# CHECK-NEXT: 1.00    -      -      -      -     srshr	r0, #7
+# CHECK-NEXT: 1.00    -      -      -      -     srshrl	r0, r1, #7
+# CHECK-NEXT: 1.00    -      -      -      -     ssat	r0, #1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     ssat	r0, #1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     ssat16	r0, #1, r1
+# CHECK-NEXT: 1.00    -      -      -      -     ssax	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     ssbb
+# CHECK-NEXT: 1.00    -      -      -      -     ssub16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     ssub8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     stl	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     stlb	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     stlex	r0, r1, [r2]
+# CHECK-NEXT: 1.00    -      -      -      -     stlexb	r0, r1, [r2]
+# CHECK-NEXT: 1.00    -      -      -      -     stlexh	r0, r1, [r2]
+# CHECK-NEXT: 1.00    -      -      -      -     stlh	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     stm	r0!, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     stm.w	r0, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     stm.w	r0!, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     stmdb	r0, {r1}
+# CHECK-NEXT: 1.00    -      -      -      -     str	r1, [r0, #-4]!
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [r1, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [sp, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     str.w	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     str	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     str.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     str.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strb	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     strb	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strb.w	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strb	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     strb	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     strb	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     strb	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strb.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strb.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strbt	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strd	r0, r1, [r2, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     strd	r0, r1, [r2], #4
+# CHECK-NEXT: 1.00    -      -      -      -     strd	r0, r1, [r2, #4]!
+# CHECK-NEXT: 1.00    -      -      -      -     strex	r0, r1, [r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strex	r0, r1, [r2, #4]
+# CHECK-NEXT: 1.00    -      -      -      -     strexb	r0, r1, [r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strexh	r0, r1, [r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strh	r0, [r1]
+# CHECK-NEXT: 1.00    -      -      -      -     strh	r0, [r1, #2]
+# CHECK-NEXT: 1.00    -      -      -      -     strh.w	r0, [r1, #2]
+# CHECK-NEXT: 1.00    -      -      -      -     strh	r0, [r1, #-1]
+# CHECK-NEXT: 1.00    -      -      -      -     strh	r0, [r1], #1
+# CHECK-NEXT: 1.00    -      -      -      -     strh	r0, [r1, #1]!
+# CHECK-NEXT: 1.00    -      -      -      -     strh	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strh.w	r0, [r1, r2]
+# CHECK-NEXT: 1.00    -      -      -      -     strh.w	r0, [r1, r2, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strht	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     strt	r0, [r1, #1]
+# CHECK-NEXT: 1.00    -      -      -      -     sub	sp, #4
+# CHECK-NEXT:  -      -      -      -      -     sub.w	r0, sp, #1
+# CHECK-NEXT:  -      -      -      -      -     subs.w	r0, sp, #1
+# CHECK-NEXT: 1.00    -      -      -      -     subw	r0, sp, #1
+# CHECK-NEXT: 1.00    -      -      -      -     sub.w	r0, sp, r1
+# CHECK-NEXT: 1.00    -      -      -      -     subs.w	r0, sp, r1
+# CHECK-NEXT: 1.00    -      -      -      -     sub.w	r0, sp, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     subs.w	r0, sp, r1, lsl #1
+# CHECK-NEXT:  -      -      -      -      -     subs	r0, r1, #1
+# CHECK-NEXT:  -      -      -      -      -     subs	r0, #1
+# CHECK-NEXT:  -      -      -      -      -     sub.w	r0, r1, #1
+# CHECK-NEXT:  -      -      -      -      -     subs.w	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     subw	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     subs	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sub.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     subs.w	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sub.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     subs.w	r0, r1, r2, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     sxtab	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sxtab	r0, r1, r2, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     sxtab16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sxtab16	r0, r1, r2, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     sxtah	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     sxtah	r0, r1, r2, ror #8
+# CHECK-NEXT:  -      -      -      -      -     sxtb	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     sxtb.w	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     sxtb.w	r0, r1, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     sxtb16	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     sxtb16	r0, r1, ror #8
+# CHECK-NEXT:  -      -      -      -      -     sxth	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     sxth.w	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     sxth.w	r0, r1, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     tbb	[r0, r1]
+# CHECK-NEXT: 1.00    -      -      -      -     tbh	[r0, r1, lsl #1]
+# CHECK-NEXT: 1.00    -      -      -      -     teq.w	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     teq.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     teq.w	r0, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     tst.w	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     tst	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     tst.w	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     tst.w	r0, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     uadd16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uadd8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uasx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     ubfx	r0, r1, #1, #2
+# CHECK-NEXT: 1.00    -      -      -      -     udiv	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uhadd16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uhadd8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uhasx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uhsax	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uhsub16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uhsub8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     umaal	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     umlal	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     umull	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     uqadd16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uqadd8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uqasx	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uqrshl	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     uqrshll	r0, r1, #48, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uqsax	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uqshl	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     uqshll	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     uqsub16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uqsub8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     urshr	r0, #1
+# CHECK-NEXT: 1.00    -      -      -      -     urshrl	r0, r1, #1
+# CHECK-NEXT: 1.00    -      -      -      -     usad8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     usada8	r0, r1, r2, r3
+# CHECK-NEXT: 1.00    -      -      -      -     usat	r0, #1, r1
+# CHECK-NEXT: 1.00    -      -      -      -     usat	r0, #1, r1, lsl #1
+# CHECK-NEXT: 1.00    -      -      -      -     usat16	r0, #1, r1
+# CHECK-NEXT: 1.00    -      -      -      -     usax	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     usub16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     usub8	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uxtab	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uxtab	r0, r1, r2, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     uxtab16	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uxtab16	r0, r1, r2, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     uxtah	r0, r1, r2
+# CHECK-NEXT: 1.00    -      -      -      -     uxtah	r0, r1, r2, ror #8
+# CHECK-NEXT:  -      -      -      -      -     uxtb	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     uxtb.w	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     uxtb.w	r0, r1, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     uxtb16	r0, r1
+# CHECK-NEXT: 1.00    -      -      -      -     uxtb16	r0, r1, ror #8
+# CHECK-NEXT:  -      -      -      -      -     uxth	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     uxth.w	r0, r1
+# CHECK-NEXT:  -      -      -      -      -     uxth.w	r0, r1, ror #8
+# CHECK-NEXT: 1.00    -      -      -      -     wfe
+# CHECK-NEXT: 1.00    -      -      -      -     wfi
+# CHECK-NEXT: 1.00    -      -      -      -     yield

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-mve-fp.s b/llvm/test/tools/llvm-mca/ARM/m55-mve-fp.s
new file mode 100644
index 0000000000000..bcbd3c97f2162
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-mve-fp.s
@@ -0,0 +1,315 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -instruction-tables < %s | FileCheck %s
+
+vabd.f16 q0, q2, q1
+vabd.f32 q0, q2, q1
+vabs.f16 q0, q2
+vabs.f32 q0, q2
+vadd.f16 q0, q2, q1
+vadd.f32 q0, q2, q1
+vadd.f16 q0, q2, r0
+vadd.f32 q0, q2, r0
+vcadd.f16 q0, q2, q1, #90
+vcadd.f32 q0, q2, q1, #90
+vcmla.f16 q0, q2, q1, #90
+vcmla.f32 q0, q2, q1, #90
+vcmul.f16 q0, q2, q1, #90
+vcmul.f32 q0, q2, q1, #90
+vcvt.f16.s16 q0, q1, #4
+vcvt.f16.u16 q0, q1, #4
+vcvt.s16.f16 q0, q1, #4
+vcvt.u16.f16 q0, q1, #4
+vcvt.f32.s32 q0, q1, #4
+vcvt.f32.u32 q0, q1, #4
+vcvt.s32.f32 q0, q1, #4
+vcvt.u32.f32 q0, q1, #4
+vcvt.f16.s16 q0, q1
+vcvt.f32.s32 q0, q1
+vcvt.f16.u16 q0, q1
+vcvt.f32.u32 q0, q1
+vcvt.s16.f16 q0, q1
+vcvt.s32.f32 q0, q1
+vcvt.u16.f16 q0, q1
+vcvt.u32.f32 q0, q1
+vcvtb.f16.f32 q0, q1
+vcvtb.f32.f16 q0, q1
+vcvtt.f16.f32 q0, q1
+vcvtt.f32.f16 q0, q1
+vcvta.s16.f16 q0, q1
+vcvta.s32.f32 q0, q1
+vcvta.u16.f16 q0, q1
+vcvta.u32.f32 q0, q1
+vcvtm.s16.f16 q0, q1
+vcvtm.s32.f32 q0, q1
+vcvtm.u16.f16 q0, q1
+vcvtm.u32.f32 q0, q1
+vcvtn.s16.f16 q0, q1
+vcvtn.s32.f32 q0, q1
+vcvtn.u16.f16 q0, q1
+vcvtn.u32.f32 q0, q1
+vcvtp.s16.f16 q0, q1
+vcvtp.s32.f32 q0, q1
+vcvtp.u16.f16 q0, q1
+vcvtp.u32.f32 q0, q1
+vfma.f16 q0, q2, r0
+vfma.f32 q0, q2, r0
+vfma.f16 q0, q2, q1
+vfma.f32 q0, q2, q1
+vfms.f16 q0, q2, q1
+vfms.f32 q0, q2, q1
+vfmas.f16 q0, q2, r0
+vfmas.f32 q0, q2, r0
+vmaxnm.f16 q0, q2, q1
+vmaxnm.f32 q0, q2, q1
+vmaxnma.f16 q0, q2
+vmaxnma.f32 q0, q2
+vmaxnmv.f16 r0, q2
+vmaxnmv.f32 r0, q2
+vmaxnmav.f16 r0, q2
+vmaxnmav.f32 r0, q2
+vminnm.f16 q0, q2, q1
+vminnm.f32 q0, q2, q1
+vminnma.f16 q0, q2
+vminnma.f32 q0, q2
+vminnmv.f16 r0, q2
+vminnmv.f32 r0, q2
+vminnmav.f16 r0, q2
+vminnmav.f32 r0, q2
+vmul.f16 q0, q2, q1
+vmul.f32 q0, q2, q1
+vmul.f16 q0, q2, r0
+vmul.f32 q0, q2, r0
+vneg.f16 q0, q2
+vneg.f32 q0, q2
+vrinta.f16 q0, q2
+vrinta.f32 q0, q2
+vrintm.f16 q0, q2
+vrintm.f32 q0, q2
+vrintn.f16 q0, q2
+vrintn.f32 q0, q2
+vrintp.f16 q0, q2
+vrintp.f32 q0, q2
+vrintx.f16 q0, q2
+vrintx.f32 q0, q2
+vrintz.f16 q0, q2
+vrintz.f32 q0, q2
+vsub.f16 q0, q2, q1
+vsub.f32 q0, q2, q1
+vsub.f16 q0, q2, r0
+vsub.f32 q0, q2, r0
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      1     2.00                        vabd.f16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.f32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabs.f16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vabs.f32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vadd.f16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vadd.f32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vadd.f16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vadd.f32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vcadd.f16	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vcadd.f32	q0, q2, q1, #90
+# CHECK-NEXT:  1      2     2.00                        vcmla.f16	q0, q2, q1, #90
+# CHECK-NEXT:  1      2     2.00                        vcmla.f32	q0, q2, q1, #90
+# CHECK-NEXT:  1      2     2.00                        vcmul.f16	q0, q2, q1, #90
+# CHECK-NEXT:  1      2     2.00                        vcmul.f32	q0, q2, q1, #90
+# CHECK-NEXT:  1      2     2.00                        vcvt.f16.s16	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.f16.u16	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.s16.f16	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.u16.f16	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.f32.s32	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.f32.u32	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.s32.f32	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.u32.f32	q0, q1, #4
+# CHECK-NEXT:  1      2     2.00                        vcvt.f16.s16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.f32.s32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.f16.u16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.f32.u32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.s16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.s32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.u16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvt.u32.f32	q0, q1
+# CHECK-NEXT:  1      3     2.00                        vcvtb.f16.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtb.f32.f16	q0, q1
+# CHECK-NEXT:  1      3     2.00                        vcvtt.f16.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtt.f32.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvta.s16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvta.s32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvta.u16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvta.u32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtm.s16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtm.s32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtm.u16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtm.u32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtn.s16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtn.s32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtn.u16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtn.u32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtp.s16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtp.s32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtp.u16.f16	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vcvtp.u32.f32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vfma.f16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vfma.f32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vfma.f16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vfma.f32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vfms.f16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vfms.f32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vfmas.f16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vfmas.f32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vmaxnm.f16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmaxnm.f32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmaxnma.f16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxnma.f32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxnmv.f16	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxnmv.f32	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxnmav.f16	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxnmav.f32	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vminnm.f16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vminnm.f32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vminnma.f16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vminnma.f32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vminnmv.f16	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vminnmv.f32	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vminnmav.f16	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vminnmav.f32	r0, q2
+# CHECK-NEXT:  1      2     2.00                        vmul.f16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmul.f32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmul.f16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmul.f32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vneg.f16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vneg.f32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrinta.f16	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrinta.f32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintm.f16	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintm.f32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintn.f16	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintn.f32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintp.f16	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintp.f32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintx.f16	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintx.f32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintz.f16	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vrintz.f32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vsub.f16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vsub.f32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vsub.f16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vsub.f32	q0, q2, r0
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT:  -      -      -     192.00  -
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT:  -      -      -     2.00    -     vabd.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vabd.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vabs.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vabs.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vadd.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vadd.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vadd.f16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vadd.f32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vcadd.f16	q0, q2, q1, #90
+# CHECK-NEXT:  -      -      -     2.00    -     vcadd.f32	q0, q2, q1, #90
+# CHECK-NEXT:  -      -      -     2.00    -     vcmla.f16	q0, q2, q1, #90
+# CHECK-NEXT:  -      -      -     2.00    -     vcmla.f32	q0, q2, q1, #90
+# CHECK-NEXT:  -      -      -     2.00    -     vcmul.f16	q0, q2, q1, #90
+# CHECK-NEXT:  -      -      -     2.00    -     vcmul.f32	q0, q2, q1, #90
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f16.s16	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f16.u16	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.s16.f16	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.u16.f16	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f32.s32	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f32.u32	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.s32.f32	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.u32.f32	q0, q1, #4
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f16.s16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f32.s32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f16.u16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.f32.u32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.s16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.s32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.u16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvt.u32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtb.f16.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtb.f32.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtt.f16.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtt.f32.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvta.s16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvta.s32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvta.u16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvta.u32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtm.s16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtm.s32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtm.u16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtm.u32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtn.s16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtn.s32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtn.u16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtn.u32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtp.s16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtp.s32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtp.u16.f16	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtp.u32.f32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vfma.f16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vfma.f32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vfma.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vfma.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vfms.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vfms.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vfmas.f16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vfmas.f32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnm.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnm.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnma.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnma.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnmv.f16	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnmv.f32	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnmav.f16	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmaxnmav.f32	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vminnm.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vminnm.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vminnma.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vminnma.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vminnmv.f16	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vminnmv.f32	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vminnmav.f16	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vminnmav.f32	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.f16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.f32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vneg.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vneg.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrinta.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrinta.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintm.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintm.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintn.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintn.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintp.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintp.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintx.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintx.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintz.f16	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vrintz.f32	q0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vsub.f16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vsub.f32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vsub.f16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vsub.f32	q0, q2, r0

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-mve-int.s b/llvm/test/tools/llvm-mca/ARM/m55-mve-int.s
new file mode 100644
index 0000000000000..dc8025ea5fc25
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-mve-int.s
@@ -0,0 +1,1566 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -instruction-tables < %s | FileCheck %s
+
+vabav.s8  r0, q2, q1
+vabav.u8  r0, q2, q1
+vabav.s16 r0, q2, q1
+vabav.u16 r0, q2, q1
+vabav.s32 r0, q2, q1
+vabav.u32 r0, q2, q1
+vabd.s8  q0, q2, q1
+vabd.u8  q0, q2, q1
+vabd.s16 q0, q2, q1
+vabd.u16 q0, q2, q1
+vabd.s32 q0, q2, q1
+vabd.u32 q0, q2, q1
+vabs.s8  q0, q2
+vabs.s16 q0, q2
+vabs.s32 q0, q2
+vadc.i32 q0, q2, q1
+vadci.i32 q0, q2, q1
+vadd.i8  q0, q2, q1
+vadd.i16 q0, q2, q1
+vadd.i32 q0, q2, q1
+vadd.i8  q0, q2, r0
+vadd.i16 q0, q2, r0
+vadd.i32 q0, q2, r0
+vaddlv.s32 r0, r1, q1
+vaddlv.u32 r0, r1, q1
+vaddlva.s32 r0, r1, q1
+vaddlva.u32 r0, r1, q1
+vaddv.s8  r0, q1
+vaddv.u8  r0, q1
+vaddv.s16 r0, q1
+vaddv.u16 r0, q1
+vaddv.s32 r0, q1
+vaddv.u32 r0, q1
+vaddva.s8  r0, q1
+vaddva.u8  r0, q1
+vaddva.s16 r0, q1
+vaddva.u16 r0, q1
+vaddva.s32 r0, q1
+vaddva.u32 r0, q1
+vand q0, q2, q1
+vbic.i16 q0, #10
+vbic.i32 q0, #10
+vbic q0, q2, q1
+vbrsr.8  q0, q2, r0
+vbrsr.16 q0, q2, r0
+vbrsr.32 q0, q2, r0
+vcadd.i8  q0, q2, q1, #90
+vcadd.i16 q0, q2, q1, #90
+vcadd.i32 q0, q2, q1, #90
+vcls.s8  q0, q2
+vcls.s16 q0, q2
+vcls.s32 q0, q2
+vclz.i8  q0, q2
+vclz.i16 q0, q2
+vclz.i32 q0, q2
+vdwdup.u8  q0, r0, r1, #4
+vdwdup.u16 q0, r0, r1, #4
+vdwdup.u32 q0, r0, r1, #4
+vddup.u8  q0, r0, #4
+vddup.u16 q0, r0, #4
+vddup.u32 q0, r0, #4
+vdup.8  q0, r0
+vdup.16 q0, r0
+vdup.32 q0, r0
+veor q0, q2, q1
+vhadd.s8  q0, q2, q1
+vhadd.u8  q0, q2, q1
+vhadd.s16 q0, q2, q1
+vhadd.u16 q0, q2, q1
+vhadd.s32 q0, q2, q1
+vhadd.u32 q0, q2, q1
+vhadd.s8  q0, q2, r0
+vhadd.u8  q0, q2, r0
+vhadd.s16 q0, q2, r0
+vhadd.u16 q0, q2, r0
+vhadd.s32 q0, q2, r0
+vhadd.u32 q0, q2, r0
+vhcadd.s8  q0, q2, q1, #90
+vhcadd.s16 q0, q2, q1, #90
+vhcadd.s32 q0, q2, q1, #90
+vhsub.s8  q0, q2, q1
+vhsub.u8  q0, q2, q1
+vhsub.s16 q0, q2, q1
+vhsub.u16 q0, q2, q1
+vhsub.s32 q0, q2, q1
+vhsub.u32 q0, q2, q1
+vhsub.s8  q0, q2, r0
+vhsub.u8  q0, q2, r0
+vhsub.s16 q0, q2, r0
+vhsub.u16 q0, q2, r0
+vhsub.s32 q0, q2, r0
+vhsub.u32 q0, q2, r0
+viwdup.u8  q0, r0, r1, #4
+viwdup.u16 q0, r0, r1, #4
+viwdup.u32 q0, r0, r1, #4
+vidup.u8  q0, r0, #4
+vidup.u16 q0, r0, #4
+vidup.u32 q0, r0, #4
+vmax.s8  q0, q2, q1
+vmax.u8  q0, q2, q1
+vmax.s16 q0, q2, q1
+vmax.u16 q0, q2, q1
+vmax.s32 q0, q2, q1
+vmax.u32 q0, q2, q1
+vmaxa.s8  q0, q2
+vmaxa.s16 q0, q2
+vmaxa.s32 q0, q2
+vmaxv.s8  r0, q2
+vmaxv.u8  r0, q2
+vmaxv.s16 r0, q2
+vmaxv.u16 r0, q2
+vmaxv.s32 r0, q2
+vmaxv.u32 r0, q2
+vmaxav.s8  r0, q2
+vmaxav.s16 r0, q2
+vmaxav.s32 r0, q2
+vmin.s8  q0, q2, q1
+vmin.u8  q0, q2, q1
+vmin.s16 q0, q2, q1
+vmin.u16 q0, q2, q1
+vmin.s32 q0, q2, q1
+vmin.u32 q0, q2, q1
+vmina.s8  q0, q2
+vmina.s16 q0, q2
+vmina.s32 q0, q2
+vminv.s8  r0, q2
+vminv.u8  r0, q2
+vminv.s16 r0, q2
+vminv.u16 r0, q2
+vminv.s32 r0, q2
+vminv.u32 r0, q2
+vminav.s8  r0, q2
+vminav.s16 r0, q2
+vminav.s32 r0, q2
+vmla.i8  q0, q2, r0
+vmla.i16 q0, q2, r0
+vmla.i32 q0, q2, r0
+vmladav.s8  r0, q2, q1
+vmladav.u8  r0, q2, q1
+vmladav.s16 r0, q2, q1
+vmladav.u16 r0, q2, q1
+vmladav.s32 r0, q2, q1
+vmladav.u32 r0, q2, q1
+vmladava.s8  r0, q2, q1
+vmladava.u8  r0, q2, q1
+vmladava.s16 r0, q2, q1
+vmladava.u16 r0, q2, q1
+vmladava.s32 r0, q2, q1
+vmladava.u32 r0, q2, q1
+vmladavax.s8  r0, q2, q1
+vmladavax.s16 r0, q2, q1
+vmladavax.s32 r0, q2, q1
+vmladavx.s8  r0, q2, q1
+vmladavx.s16 r0, q2, q1
+vmladavx.s32 r0, q2, q1
+vmlaldav.s16 r0, r1, q2, q1
+vmlaldav.u16 r0, r1, q2, q1
+vmlaldav.s32 r0, r1, q2, q1
+vmlaldav.u32 r0, r1, q2, q1
+vmlaldava.s16 r0, r1, q2, q1
+vmlaldava.u16 r0, r1, q2, q1
+vmlaldava.s32 r0, r1, q2, q1
+vmlaldava.u32 r0, r1, q2, q1
+vmlaldavax.s16 r0, r1, q2, q1
+vmlaldavax.s32 r0, r1, q2, q1
+vmlaldavx.s16 r0, r1, q2, q1
+vmlaldavx.s32 r0, r1, q2, q1
+vmlas.i8  q0, q2, r0
+vmlas.i16 q0, q2, r0
+vmlas.i32 q0, q2, r0
+vmlsdav.s8  r0, q2, q1
+vmlsdav.s16 r0, q2, q1
+vmlsdav.s32 r0, q2, q1
+vmlsdava.s8  r0, q2, q1
+vmlsdava.s16 r0, q2, q1
+vmlsdava.s32 r0, q2, q1
+vmlsdavax.s8  r0, q2, q1
+vmlsdavax.s16 r0, q2, q1
+vmlsdavax.s32 r0, q2, q1
+vmlsdavx.s8  r0, q2, q1
+vmlsdavx.s16 r0, q2, q1
+vmlsdavx.s32 r0, q2, q1
+vmlsldav.s16 r0, r1, q2, q1
+vmlsldav.s32 r0, r1, q2, q1
+vmlsldava.s16 r0, r1, q2, q1
+vmlsldava.s32 r0, r1, q2, q1
+vmlsldavax.s16 r0, r1, q2, q1
+vmlsldavax.s32 r0, r1, q2, q1
+vmlsldavx.s16 r0, r1, q2, q1
+vmlsldavx.s32 r0, r1, q2, q1
+vmov.8  q0[1], r0
+vmov.16 q0[1], r0
+vmov.32 q0[1], r0
+vmov.i8  q0, #0
+vmov.i16 q0, #0
+vmov.i32 q0, #0
+vmov.i64 q0, #0
+vmov.f32 q0, #1.0
+vmov r1, r2, q0[2], q0[0]
+vmov q0[2], q0[0], r1, r2
+vmov.32 r0, q0[1]
+vmov.s16 r0, q0[1]
+vmov.u16 r0, q0[1]
+vmov.s8  r0, q0[1]
+vmov.u8  r0, q0[1]
+vmovlb.s8  q0, q1
+vmovlb.u8  q0, q1
+vmovlb.s16 q0, q1
+vmovlb.u16 q0, q1
+vmovlt.s8  q0, q1
+vmovlt.u8  q0, q1
+vmovlt.s16 q0, q1
+vmovlt.u16 q0, q1
+vmovnb.i16 q0, q1
+vmovnb.i32 q0, q1
+vmovnt.i16 q0, q1
+vmovnt.i32 q0, q1
+vmul.i8  q0, q2, q1
+vmul.i16 q0, q2, q1
+vmul.i32 q0, q2, q1
+vmul.i8  q0, q2, r0
+vmul.i16 q0, q2, r0
+vmul.i32 q0, q2, r0
+vmulh.s8  q0, q2, q1
+vmulh.u8  q0, q2, q1
+vmulh.s16 q0, q2, q1
+vmulh.u16 q0, q2, q1
+vmulh.s32 q0, q2, q1
+vmulh.u32 q0, q2, q1
+vrmulh.s8  q0, q2, q1
+vrmulh.u8  q0, q2, q1
+vrmulh.s16 q0, q2, q1
+vrmulh.u16 q0, q2, q1
+vrmulh.s32 q0, q2, q1
+vrmulh.u32 q0, q2, q1
+vmullb.s8  q0, q2, q1
+vmullb.u8  q0, q2, q1
+vmullb.s16 q0, q2, q1
+vmullb.u16 q0, q2, q1
+vmullb.s32 q0, q2, q1
+vmullb.u32 q0, q2, q1
+vmullt.s8  q0, q2, q1
+vmullt.u8  q0, q2, q1
+vmullt.s16 q0, q2, q1
+vmullt.u16 q0, q2, q1
+vmullt.s32 q0, q2, q1
+vmullt.u32 q0, q2, q1
+vmullb.p8  q0, q2, q1
+vmullb.p16 q0, q2, q1
+vmullt.p8  q0, q2, q1
+vmullt.p16 q0, q2, q1
+vmvn.i16 q0, #10
+vmvn.i32 q0, #10
+vmvn q0, q2
+vneg.s8  q0, q2
+vneg.s16 q0, q2
+vneg.s32 q0, q2
+vorn q0, q2, q1
+vorr.i16 q0, #10
+vorr.i32 q0, #10
+vorr q0, q2, q1
+vpsel q0, q2, q1
+vqabs.s8  q0, q2
+vqabs.s16 q0, q2
+vqabs.s32 q0, q2
+vqadd.s8  q0, q2, q1
+vqadd.u8  q0, q2, q1
+vqadd.s16 q0, q2, q1
+vqadd.u16 q0, q2, q1
+vqadd.s32 q0, q2, q1
+vqadd.u32 q0, q2, q1
+vqadd.s8  q0, q2, r0
+vqadd.u8  q0, q2, r0
+vqadd.s16 q0, q2, r0
+vqadd.u16 q0, q2, r0
+vqadd.s32 q0, q2, r0
+vqadd.u32 q0, q2, r0
+vqdmladh.s8  q0, q2, q1
+vqdmladh.s16 q0, q2, q1
+vqdmladh.s32 q0, q2, q1
+vqdmladhx.s8  q0, q2, q1
+vqdmladhx.s16 q0, q2, q1
+vqdmladhx.s32 q0, q2, q1
+vqrdmladh.s8  q0, q2, q1
+vqrdmladh.s16 q0, q2, q1
+vqrdmladh.s32 q0, q2, q1
+vqrdmladhx.s8  q0, q2, q1
+vqrdmladhx.s16 q0, q2, q1
+vqrdmladhx.s32 q0, q2, q1
+vqdmlah.s8  q0, q2, r0
+vqdmlah.s16 q0, q2, r0
+vqdmlah.s32 q0, q2, r0
+vqrdmlah.s8  q0, q2, r0
+vqrdmlah.s16 q0, q2, r0
+vqrdmlah.s32 q0, q2, r0
+vqdmlash.s8  q0, q2, r0
+vqdmlash.s16 q0, q2, r0
+vqdmlash.s32 q0, q2, r0
+vqrdmlash.s8  q0, q2, r0
+vqrdmlash.s16 q0, q2, r0
+vqrdmlash.s32 q0, q2, r0
+vqdmlsdh.s8  q0, q2, q1
+vqdmlsdh.s16 q0, q2, q1
+vqdmlsdh.s32 q0, q2, q1
+vqdmlsdhx.s8  q0, q2, q1
+vqdmlsdhx.s16 q0, q2, q1
+vqdmlsdhx.s32 q0, q2, q1
+vqrdmlsdh.s8  q0, q2, q1
+vqrdmlsdh.s16 q0, q2, q1
+vqrdmlsdh.s32 q0, q2, q1
+vqrdmlsdhx.s8  q0, q2, q1
+vqrdmlsdhx.s16 q0, q2, q1
+vqrdmlsdhx.s32 q0, q2, q1
+vqdmulh.s8  q0, q2, q1
+vqdmulh.s16 q0, q2, q1
+vqdmulh.s32 q0, q2, q1
+vqrdmulh.s8  q0, q2, q1
+vqrdmulh.s16 q0, q2, q1
+vqrdmulh.s32 q0, q2, q1
+vqdmulh.s8  q0, q2, r0
+vqdmulh.s16 q0, q2, r0
+vqdmulh.s32 q0, q2, r0
+vqrdmulh.s8  q0, q2, r0
+vqrdmulh.s16 q0, q2, r0
+vqrdmulh.s32 q0, q2, r0
+vqdmullt.s16 q0, q2, q1
+vqdmullt.s32 q0, q2, q1
+vqdmullb.s16 q0, q2, r0
+vqdmullb.s32 q0, q2, r0
+vqmovnt.s16 q0, q2
+vqmovnt.u16 q0, q2
+vqmovnt.s32 q0, q2
+vqmovnt.u32 q0, q2
+vqmovnb.s16 q0, q2
+vqmovnb.u16 q0, q2
+vqmovnb.s32 q0, q2
+vqmovnb.u32 q0, q2
+vqmovunt.s16 q0, q2
+vqmovunt.s32 q0, q2
+vqmovunb.s16 q0, q2
+vqmovunb.s32 q0, q2
+vqneg.s8  q0, q2
+vqneg.s16 q0, q2
+vqneg.s32 q0, q2
+vqrshl.s8  q0, q2, q1
+vqrshl.u8  q0, q2, q1
+vqrshl.s16 q0, q2, q1
+vqrshl.u16 q0, q2, q1
+vqrshl.s32 q0, q2, q1
+vqrshl.u32 q0, q2, q1
+vqrshl.s8  q0, r0
+vqrshl.u8  q0, r0
+vqrshl.s16 q0, r0
+vqrshl.u16 q0, r0
+vqrshl.s32 q0, r0
+vqrshl.u32 q0, r0
+vqrshrnb.s16 q0, q2, #5
+vqrshrnb.u16 q0, q2, #5
+vqrshrnb.s32 q0, q2, #5
+vqrshrnb.u32 q0, q2, #5
+vqrshrnt.s16 q0, q2, #5
+vqrshrnt.u16 q0, q2, #5
+vqrshrnt.s32 q0, q2, #5
+vqrshrnt.u32 q0, q2, #5
+vqrshrunb.s16 q0, q2, #5
+vqrshrunb.s32 q0, q2, #5
+vqrshrunt.s16 q0, q2, #5
+vqrshrunt.s32 q0, q2, #5
+vqshl.s8  q0, r0
+vqshl.u8  q0, r0
+vqshl.s16 q0, r0
+vqshl.u16 q0, r0
+vqshl.s32 q0, r0
+vqshl.u32 q0, r0
+vqshl.s8  q0, q2, #5
+vqshl.u8  q0, q2, #5
+vqshl.s16 q0, q2, #5
+vqshl.u16 q0, q2, #5
+vqshl.s32 q0, q2, #5
+vqshl.u32 q0, q2, #5
+vqshlu.s8  q0, q2, #5
+vqshlu.s16 q0, q2, #5
+vqshlu.s32 q0, q2, #5
+vqshl.s8  q0, q2, q1
+vqshl.u8  q0, q2, q1
+vqshl.s16 q0, q2, q1
+vqshl.u16 q0, q2, q1
+vqshl.s32 q0, q2, q1
+vqshl.u32 q0, q2, q1
+vqshrnb.s16 q0, q2, #5
+vqshrnb.u16 q0, q2, #5
+vqshrnb.s32 q0, q2, #5
+vqshrnb.u32 q0, q2, #5
+vqshrnt.s16 q0, q2, #5
+vqshrnt.u16 q0, q2, #5
+vqshrnt.s32 q0, q2, #5
+vqshrnt.u32 q0, q2, #5
+vqshrunb.s16 q0, q2, #5
+vqshrunb.s32 q0, q2, #5
+vqshrunt.s16 q0, q2, #5
+vqshrunt.s32 q0, q2, #5
+vqsub.s8  q0, q2, q1
+vqsub.u8  q0, q2, q1
+vqsub.s16 q0, q2, q1
+vqsub.u16 q0, q2, q1
+vqsub.s32 q0, q2, q1
+vqsub.u32 q0, q2, q1
+vqsub.s8  q0, q2, r0
+vqsub.u8  q0, q2, r0
+vqsub.s16 q0, q2, r0
+vqsub.u16 q0, q2, r0
+vqsub.s32 q0, q2, r0
+vqsub.u32 q0, q2, r0
+vrev16.8  q0, q2
+vrev32.8  q0, q2
+vrev32.16 q0, q2
+vrev64.8  q0, q2
+vrev64.16 q0, q2
+vrev64.32 q0, q2
+vrhadd.s8  q0, q2, q1
+vrhadd.u8  q0, q2, q1
+vrhadd.s16 q0, q2, q1
+vrhadd.u16 q0, q2, q1
+vrhadd.s32 q0, q2, q1
+vrhadd.u32 q0, q2, q1
+vrmlaldavh.s32 r0, r1, q2, q1
+vrmlaldavh.u32 r0, r1, q2, q1
+vrmlaldavha.s32 r0, r1, q2, q1
+vrmlaldavha.u32 r0, r1, q2, q1
+vrmlaldavhx.s32 r0, r1, q2, q1
+vrmlaldavhax.s32 r0, r1, q2, q1
+vrmlsldavh.s32 r0, r1, q2, q1
+vrmlsldavha.s32 r0, r1, q2, q1
+vrmlsldavhx.s32 r0, r1, q2, q1
+vrmlsldavhax.s32 r0, r1, q2, q1
+vrshl.s8  q0, q2, q1
+vrshl.u8  q0, q2, q1
+vrshl.s16 q0, q2, q1
+vrshl.u16 q0, q2, q1
+vrshl.s32 q0, q2, q1
+vrshl.u32 q0, q2, q1
+vrshl.s8  q0, r0
+vrshl.u8  q0, r0
+vrshl.s16 q0, r0
+vrshl.u16 q0, r0
+vrshl.s32 q0, r0
+vrshl.u32 q0, r0
+vrshr.s8  q0, q2, #5
+vrshr.u8  q0, q2, #5
+vrshr.s16 q0, q2, #5
+vrshr.u16 q0, q2, #5
+vrshr.s32 q0, q2, #5
+vrshr.u32 q0, q2, #5
+vrshrnb.i16 q0, q2, #5
+vrshrnb.i32 q0, q2, #5
+vrshrnt.i16 q0, q2, #5
+vrshrnt.i32 q0, q2, #5
+vsbc.i32 q0, q2, q1
+vsbci.i32 q0, q2, q1
+vshl.i8  q0, q2, #1
+vshl.i16 q0, q2, #1
+vshl.i32 q0, q2, #1
+vshl.s8  q0, r0
+vshl.u8  q0, r0
+vshl.s16 q0, r0
+vshl.u16 q0, r0
+vshl.s32 q0, r0
+vshl.u32 q0, r0
+vshl.s8  q0, q2, q1
+vshl.u8  q0, q2, q1
+vshl.s16 q0, q2, q1
+vshl.u16 q0, q2, q1
+vshl.s32 q0, q2, q1
+vshl.u32 q0, q2, q1
+vshlc q0, r0, #5
+vshllt.s8  q0, q2, #5
+vshllt.u8  q0, q2, #5
+vshllt.s16 q0, q2, #5
+vshllt.u16 q0, q2, #5
+vshllb.s8  q0, q2, #5
+vshllb.u8  q0, q2, #5
+vshllb.s16 q0, q2, #5
+vshllb.u16 q0, q2, #5
+vshllt.s8  q0, q2, #8
+vshllt.u8  q0, q2, #8
+vshllt.s16 q0, q2, #16
+vshllt.u16 q0, q2, #16
+vshllb.s8  q0, q2, #8
+vshllb.u8  q0, q2, #8
+vshllb.s16 q0, q2, #16
+vshllb.u16 q0, q2, #16
+vshr.s8  q0, q2, #5
+vshr.u8  q0, q2, #5
+vshr.s16 q0, q2, #5
+vshr.u16 q0, q2, #5
+vshr.s32 q0, q2, #5
+vshr.u32 q0, q2, #5
+vshrnb.i16 q0, q2, #5
+vshrnb.i32 q0, q2, #5
+vshrnt.i16 q0, q2, #5
+vshrnt.i32 q0, q2, #5
+vsli.8  q0, q2, #5
+vsli.16 q0, q2, #5
+vsli.32 q0, q2, #5
+vsri.8  q0, q2, #5
+vsri.16 q0, q2, #5
+vsri.32 q0, q2, #5
+vsub.i8  q0, q2, q1
+vsub.i16 q0, q2, q1
+vsub.i32 q0, q2, q1
+vsub.i8  q0, q2, r0
+vsub.i16 q0, q2, r0
+vsub.i32 q0, q2, r0
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      3     2.00                        vabav.s8	r0, q2, q1
+# CHECK-NEXT:  1      3     2.00                        vabav.u8	r0, q2, q1
+# CHECK-NEXT:  1      3     2.00                        vabav.s16	r0, q2, q1
+# CHECK-NEXT:  1      3     2.00                        vabav.u16	r0, q2, q1
+# CHECK-NEXT:  1      3     2.00                        vabav.s32	r0, q2, q1
+# CHECK-NEXT:  1      3     2.00                        vabav.u32	r0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabd.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vabs.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vabs.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vabs.s32	q0, q2
+# CHECK-NEXT:  1      2     2.00                  U     vadc.i32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                  U     vadci.i32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vadd.i8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vadd.i16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vadd.i32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vadd.i8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vadd.i16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vadd.i32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vaddlv.s32	r0, r1, q1
+# CHECK-NEXT:  1      2     2.00                        vaddlv.u32	r0, r1, q1
+# CHECK-NEXT:  1      2     2.00                        vaddlva.s32	r0, r1, q1
+# CHECK-NEXT:  1      2     2.00                        vaddlva.u32	r0, r1, q1
+# CHECK-NEXT:  1      2     2.00                        vaddv.s8	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddv.u8	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddv.s16	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddv.u16	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddv.s32	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddv.u32	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddva.s8	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddva.u8	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddva.s16	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddva.u16	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddva.s32	r0, q1
+# CHECK-NEXT:  1      2     2.00                        vaddva.u32	r0, q1
+# CHECK-NEXT:  1      1     2.00                        vand	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vbic.i16	q0, #0xa
+# CHECK-NEXT:  1      1     2.00                        vbic.i32	q0, #0xa
+# CHECK-NEXT:  1      1     2.00                        vbic	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vbrsr.8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vbrsr.16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vbrsr.32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vcadd.i8	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vcadd.i16	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vcadd.i32	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vcls.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vcls.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vcls.s32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vclz.i8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vclz.i16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vclz.i32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vdwdup.u8	q0, r0, r1, #4
+# CHECK-NEXT:  1      1     2.00                        vdwdup.u16	q0, r0, r1, #4
+# CHECK-NEXT:  1      1     2.00                        vdwdup.u32	q0, r0, r1, #4
+# CHECK-NEXT:  1      1     2.00                        vddup.u8	q0, r0, #4
+# CHECK-NEXT:  1      1     2.00                        vddup.u16	q0, r0, #4
+# CHECK-NEXT:  1      1     2.00                        vddup.u32	q0, r0, #4
+# CHECK-NEXT:  1      1     2.00                        vdup.8	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vdup.16	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vdup.32	q0, r0
+# CHECK-NEXT:  1      1     2.00                        veor	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhadd.s8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhadd.u8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhadd.s16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhadd.u16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhadd.s32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhadd.u32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhcadd.s8	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vhcadd.s16	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vhcadd.s32	q0, q2, q1, #90
+# CHECK-NEXT:  1      1     2.00                        vhsub.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhsub.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhsub.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhsub.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhsub.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhsub.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vhsub.s8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhsub.u8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhsub.s16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhsub.u16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhsub.s32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vhsub.u32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        viwdup.u8	q0, r0, r1, #4
+# CHECK-NEXT:  1      1     2.00                        viwdup.u16	q0, r0, r1, #4
+# CHECK-NEXT:  1      1     2.00                        viwdup.u32	q0, r0, r1, #4
+# CHECK-NEXT:  1      1     2.00                        vidup.u8	q0, r0, #4
+# CHECK-NEXT:  1      1     2.00                        vidup.u16	q0, r0, #4
+# CHECK-NEXT:  1      1     2.00                        vidup.u32	q0, r0, #4
+# CHECK-NEXT:  1      1     2.00                        vmax.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmax.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmax.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmax.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmax.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmax.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmaxa.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxa.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vmaxa.s32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vmaxv.s8	r0, q2
+# CHECK-NEXT:  1      2     2.00                        vmaxv.u8	r0, q2
+# CHECK-NEXT:  1      3     2.00                        vmaxv.s16	r0, q2
+# CHECK-NEXT:  1      3     2.00                        vmaxv.u16	r0, q2
+# CHECK-NEXT:  1      4     2.00                        vmaxv.s32	r0, q2
+# CHECK-NEXT:  1      4     2.00                        vmaxv.u32	r0, q2
+# CHECK-NEXT:  1      2     2.00                        vmaxav.s8	r0, q2
+# CHECK-NEXT:  1      3     2.00                        vmaxav.s16	r0, q2
+# CHECK-NEXT:  1      4     2.00                        vmaxav.s32	r0, q2
+# CHECK-NEXT:  1      1     2.00                        vmin.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmin.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmin.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmin.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmin.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmin.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmina.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vmina.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vmina.s32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vminv.s8	r0, q2
+# CHECK-NEXT:  1      2     2.00                        vminv.u8	r0, q2
+# CHECK-NEXT:  1      3     2.00                        vminv.s16	r0, q2
+# CHECK-NEXT:  1      3     2.00                        vminv.u16	r0, q2
+# CHECK-NEXT:  1      4     2.00                        vminv.s32	r0, q2
+# CHECK-NEXT:  1      4     2.00                        vminv.u32	r0, q2
+# CHECK-NEXT:  1      2     2.00                        vminav.s8	r0, q2
+# CHECK-NEXT:  1      3     2.00                        vminav.s16	r0, q2
+# CHECK-NEXT:  1      4     2.00                        vminav.s32	r0, q2
+# CHECK-NEXT:  1      2     2.00                        vmla.i8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmla.i16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmla.i32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmlav.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlav.u8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlav.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlav.u16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlav.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlav.u32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlava.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlava.u8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlava.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlava.u16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlava.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlava.u32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmladavax.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmladavax.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmladavax.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmladavx.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmladavx.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmladavx.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalv.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalv.u16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalv.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalv.u32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalva.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalva.u16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalva.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlalva.u32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlaldavax.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlaldavax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlaldavx.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlaldavx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlas.i8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmlas.i16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmlas.i32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmlsdav.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdav.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdav.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdava.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdava.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdava.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdavax.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdavax.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdavax.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdavx.s8	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdavx.s16	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsdavx.s32	r0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldav.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldav.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldava.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldava.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldavax.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldavax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldavx.s16	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmlsldavx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      1     1.00                        vmov.8	q0[1], r0
+# CHECK-NEXT:  1      1     1.00                        vmov.16	q0[1], r0
+# CHECK-NEXT:  1      1     1.00                        vmov.32	q0[1], r0
+# CHECK-NEXT:  1      2     2.00                        vmov.i8	q0, #0x0
+# CHECK-NEXT:  1      2     2.00                        vmov.i16	q0, #0x0
+# CHECK-NEXT:  1      2     2.00                        vmov.i32	q0, #0x0
+# CHECK-NEXT:  1      2     2.00                        vmov.i64	q0, #0x0
+# CHECK-NEXT:  1      2     2.00                        vmov.f32	q0, #1.000000e+00
+# CHECK-NEXT:  1      1     2.00                        vmov	r1, r2, q0[2], q0[0]
+# CHECK-NEXT:  1      1     1.00                        vmov	q0[2], q0[0], r1, r2
+# CHECK-NEXT:  1      1     2.00                        vmov.32	r0, q0[1]
+# CHECK-NEXT:  1      1     2.00                        vmov.s16	r0, q0[1]
+# CHECK-NEXT:  1      1     2.00                        vmov.u16	r0, q0[1]
+# CHECK-NEXT:  1      1     2.00                        vmov.s8	r0, q0[1]
+# CHECK-NEXT:  1      1     2.00                        vmov.u8	r0, q0[1]
+# CHECK-NEXT:  1      1     2.00                        vmovlb.s8	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlb.u8	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlb.s16	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlb.u16	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlt.s8	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlt.u8	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlt.s16	q0, q1
+# CHECK-NEXT:  1      1     2.00                        vmovlt.u16	q0, q1
+# CHECK-NEXT:  1      3     2.00                        vmovnb.i16	q0, q1
+# CHECK-NEXT:  1      3     2.00                        vmovnb.i32	q0, q1
+# CHECK-NEXT:  1      3     2.00                        vmovnt.i16	q0, q1
+# CHECK-NEXT:  1      3     2.00                        vmovnt.i32	q0, q1
+# CHECK-NEXT:  1      2     2.00                        vmul.i8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmul.i16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmul.i32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmul.i8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmul.i16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmul.i32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vmulh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmulh.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmulh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmulh.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmulh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmulh.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmulh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmulh.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmulh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmulh.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmulh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmulh.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.p8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullb.p16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.p8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vmullt.p16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vmvn.i16	q0, #0xa
+# CHECK-NEXT:  1      1     2.00                        vmvn.i32	q0, #0xa
+# CHECK-NEXT:  1      1     2.00                        vmvn	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vneg.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vneg.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vneg.s32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vorn	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorr.i16	q0, #0xa
+# CHECK-NEXT:  1      1     2.00                        vorr.i32	q0, #0xa
+# CHECK-NEXT:  1      1     2.00                        vorr	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vpsel	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqabs.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vqabs.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vqabs.s32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vqadd.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqadd.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqadd.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqadd.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqadd.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqadd.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqadd.s8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqadd.u8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqadd.s16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqadd.u16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqadd.s32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqadd.u32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmladh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmladh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmladh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmladhx.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmladhx.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmladhx.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmladh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmladh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmladh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmladhx.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmladhx.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmladhx.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmlah.s8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmlah.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmlah.s32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmlah.s8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmlah.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmlah.s32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmlash.s8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmlash.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmlash.s32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmlash.s8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmlash.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmlash.s32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmlsdh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmlsdh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmlsdh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmlsdhx.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmlsdhx.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmlsdhx.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmlsdh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmlsdh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmlsdh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmlsdhx.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmlsdhx.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmlsdhx.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmulh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmulh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmulh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmulh.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmulh.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrdmulh.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmulh.s8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmulh.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmulh.s32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmulh.s8	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmulh.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqrdmulh.s32	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmullt.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmullt.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqdmullb.s16	q0, q2, r0
+# CHECK-NEXT:  1      2     2.00                        vqdmullb.s32	q0, q2, r0
+# CHECK-NEXT:  1      3     2.00                        vqmovnt.s16	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnt.u16	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnt.s32	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnt.u32	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnb.s16	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnb.u16	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnb.s32	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovnb.u32	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovunt.s16	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovunt.s32	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovunb.s16	q0, q2
+# CHECK-NEXT:  1      3     2.00                        vqmovunb.s32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vqneg.s8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vqneg.s16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vqneg.s32	q0, q2
+# CHECK-NEXT:  1      2     2.00                        vqrshl.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrshl.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrshl.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrshl.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrshl.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrshl.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqrshl.s8	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqrshl.u8	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqrshl.s16	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqrshl.u16	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqrshl.s32	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqrshl.u32	q0, r0
+# CHECK-NEXT:  1      3     2.00                        vqrshrnb.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnb.u16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnb.s32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnb.u32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnt.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnt.u16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnt.s32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrnt.u32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrunb.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrunb.s32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrunt.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqrshrunt.s32	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.s8	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqshl.u8	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqshl.s16	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqshl.u16	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqshl.s32	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqshl.u32	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vqshl.s8	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.u8	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.s16	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.u16	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.s32	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.u32	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshlu.s8	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshlu.s16	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshlu.s32	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vqshl.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqshl.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqshl.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqshl.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqshl.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vqshl.u32	q0, q2, q1
+# CHECK-NEXT:  1      3     2.00                        vqshrnb.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnb.u16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnb.s32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnb.u32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnt.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnt.u16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnt.s32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrnt.u32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrunb.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrunb.s32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrunt.s16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vqshrunt.s32	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vqsub.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqsub.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqsub.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqsub.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqsub.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqsub.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vqsub.s8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqsub.u8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqsub.s16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqsub.u16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqsub.s32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vqsub.u32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vrev16.8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vrev32.8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vrev32.16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vrev64.8	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vrev64.16	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vrev64.32	q0, q2
+# CHECK-NEXT:  1      1     2.00                        vrhadd.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vrhadd.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vrhadd.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vrhadd.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vrhadd.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vrhadd.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlalvh.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlalvh.u32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlalvha.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlalvha.u32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlaldavhx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlaldavhax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlsldavh.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlsldavha.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlsldavhx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrmlsldavhax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.s8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.u8	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.s16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.u16	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.s32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.u32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                        vrshl.s8	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vrshl.u8	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vrshl.s16	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vrshl.u16	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vrshl.s32	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vrshl.u32	q0, r0
+# CHECK-NEXT:  1      2     2.00                        vrshr.s8	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vrshr.u8	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vrshr.s16	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vrshr.u16	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vrshr.s32	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                        vrshr.u32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vrshrnb.i16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vrshrnb.i32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vrshrnt.i16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vrshrnt.i32	q0, q2, #5
+# CHECK-NEXT:  1      2     2.00                  U     vsbc.i32	q0, q2, q1
+# CHECK-NEXT:  1      2     2.00                  U     vsbci.i32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vshl.i8	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00                        vshl.i16	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00                        vshl.i32	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00                        vshl.s8	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vshl.u8	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vshl.s16	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vshl.u16	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vshl.s32	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vshl.u32	q0, r0
+# CHECK-NEXT:  1      1     2.00                        vshl.s8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vshl.u8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vshl.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vshl.u16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vshl.s32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vshl.u32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                  U     vshlc	q0, r0, #5
+# CHECK-NEXT:  1      1     2.00                        vshllt.s8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllt.u8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllt.s16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllt.u16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllb.s8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllb.u8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllb.s16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllb.u16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshllt.s8	q0, q2, #8
+# CHECK-NEXT:  1      1     2.00                        vshllt.u8	q0, q2, #8
+# CHECK-NEXT:  1      1     2.00                        vshllt.s16	q0, q2, #16
+# CHECK-NEXT:  1      1     2.00                        vshllt.u16	q0, q2, #16
+# CHECK-NEXT:  1      1     2.00                        vshllb.s8	q0, q2, #8
+# CHECK-NEXT:  1      1     2.00                        vshllb.u8	q0, q2, #8
+# CHECK-NEXT:  1      1     2.00                        vshllb.s16	q0, q2, #16
+# CHECK-NEXT:  1      1     2.00                        vshllb.u16	q0, q2, #16
+# CHECK-NEXT:  1      1     2.00                        vshr.s8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshr.u8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshr.s16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshr.u16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshr.s32	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vshr.u32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vshrnb.i16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vshrnb.i32	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vshrnt.i16	q0, q2, #5
+# CHECK-NEXT:  1      3     2.00                        vshrnt.i32	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsli.8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsli.16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsli.32	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsri.8	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsri.16	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsri.32	q0, q2, #5
+# CHECK-NEXT:  1      1     2.00                        vsub.i8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vsub.i16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vsub.i32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vsub.i8	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vsub.i16	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00                        vsub.i32	q0, q2, r0
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT:  -      -     672.00 354.00  -
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT:  -      -     2.00    -      -     vabav.s8	r0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabav.u8	r0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabav.s16	r0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabav.u16	r0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabav.s32	r0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabav.u32	r0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabd.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabd.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabd.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabd.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabd.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabd.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vabs.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vabs.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vabs.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vadc.i32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vadci.i32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vaddlv.s32	r0, r1, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddlv.u32	r0, r1, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddlva.s32	r0, r1, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddlva.u32	r0, r1, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddv.s8	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddv.u8	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddv.s16	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddv.u16	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddv.s32	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddv.u32	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddva.s8	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddva.u8	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddva.s16	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddva.u16	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddva.s32	r0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vaddva.u32	r0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vand	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vbic.i16	q0, #0xa
+# CHECK-NEXT:  -      -     2.00    -      -     vbic.i32	q0, #0xa
+# CHECK-NEXT:  -      -     2.00    -      -     vbic	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vbrsr.8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vbrsr.16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vbrsr.32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vcadd.i8	q0, q2, q1, #90
+# CHECK-NEXT:  -      -     2.00    -      -     vcadd.i16	q0, q2, q1, #90
+# CHECK-NEXT:  -      -     2.00    -      -     vcadd.i32	q0, q2, q1, #90
+# CHECK-NEXT:  -      -     2.00    -      -     vcls.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vcls.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vcls.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vclz.i8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vclz.i16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vclz.i32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vdwdup.u8	q0, r0, r1, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vdwdup.u16	q0, r0, r1, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vdwdup.u32	q0, r0, r1, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vddup.u8	q0, r0, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vddup.u16	q0, r0, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vddup.u32	q0, r0, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vdup.8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vdup.16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vdup.32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     veor	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.s8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.u8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.s16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.u16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.s32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhadd.u32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhcadd.s8	q0, q2, q1, #90
+# CHECK-NEXT:  -      -     2.00    -      -     vhcadd.s16	q0, q2, q1, #90
+# CHECK-NEXT:  -      -     2.00    -      -     vhcadd.s32	q0, q2, q1, #90
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.s8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.u8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.s16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.u16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.s32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vhsub.u32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     viwdup.u8	q0, r0, r1, #4
+# CHECK-NEXT:  -      -     2.00    -      -     viwdup.u16	q0, r0, r1, #4
+# CHECK-NEXT:  -      -     2.00    -      -     viwdup.u32	q0, r0, r1, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vidup.u8	q0, r0, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vidup.u16	q0, r0, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vidup.u32	q0, r0, #4
+# CHECK-NEXT:  -      -     2.00    -      -     vmax.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmax.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmax.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmax.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmax.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmax.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxa.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxa.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxa.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxv.s8	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxv.u8	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxv.s16	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxv.u16	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxv.s32	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxv.u32	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxav.s8	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxav.s16	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmaxav.s32	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmin.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmin.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmin.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmin.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmin.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmin.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmina.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmina.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vmina.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminv.s8	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminv.u8	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminv.s16	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminv.u16	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminv.s32	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminv.u32	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminav.s8	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminav.s16	r0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vminav.s32	r0, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vmla.i8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmla.i16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmla.i32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmlav.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlav.u8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlav.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlav.u16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlav.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlav.u32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlava.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlava.u8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlava.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlava.u16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlava.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlava.u32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmladavax.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmladavax.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmladavax.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmladavx.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmladavx.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmladavx.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalv.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalv.u16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalv.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalv.u32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalva.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalva.u16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalva.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlalva.u32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlaldavax.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlaldavax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlaldavx.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlaldavx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlas.i8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmlas.i16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmlas.i32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdav.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdav.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdav.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdava.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdava.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdava.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdavax.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdavax.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdavax.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdavx.s8	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdavx.s16	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsdavx.s32	r0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldav.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldav.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldava.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldava.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldavax.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldavax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldavx.s16	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmlsldavx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -     1.00   1.00    -     vmov.8	q0[1], r0
+# CHECK-NEXT:  -      -     1.00   1.00    -     vmov.16	q0[1], r0
+# CHECK-NEXT:  -      -     1.00   1.00    -     vmov.32	q0[1], r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.i8	q0, #0x0
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.i16	q0, #0x0
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.i32	q0, #0x0
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.i64	q0, #0x0
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.f32	q0, #1.000000e+00
+# CHECK-NEXT:  -      -      -     2.00    -     vmov	r1, r2, q0[2], q0[0]
+# CHECK-NEXT:  -      -     1.00   1.00    -     vmov	q0[2], q0[0], r1, r2
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.32	r0, q0[1]
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.s16	r0, q0[1]
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.u16	r0, q0[1]
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.s8	r0, q0[1]
+# CHECK-NEXT:  -      -      -     2.00    -     vmov.u8	r0, q0[1]
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlb.s8	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlb.u8	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlb.s16	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlb.u16	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlt.s8	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlt.u8	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlt.s16	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovlt.u16	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovnb.i16	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovnb.i32	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovnt.i16	q0, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmovnt.i32	q0, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vmulh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmulh.u8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmulh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmulh.u16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmulh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmulh.u32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmulh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmulh.u8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmulh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmulh.u16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmulh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmulh.u32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.u8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.u16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.u32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullt.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullt.u8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullt.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullt.u16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullt.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vmullt.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmullb.p8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmullb.p16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmullt.p8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmullt.p16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vmvn.i16	q0, #0xa
+# CHECK-NEXT:  -      -     2.00    -      -     vmvn.i32	q0, #0xa
+# CHECK-NEXT:  -      -     2.00    -      -     vmvn	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vneg.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vneg.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vneg.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vorn	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorr.i16	q0, #0xa
+# CHECK-NEXT:  -      -     2.00    -      -     vorr.i32	q0, #0xa
+# CHECK-NEXT:  -      -     2.00    -      -     vorr	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vpsel	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqabs.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqabs.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqabs.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.s8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.u8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.s16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.u16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.s32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqadd.u32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmladh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmladh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmladh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmladhx.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmladhx.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmladhx.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmladh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmladh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmladh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmladhx.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmladhx.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmladhx.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlah.s8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlah.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlah.s32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlah.s8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlah.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlah.s32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlash.s8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlash.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlash.s32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlash.s8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlash.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlash.s32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlsdh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlsdh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlsdh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlsdhx.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlsdhx.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmlsdhx.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlsdh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlsdh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlsdh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlsdhx.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlsdhx.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmlsdhx.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmulh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmulh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmulh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmulh.s8	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmulh.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmulh.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmulh.s8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmulh.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmulh.s32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmulh.s8	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmulh.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqrdmulh.s32	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmullt.s16	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmullt.s32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmullb.s16	q0, q2, r0
+# CHECK-NEXT:  -      -      -     2.00    -     vqdmullb.s32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnt.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnt.u16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnt.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnt.u32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnb.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnb.u16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnb.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnb.u32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovunt.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovunt.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovunb.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovunb.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqneg.s8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqneg.s16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqneg.s32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.s8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.u8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.s16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.u16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.s32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshl.u32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnb.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnb.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnb.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnb.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnt.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnt.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnt.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrnt.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrunb.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrunb.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrunt.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqrshrunt.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshlu.s8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshlu.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshlu.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqshl.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnb.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnb.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnb.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnb.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnt.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnt.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnt.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnt.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrunb.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrunb.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrunt.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrunt.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.s8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.u8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.s16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.u16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.s32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vqsub.u32	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrev16.8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vrev32.8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vrev32.16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vrev64.8	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vrev64.16	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vrev64.32	q0, q2
+# CHECK-NEXT:  -      -     2.00    -      -     vrhadd.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrhadd.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrhadd.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrhadd.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrhadd.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrhadd.u32	q0, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlalvh.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlalvh.u32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlalvha.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlalvha.u32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlaldavhx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlaldavhax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlsldavh.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlsldavha.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlsldavhx.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vrmlsldavhax.s32	r0, r1, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.s8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.u8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.s16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.u16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.s32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrshl.u32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vrshr.s8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshr.u8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshr.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshr.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshr.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshr.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshrnb.i16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshrnb.i32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshrnt.i16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vrshrnt.i32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsbc.i32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vsbci.i32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.i8	q0, q2, #1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.i16	q0, q2, #1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.i32	q0, q2, #1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.s8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.u8	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.s16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.u16	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.s32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.u32	q0, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.s8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.u8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.s16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.u16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.s32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshl.u32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vshlc	q0, r0, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.s8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.u8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.s8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.u8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.s8	q0, q2, #8
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.u8	q0, q2, #8
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.s16	q0, q2, #16
+# CHECK-NEXT:  -      -     2.00    -      -     vshllt.u16	q0, q2, #16
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.s8	q0, q2, #8
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.u8	q0, q2, #8
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.s16	q0, q2, #16
+# CHECK-NEXT:  -      -     2.00    -      -     vshllb.u16	q0, q2, #16
+# CHECK-NEXT:  -      -     2.00    -      -     vshr.s8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshr.u8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshr.s16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshr.u16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshr.s32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshr.u32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshrnb.i16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshrnb.i32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshrnt.i16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vshrnt.i32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsli.8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsli.16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsli.32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsri.8	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsri.16	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsri.32	q0, q2, #5
+# CHECK-NEXT:  -      -     2.00    -      -     vsub.i8	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vsub.i16	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vsub.i32	q0, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vsub.i8	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vsub.i16	q0, q2, r0
+# CHECK-NEXT:  -      -     2.00    -      -     vsub.i32	q0, q2, r0

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-mve-ldst.s b/llvm/test/tools/llvm-mca/ARM/m55-mve-ldst.s
new file mode 100644
index 0000000000000..8e06105933aed
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-mve-ldst.s
@@ -0,0 +1,323 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -instruction-tables < %s | FileCheck %s
+
+vldrb.8 q1, [r0, 0]
+vldrb.8 q1, [r0, 0]!
+vldrb.8 q1, [r0], 0
+vldrh.16 q1, [r0, 0]
+vldrh.16 q1, [r0, 0]!
+vldrh.16 q1, [r0], 0
+vldrw.32 q1, [r0, 0]
+vldrw.32 q1, [r0, 0]!
+vldrw.32 q1, [r0], 0
+
+vldrb.u16 q1, [r0, 0]
+vldrb.u16 q1, [r0, 0]!
+vldrb.u16 q1, [r0], 0
+vldrb.u32 q1, [r0, 0]
+vldrb.u32 q1, [r0, 0]!
+vldrb.u32 q1, [r0], 0
+vldrh.u32 q1, [r0, 0]
+vldrh.u32 q1, [r0, 0]!
+vldrh.u32 q1, [r0], 0
+
+vldrb.s16 q1, [r0, 4]
+vldrb.s16 q1, [r0, 4]!
+vldrb.s16 q1, [r0], 4
+vldrb.s32 q1, [r0, 4]
+vldrb.s32 q1, [r0, 4]!
+vldrb.s32 q1, [r0], 4
+vldrh.s32 q1, [r0, 4]
+vldrh.s32 q1, [r0, 4]!
+vldrh.s32 q1, [r0], 4
+
+vldrw.32 q1, [r0, q0]
+vldrh.16 q1, [r0, q0]
+vldrb.8 q1, [r0, q0]
+vldrb.u16 q1, [r0, q0]
+vldrb.u32 q1, [r0, q0]
+vldrh.u32 q1, [r0, q0]
+vldrb.s16 q1, [r0, q0]
+vldrb.s32 q1, [r0, q0]
+vldrh.s32 q1, [r0, q0]
+vldrw.32 q1, [r0, q0, uxtw #2]
+vldrh.16 q1, [r0, q0, uxtw #1]
+vldrh.u32 q1, [r0, q0, uxtw #1]
+vldrh.s32 q1, [r0, q0, uxtw #1]
+
+vldrw.32 q1, [q0, 4]
+vldrw.32 q1, [q0, 4]!
+
+vld20.8 {q0, q1}, [r0]
+vld21.8 {q0, q1}, [r0]!
+vld40.8 {q0, q1, q2, q3}, [r0]
+vld43.8 {q0, q1, q2, q3}, [r0]!
+vld20.16 {q0, q1}, [r0]
+vld21.16 {q0, q1}, [r0]!
+vld40.16 {q0, q1, q2, q3}, [r0]
+vld43.16 {q0, q1, q2, q3}, [r0]!
+vld20.32 {q0, q1}, [r0]
+vld21.32 {q0, q1}, [r0]!
+vld40.32 {q0, q1, q2, q3}, [r0]
+vld43.32 {q0, q1, q2, q3}, [r0]!
+
+vstrb.8 q1, [r0, 0]
+vstrb.8 q1, [r0, 0]!
+vstrb.8 q1, [r0], 0
+vstrh.16 q1, [r0, 0]
+vstrh.16 q1, [r0, 0]!
+vstrh.16 q1, [r0], 0
+vstrw.32 q1, [r0, 0]
+vstrw.32 q1, [r0, 0]!
+vstrw.32 q1, [r0], 0
+
+vstrb.16 q1, [r0, 0]
+vstrb.16 q1, [r0, 0]!
+vstrb.16 q1, [r0], 0
+vstrb.32 q1, [r0, 0]
+vstrb.32 q1, [r0, 0]!
+vstrb.32 q1, [r0], 0
+vstrh.32 q1, [r0, 0]
+vstrh.32 q1, [r0, 0]!
+vstrh.32 q1, [r0], 0
+
+vstrw.32 q1, [r0, q0]
+vstrh.16 q1, [r0, q0]
+vstrb.8 q1, [r0, q0]
+vstrb.16 q1, [r0, q0]
+vstrb.32 q1, [r0, q0]
+vstrh.32 q1, [r0, q0]
+
+vstrw.32 q1, [r0, q0, uxtw #2]
+vstrh.16 q1, [r0, q0, uxtw #1]
+vstrh.32 q1, [r0, q0, uxtw #1]
+
+vstrw.32 q1, [q0, 4]
+vstrw.32 q1, [q0, 4]!
+
+vst20.8 {q0, q1}, [r0]
+vst21.8 {q0, q1}, [r0]!
+vst40.8 {q0, q1, q2, q3}, [r0]
+vst43.8 {q0, q1, q2, q3}, [r0]!
+vst20.16 {q0, q1}, [r0]
+vst21.16 {q0, q1}, [r0]!
+vst40.16 {q0, q1, q2, q3}, [r0]
+vst43.16 {q0, q1, q2, q3}, [r0]!
+vst20.32 {q0, q1}, [r0]
+vst21.32 {q0, q1}, [r0]!
+vst40.32 {q0, q1, q2, q3}, [r0]
+vst43.32 {q0, q1, q2, q3}, [r0]!
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u8	q1, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u8	q1, [r0, #0]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u8	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.u16	q1, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.u16	q1, [r0, #0]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.u16	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00    *                   vldrw.u32	q1, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vldrw.u32	q1, [r0, #0]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrw.u32	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u16	q1, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u16	q1, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u16	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u32	q1, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u32	q1, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.u32	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.u32	q1, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.u32	q1, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.u32	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.s16	q1, [r0, #4]
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.s16	q1, [r0, #4]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.s16	q1, [r0], #4
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.s32	q1, [r0, #4]
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.s32	q1, [r0, #4]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrb.s32	q1, [r0], #4
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.s32	q1, [r0, #4]
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.s32	q1, [r0, #4]!
+# CHECK-NEXT:  1      1     2.00    *                   vldrh.s32	q1, [r0], #4
+# CHECK-NEXT:  1      6     2.00    *                   vldrw.u32	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrh.u16	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrb.u8	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrb.u16	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrb.u32	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrh.u32	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrb.s16	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrb.s32	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrh.s32	q1, [r0, q0]
+# CHECK-NEXT:  1      6     2.00    *                   vldrw.u32	q1, [r0, q0, uxtw #2]
+# CHECK-NEXT:  1      6     2.00    *                   vldrh.u16	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  1      6     2.00    *                   vldrh.u32	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  1      6     2.00    *                   vldrh.s32	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  1      6     2.00    *                   vldrw.u32	q1, [q0, #4]
+# CHECK-NEXT:  1      6     2.00    *                   vldrw.u32	q1, [q0, #4]!
+# CHECK-NEXT:  1      1     2.00    *                   vld20.8	{q0, q1}, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vld21.8	{q0, q1}, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vld40.8	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vld43.8	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vld20.16	{q0, q1}, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vld21.16	{q0, q1}, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vld40.16	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vld43.16	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vld20.32	{q0, q1}, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vld21.32	{q0, q1}, [r0]!
+# CHECK-NEXT:  1      1     2.00    *                   vld40.32	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  1      1     2.00    *                   vld43.32	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q1, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q1, [r0, #0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00           *            vstrh.16	q1, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vstrh.16	q1, [r0, #0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrh.16	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00           *            vstrw.32	q1, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vstrw.32	q1, [r0, #0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrw.32	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00           *            vstrb.16	q1, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vstrb.16	q1, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrb.16	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00           *            vstrb.32	q1, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vstrb.32	q1, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrb.32	q1, [r0], #0
+# CHECK-NEXT:  1      1     2.00           *            vstrh.32	q1, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vstrh.32	q1, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vstrh.32	q1, [r0], #0
+# CHECK-NEXT:  1      5     2.00           *            vstrw.32	q1, [r0, q0]
+# CHECK-NEXT:  1      5     2.00           *            vstrh.16	q1, [r0, q0]
+# CHECK-NEXT:  1      5     2.00           *            vstrb.8	q1, [r0, q0]
+# CHECK-NEXT:  1      5     2.00           *            vstrb.16	q1, [r0, q0]
+# CHECK-NEXT:  1      5     2.00           *            vstrb.32	q1, [r0, q0]
+# CHECK-NEXT:  1      5     2.00           *            vstrh.32	q1, [r0, q0]
+# CHECK-NEXT:  1      5     2.00           *            vstrw.32	q1, [r0, q0, uxtw #2]
+# CHECK-NEXT:  1      5     2.00           *            vstrh.16	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  1      5     2.00           *            vstrh.32	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  1      5     2.00           *            vstrw.32	q1, [q0, #4]
+# CHECK-NEXT:  1      5     2.00           *            vstrw.32	q1, [q0, #4]!
+# CHECK-NEXT:  1      1     2.00           *            vst20.8	{q0, q1}, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vst21.8	{q0, q1}, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vst40.8	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vst43.8	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vst20.16	{q0, q1}, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vst21.16	{q0, q1}, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vst40.16	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vst43.16	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vst20.32	{q0, q1}, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vst21.32	{q0, q1}, [r0]!
+# CHECK-NEXT:  1      1     2.00           *            vst40.32	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  1      1     2.00           *            vst43.32	{q0, q1, q2, q3}, [r0]!
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT:  -     190.00  -      -      -
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u8	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u8	q1, [r0, #0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u8	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u16	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u16	q1, [r0, #0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u16	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [r0, #0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u16	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u16	q1, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u16	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u32	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u32	q1, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u32	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u32	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u32	q1, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u32	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s16	q1, [r0, #4]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s16	q1, [r0, #4]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s16	q1, [r0], #4
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s32	q1, [r0, #4]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s32	q1, [r0, #4]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s32	q1, [r0], #4
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.s32	q1, [r0, #4]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.s32	q1, [r0, #4]!
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.s32	q1, [r0], #4
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u16	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u8	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u16	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.u32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s16	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrb.s32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.s32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [r0, q0, uxtw #2]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u16	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.u32	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrh.s32	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [q0, #4]
+# CHECK-NEXT:  -     2.00    -      -      -     vldrw.u32	q1, [q0, #4]!
+# CHECK-NEXT:  -     2.00    -      -      -     vld20.8	{q0, q1}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vld21.8	{q0, q1}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vld40.8	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vld43.8	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vld20.16	{q0, q1}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vld21.16	{q0, q1}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vld40.16	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vld43.16	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vld20.32	{q0, q1}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vld21.32	{q0, q1}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vld40.32	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vld43.32	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q1, [r0, #0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.16	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.16	q1, [r0, #0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.16	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [r0, #0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.16	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.16	q1, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.16	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.32	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.32	q1, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.32	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.32	q1, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.32	q1, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.32	q1, [r0], #0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.16	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.16	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.32	q1, [r0, q0]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [r0, q0, uxtw #2]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.16	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrh.32	q1, [r0, q0, uxtw #1]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [q0, #4]
+# CHECK-NEXT:  -     2.00    -      -      -     vstrw.32	q1, [q0, #4]!
+# CHECK-NEXT:  -     2.00    -      -      -     vst20.8	{q0, q1}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vst21.8	{q0, q1}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vst40.8	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vst43.8	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vst20.16	{q0, q1}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vst21.16	{q0, q1}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vst40.16	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vst43.16	{q0, q1, q2, q3}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vst20.32	{q0, q1}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vst21.32	{q0, q1}, [r0]!
+# CHECK-NEXT:  -     2.00    -      -      -     vst40.32	{q0, q1, q2, q3}, [r0]
+# CHECK-NEXT:  -     2.00    -      -      -     vst43.32	{q0, q1, q2, q3}, [r0]!

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-mve-pred.s b/llvm/test/tools/llvm-mca/ARM/m55-mve-pred.s
new file mode 100644
index 0000000000000..9add5ce1c39fd
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-mve-pred.s
@@ -0,0 +1,694 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -instruction-tables < %s | FileCheck %s
+
+vcmp.f16 eq, q2, q1
+vcmp.f32 eq, q2, q1
+vcmp.f16 ne, q2, q1
+vcmp.f32 ne, q2, q1
+vcmp.f16 ge, q2, q1
+vcmp.f32 ge, q2, q1
+vcmp.f16 lt, q2, q1
+vcmp.f32 lt, q2, q1
+vcmp.f16 gt, q2, q1
+vcmp.f32 gt, q2, q1
+vcmp.f16 le, q2, q1
+vcmp.f32 le, q2, q1
+vcmp.f16 eq, q2, r1
+vcmp.f32 eq, q2, r1
+vcmp.f16 ne, q2, r1
+vcmp.f32 ne, q2, r1
+vcmp.f16 ge, q2, r1
+vcmp.f32 ge, q2, r1
+vcmp.f16 lt, q2, r1
+vcmp.f32 lt, q2, r1
+vcmp.f16 gt, q2, r1
+vcmp.f32 gt, q2, r1
+vcmp.f16 le, q2, r1
+vcmp.f32 le, q2, r1
+vcmp.i8  eq, q2, q1
+vcmp.i16 eq, q2, q1
+vcmp.i32 eq, q2, q1
+vcmp.i8  ne, q2, q1
+vcmp.i16 ne, q2, q1
+vcmp.i32 ne, q2, q1
+vcmp.u8  cs, q2, q1
+vcmp.u16 cs, q2, q1
+vcmp.u32 cs, q2, q1
+vcmp.u8  hi, q2, q1
+vcmp.u16 hi, q2, q1
+vcmp.u32 hi, q2, q1
+vcmp.s8  ge, q2, q1
+vcmp.s16 ge, q2, q1
+vcmp.s32 ge, q2, q1
+vcmp.s8  lt, q2, q1
+vcmp.s16 lt, q2, q1
+vcmp.s32 lt, q2, q1
+vcmp.s8  gt, q2, q1
+vcmp.s16 gt, q2, q1
+vcmp.s32 gt, q2, q1
+vcmp.s8  le, q2, q1
+vcmp.s16 le, q2, q1
+vcmp.s32 le, q2, q1
+vcmp.i8  eq, q2, r1
+vcmp.i16 eq, q2, r1
+vcmp.i32 eq, q2, r1
+vcmp.i8  ne, q2, r1
+vcmp.i16 ne, q2, r1
+vcmp.i32 ne, q2, r1
+vcmp.u8  cs, q2, r1
+vcmp.u16 cs, q2, r1
+vcmp.u32 cs, q2, r1
+vcmp.u8  hi, q2, r1
+vcmp.u16 hi, q2, r1
+vcmp.u32 hi, q2, r1
+vcmp.s8  ge, q2, r1
+vcmp.s16 ge, q2, r1
+vcmp.s32 ge, q2, r1
+vcmp.s8  lt, q2, r1
+vcmp.s16 lt, q2, r1
+vcmp.s32 lt, q2, r1
+vcmp.s8  gt, q2, r1
+vcmp.s16 gt, q2, r1
+vcmp.s32 gt, q2, r1
+vcmp.s8  le, q2, r1
+vcmp.s16 le, q2, r1
+vcmp.s32 le, q2, r1
+vctp.8  r0
+vctp.16 r0
+vctp.32 r0
+vctp.64 r0
+#vpnot			FIXME: crashes compiler
+vpst
+vorrt    q0, q0, q0
+vpt.f16 eq, q2, q1
+vorrt    q0, q1, q2
+vpt.f32 eq, q2, q1
+vorrt    q0, q1, q2
+vpt.f16 ne, q2, q1
+vorrt    q0, q1, q2
+vpt.f32 ne, q2, q1
+vorrt    q0, q1, q2
+vpt.f16 ge, q2, q1
+vorrt    q0, q1, q2
+vpt.f32 ge, q2, q1
+vorrt    q0, q1, q2
+vpt.f16 lt, q2, q1
+vorrt    q0, q1, q2
+vpt.f32 lt, q2, q1
+vorrt    q0, q1, q2
+vpt.f16 gt, q2, q1
+vorrt    q0, q1, q2
+vpt.f32 gt, q2, q1
+vorrt    q0, q1, q2
+vpt.f16 le, q2, q1
+vorrt    q0, q1, q2
+vpt.f32 le, q2, q1
+vorrt    q0, q1, q2
+vpt.f16 eq, q2, r1
+vorrt    q0, q1, q2
+vpt.f32 eq, q2, r1
+vorrt    q0, q1, q2
+vpt.f16 ne, q2, r1
+vorrt    q0, q1, q2
+vpt.f32 ne, q2, r1
+vorrt    q0, q1, q2
+vpt.f16 ge, q2, r1
+vorrt    q0, q1, q2
+vpt.f32 ge, q2, r1
+vorrt    q0, q1, q2
+vpt.f16 lt, q2, r1
+vorrt    q0, q1, q2
+vpt.f32 lt, q2, r1
+vorrt    q0, q1, q2
+vpt.f16 gt, q2, r1
+vorrt    q0, q1, q2
+vpt.f32 gt, q2, r1
+vorrt    q0, q1, q2
+vpt.f16 le, q2, r1
+vorrt    q0, q1, q2
+vpt.f32 le, q2, r1
+vorrt    q0, q1, q2
+vpt.i8  eq, q2, q1
+vorrt    q0, q1, q2
+vpt.i16 eq, q2, q1
+vorrt    q0, q1, q2
+vpt.i32 eq, q2, q1
+vorrt    q0, q1, q2
+vpt.i8  ne, q2, q1
+vorrt    q0, q1, q2
+vpt.i16 ne, q2, q1
+vorrt    q0, q1, q2
+vpt.i32 ne, q2, q1
+vorrt    q0, q1, q2
+vpt.u8  cs, q2, q1
+vorrt    q0, q1, q2
+vpt.u16 cs, q2, q1
+vorrt    q0, q1, q2
+vpt.u32 cs, q2, q1
+vorrt    q0, q1, q2
+vpt.u8  hi, q2, q1
+vorrt    q0, q1, q2
+vpt.u16 hi, q2, q1
+vorrt    q0, q1, q2
+vpt.u32 hi, q2, q1
+vorrt    q0, q1, q2
+vpt.s8  ge, q2, q1
+vorrt    q0, q1, q2
+vpt.s16 ge, q2, q1
+vorrt    q0, q1, q2
+vpt.s32 ge, q2, q1
+vorrt    q0, q1, q2
+vpt.s8  lt, q2, q1
+vorrt    q0, q1, q2
+vpt.s16 lt, q2, q1
+vorrt    q0, q1, q2
+vpt.s32 lt, q2, q1
+vorrt    q0, q1, q2
+vpt.s8  gt, q2, q1
+vorrt    q0, q1, q2
+vpt.s16 gt, q2, q1
+vorrt    q0, q1, q2
+vpt.s32 gt, q2, q1
+vorrt    q0, q1, q2
+vpt.s8  le, q2, q1
+vorrt    q0, q1, q2
+vpt.s16 le, q2, q1
+vorrt    q0, q1, q2
+vpt.s32 le, q2, q1
+vorrt    q0, q1, q2
+vpt.i8  eq, q2, r1
+vorrt    q0, q1, q2
+vpt.i16 eq, q2, r1
+vorrt    q0, q1, q2
+vpt.i32 eq, q2, r1
+vorrt    q0, q1, q2
+vpt.i8  ne, q2, r1
+vorrt    q0, q1, q2
+vpt.i16 ne, q2, r1
+vorrt    q0, q1, q2
+vpt.i32 ne, q2, r1
+vorrt    q0, q1, q2
+vpt.u8  cs, q2, r1
+vorrt    q0, q1, q2
+vpt.u16 cs, q2, r1
+vorrt    q0, q1, q2
+vpt.u32 cs, q2, r1
+vorrt    q0, q1, q2
+vpt.u8  hi, q2, r1
+vorrt    q0, q1, q2
+vpt.u16 hi, q2, r1
+vorrt    q0, q1, q2
+vpt.u32 hi, q2, r1
+vorrt    q0, q1, q2
+vpt.s8  ge, q2, r1
+vorrt    q0, q1, q2
+vpt.s16 ge, q2, r1
+vorrt    q0, q1, q2
+vpt.s32 ge, q2, r1
+vorrt    q0, q1, q2
+vpt.s8  lt, q2, r1
+vorrt    q0, q1, q2
+vpt.s16 lt, q2, r1
+vorrt    q0, q1, q2
+vpt.s32 lt, q2, r1
+vorrt    q0, q1, q2
+vpt.s8  gt, q2, r1
+vorrt    q0, q1, q2
+vpt.s16 gt, q2, r1
+vorrt    q0, q1, q2
+vpt.s32 gt, q2, r1
+vorrt    q0, q1, q2
+vpt.s8  le, q2, r1
+vorrt    q0, q1, q2
+vpt.s16 le, q2, r1
+vorrt    q0, q1, q2
+vpt.s32 le, q2, r1
+vorrt    q0, q1, q2
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f16	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.f32	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i8	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i16	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i32	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i8	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i16	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i32	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u8	cs, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u16	cs, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u32	cs, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u8	hi, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u16	hi, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u32	hi, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i8	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i16	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i32	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i8	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i16	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.i32	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u8	cs, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u16	cs, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u32	cs, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u8	hi, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u16	hi, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.u32	hi, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s8	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s16	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vcmp.s32	le, q2, r1
+# CHECK-NEXT:  1      1     1.00                        vctp.8	r0
+# CHECK-NEXT:  1      1     1.00                        vctp.16	r0
+# CHECK-NEXT:  1      1     1.00                        vctp.32	r0
+# CHECK-NEXT:  1      1     1.00                        vctp.64	r0
+# CHECK-NEXT:  1      1     1.00                  U     vpst
+# CHECK-NEXT:  1      1     2.00                        vmovt	q0, q0
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f16	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.f32	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i8	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i16	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i32	eq, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i8	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i16	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i32	ne, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u8	cs, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u16	cs, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u32	cs, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u8	hi, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u16	hi, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u32	hi, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	ge, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	lt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	gt, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	le, q2, q1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i8	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i16	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i32	eq, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i8	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i16	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.i32	ne, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u8	cs, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u16	cs, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u32	cs, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u8	hi, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u16	hi, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.u32	hi, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	ge, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	lt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	gt, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s8	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s16	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+# CHECK-NEXT:  1      1     2.00                  U     vpt.s32	le, q2, r1
+# CHECK-NEXT:  1      1     2.00                        vorrt	q0, q1, q2
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT:  -      -     146.00 288.00 5.00
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	eq, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	eq, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	ne, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	ne, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	ge, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	ge, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	lt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	lt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	gt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	gt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	le, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	le, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	eq, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	eq, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	ne, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	ne, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	ge, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	ge, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	lt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	lt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	gt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	gt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f16	le, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.f32	le, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i8	eq, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i16	eq, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i32	eq, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i8	ne, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i16	ne, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i32	ne, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u8	cs, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u16	cs, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u32	cs, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u8	hi, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u16	hi, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u32	hi, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	ge, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	ge, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	ge, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	lt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	lt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	lt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	gt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	gt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	gt, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	le, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	le, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	le, q2, q1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i8	eq, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i16	eq, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i32	eq, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i8	ne, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i16	ne, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.i32	ne, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u8	cs, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u16	cs, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u32	cs, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u8	hi, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u16	hi, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.u32	hi, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	ge, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	ge, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	ge, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	lt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	lt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	lt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	gt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	gt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	gt, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s8	le, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s16	le, q2, r1
+# CHECK-NEXT:  -      -      -     2.00    -     vcmp.s32	le, q2, r1
+# CHECK-NEXT:  -      -      -      -     1.00   vctp.8	r0
+# CHECK-NEXT:  -      -      -      -     1.00   vctp.16	r0
+# CHECK-NEXT:  -      -      -      -     1.00   vctp.32	r0
+# CHECK-NEXT:  -      -      -      -     1.00   vctp.64	r0
+# CHECK-NEXT:  -      -      -      -     1.00   vpst
+# CHECK-NEXT:  -      -     2.00    -      -     vmovt	q0, q0
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	eq, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	eq, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	ne, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	ne, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	ge, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	ge, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	lt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	lt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	gt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	gt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	le, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	le, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	eq, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	eq, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	ne, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	ne, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	ge, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	ge, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	lt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	lt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	gt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	gt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f16	le, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.f32	le, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i8	eq, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i16	eq, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i32	eq, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i8	ne, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i16	ne, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i32	ne, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u8	cs, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u16	cs, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u32	cs, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u8	hi, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u16	hi, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u32	hi, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	ge, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	ge, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	ge, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	lt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	lt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	lt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	gt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	gt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	gt, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	le, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	le, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	le, q2, q1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i8	eq, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i16	eq, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i32	eq, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i8	ne, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i16	ne, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.i32	ne, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u8	cs, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u16	cs, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u32	cs, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u8	hi, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u16	hi, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.u32	hi, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	ge, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	ge, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	ge, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	lt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	lt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	lt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	gt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	gt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	gt, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s8	le, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s16	le, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2
+# CHECK-NEXT:  -      -      -     2.00    -     vpt.s32	le, q2, r1
+# CHECK-NEXT:  -      -     2.00    -      -     vorrt	q0, q1, q2

diff  --git a/llvm/test/tools/llvm-mca/ARM/m55-storefwd.s b/llvm/test/tools/llvm-mca/ARM/m55-storefwd.s
new file mode 100644
index 0000000000000..3a744da93cfbf
--- /dev/null
+++ b/llvm/test/tools/llvm-mca/ARM/m55-storefwd.s
@@ -0,0 +1,269 @@
+# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
+# RUN: llvm-mca -mtriple=thumbv8.1-m.main-none-none-eabi -mcpu=cortex-m55 -timeline < %s | FileCheck %s
+
+# Most MVE operations are either latency=1 or can forward into stores
+vadd.i8  q0, q2, q1
+vstrb.8 q0, [r0, #0]
+vadd.f32  q0, q2, q1
+vstrb.8 q0, [r0, #0]
+vmul.i8  q0, q2, q1
+vstrb.8 q0, [r0, #0]
+vmlas.u32  q0, q2, r0
+vstrb.8 q0, [r0, #0]
+vfma.f16  q0, q2, q1
+vstrb.8 q0, [r0, #0]
+vmullb.s16  q0, q2, q1
+vstrb.8 q0, [r0, #0]
+vcvtt.f32.f16  q0, q2
+vstrb.8 q0, [r0, #0]
+vcvtb.f32.f16  q0, q2
+vstrb.8 q0, [r0, #0]
+
+# The ones that cannot are VCVT.f16.f32 t/b and any VMOVN/VQMOVN/VSHRN/VQSHRN/VRSHRN
+vmovnt.s16  q0, q2
+vstrb.8 q0, [r0, #0]
+vmovnb.u32  q0, q2
+vstrb.8 q0, [r0, #0]
+vqmovnt.s32  q0, q2
+vstrb.8 q0, [r0, #0]
+vqmovnb.u16  q0, q2
+vstrb.8 q0, [r0, #0]
+vshrnt.s32  q0, q2, #1
+vstrb.8 q0, [r0, #0]
+vshrnb.u16  q0, q2, #1
+vstrb.8 q0, [r0, #0]
+vqshrnt.s16  q0, q2, #1
+vstrb.8 q0, [r0, #0]
+vqshrnb.u32  q0, q2, #1
+vstrb.8 q0, [r0, #0]
+vrshrnt.s16  q0, q2, #1
+vstrb.8 q0, [r0, #0]
+vrshrnb.u16  q0, q2, #1
+vstrb.8 q0, [r0, #0]
+vcvtt.f16.f32  q0, q2
+vstrb.8 q0, [r0, #0]
+vcvtb.f16.f32  q0, q2
+vstrb.8 q0, [r0, #0]
+
+# CHECK:      Iterations:        100
+# CHECK-NEXT: Instructions:      4000
+# CHECK-NEXT: Total Cycles:      6401
+# CHECK-NEXT: Total uOps:        4000
+
+# CHECK:      Dispatch Width:    2
+# CHECK-NEXT: uOps Per Cycle:    0.62
+# CHECK-NEXT: IPC:               0.62
+# CHECK-NEXT: Block RThroughput: 40.0
+
+# CHECK:      Instruction Info:
+# CHECK-NEXT: [1]: #uOps
+# CHECK-NEXT: [2]: Latency
+# CHECK-NEXT: [3]: RThroughput
+# CHECK-NEXT: [4]: MayLoad
+# CHECK-NEXT: [5]: MayStore
+# CHECK-NEXT: [6]: HasSideEffects (U)
+
+# CHECK:      [1]    [2]    [3]    [4]    [5]    [6]    Instructions:
+# CHECK-NEXT:  1      1     2.00                        vadd.i8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      1     2.00                        vadd.f32	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      2     2.00                        vmul.i8	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      2     2.00                        vmlas.i32	q0, q2, r0
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      2     2.00                        vfma.f16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      2     2.00                        vmullb.s16	q0, q2, q1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      2     2.00                        vcvtt.f32.f16	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      2     2.00                        vcvtb.f32.f16	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vmovnt.i16	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vmovnb.i32	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vqmovnt.s32	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vqmovnb.u16	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vshrnt.i32	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vshrnb.i16	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vqshrnt.s16	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vqshrnb.u32	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vrshrnt.i16	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vrshrnb.i16	q0, q2, #1
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vcvtt.f16.f32	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+# CHECK-NEXT:  1      3     2.00                        vcvtb.f16.f32	q0, q2
+# CHECK-NEXT:  1      1     2.00           *            vstrb.8	q0, [r0]
+
+# CHECK:      Resources:
+# CHECK-NEXT: [0]   - M55UnitALU
+# CHECK-NEXT: [1]   - M55UnitLoadStore
+# CHECK-NEXT: [2]   - M55UnitVecALU
+# CHECK-NEXT: [3]   - M55UnitVecFPALU
+# CHECK-NEXT: [4]   - M55UnitVecSys
+
+# CHECK:      Resource pressure per iteration:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]
+# CHECK-NEXT:  -     40.00  22.00  18.00   -
+
+# CHECK:      Resource pressure by instruction:
+# CHECK-NEXT: [0]    [1]    [2]    [3]    [4]    Instructions:
+# CHECK-NEXT:  -      -     2.00    -      -     vadd.i8	q0, q2, q1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vadd.f32	q0, q2, q1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vmul.i8	q0, q2, q1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vmlas.i32	q0, q2, r0
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vfma.f16	q0, q2, q1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vmullb.s16	q0, q2, q1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtt.f32.f16	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtb.f32.f16	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vmovnt.i16	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vmovnb.i32	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnt.s32	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vqmovnb.u16	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vshrnt.i32	q0, q2, #1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vshrnb.i16	q0, q2, #1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnt.s16	q0, q2, #1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vqshrnb.u32	q0, q2, #1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vrshrnt.i16	q0, q2, #1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -     2.00    -      -     vrshrnb.i16	q0, q2, #1
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtt.f16.f32	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+# CHECK-NEXT:  -      -      -     2.00    -     vcvtb.f16.f32	q0, q2
+# CHECK-NEXT:  -     2.00    -      -      -     vstrb.8	q0, [r0]
+
+# CHECK:      Timeline view:
+# CHECK-NEXT:                     0123456789          0123456789          0123456789          0123456789
+# CHECK-NEXT: Index     0123456789          0123456789          0123456789          0123456789
+
+# CHECK:      [0,0]     DE   .    .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vadd.i8	q0, q2, q1
+# CHECK-NEXT: [0,1]     .DE  .    .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,2]     . DE .    .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vadd.f32	q0, q2, q1
+# CHECK-NEXT: [0,3]     .  DE.    .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,4]     .   DeE   .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vmul.i8	q0, q2, q1
+# CHECK-NEXT: [0,5]     .    DE   .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,6]     .    .DeE .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vmlas.i32	q0, q2, r0
+# CHECK-NEXT: [0,7]     .    . DE .    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,8]     .    .  DeE    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vfma.f16	q0, q2, q1
+# CHECK-NEXT: [0,9]     .    .   DE    .    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,10]    .    .    DeE  .    .    .    .    .    .    .    .    .    .    .    .    .   .   vmullb.s16	q0, q2, q1
+# CHECK-NEXT: [0,11]    .    .    .DE  .    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,12]    .    .    . DeE.    .    .    .    .    .    .    .    .    .    .    .    .   .   vcvtt.f32.f16	q0, q2
+# CHECK-NEXT: [0,13]    .    .    .  DE.    .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,14]    .    .    .   DeE   .    .    .    .    .    .    .    .    .    .    .    .   .   vcvtb.f32.f16	q0, q2
+# CHECK-NEXT: [0,15]    .    .    .    DE   .    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,16]    .    .    .    .DeeE.    .    .    .    .    .    .    .    .    .    .    .   .   vmovnt.i16	q0, q2
+# CHECK-NEXT: [0,17]    .    .    .    .   DE    .    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,18]    .    .    .    .    DeeE .    .    .    .    .    .    .    .    .    .    .   .   vmovnb.i32	q0, q2
+# CHECK-NEXT: [0,19]    .    .    .    .    .  DE.    .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,20]    .    .    .    .    .   DeeE  .    .    .    .    .    .    .    .    .    .   .   vqmovnt.s32	q0, q2
+# CHECK-NEXT: [0,21]    .    .    .    .    .    . DE .    .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,22]    .    .    .    .    .    .  DeeE   .    .    .    .    .    .    .    .    .   .   vqmovnb.u16	q0, q2
+# CHECK-NEXT: [0,23]    .    .    .    .    .    .    .DE  .    .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,24]    .    .    .    .    .    .    . DeeE    .    .    .    .    .    .    .    .   .   vshrnt.i32	q0, q2, #1
+# CHECK-NEXT: [0,25]    .    .    .    .    .    .    .    DE   .    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,26]    .    .    .    .    .    .    .    .DeeE.    .    .    .    .    .    .    .   .   vshrnb.i16	q0, q2, #1
+# CHECK-NEXT: [0,27]    .    .    .    .    .    .    .    .   DE    .    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,28]    .    .    .    .    .    .    .    .    DeeE .    .    .    .    .    .    .   .   vqshrnt.s16	q0, q2, #1
+# CHECK-NEXT: [0,29]    .    .    .    .    .    .    .    .    .  DE.    .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,30]    .    .    .    .    .    .    .    .    .   DeeE  .    .    .    .    .    .   .   vqshrnb.u32	q0, q2, #1
+# CHECK-NEXT: [0,31]    .    .    .    .    .    .    .    .    .    . DE .    .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,32]    .    .    .    .    .    .    .    .    .    .  DeeE   .    .    .    .    .   .   vrshrnt.i16	q0, q2, #1
+# CHECK-NEXT: [0,33]    .    .    .    .    .    .    .    .    .    .    .DE  .    .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,34]    .    .    .    .    .    .    .    .    .    .    . DeeE    .    .    .    .   .   vrshrnb.i16	q0, q2, #1
+# CHECK-NEXT: [0,35]    .    .    .    .    .    .    .    .    .    .    .    DE   .    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,36]    .    .    .    .    .    .    .    .    .    .    .    .DeeE.    .    .    .   .   vcvtt.f16.f32	q0, q2
+# CHECK-NEXT: [0,37]    .    .    .    .    .    .    .    .    .    .    .    .   DE    .    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [0,38]    .    .    .    .    .    .    .    .    .    .    .    .    DeeE .    .    .   .   vcvtb.f16.f32	q0, q2
+# CHECK-NEXT: [0,39]    .    .    .    .    .    .    .    .    .    .    .    .    .  DE.    .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,0]     .    .    .    .    .    .    .    .    .    .    .    .    .   DE    .    .   .   vadd.i8	q0, q2, q1
+# CHECK-NEXT: [1,1]     .    .    .    .    .    .    .    .    .    .    .    .    .    DE   .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,2]     .    .    .    .    .    .    .    .    .    .    .    .    .    .DE  .    .   .   vadd.f32	q0, q2, q1
+# CHECK-NEXT: [1,3]     .    .    .    .    .    .    .    .    .    .    .    .    .    . DE .    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,4]     .    .    .    .    .    .    .    .    .    .    .    .    .    .  DeE    .   .   vmul.i8	q0, q2, q1
+# CHECK-NEXT: [1,5]     .    .    .    .    .    .    .    .    .    .    .    .    .    .   DE    .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,6]     .    .    .    .    .    .    .    .    .    .    .    .    .    .    DeE  .   .   vmlas.i32	q0, q2, r0
+# CHECK-NEXT: [1,7]     .    .    .    .    .    .    .    .    .    .    .    .    .    .    .DE  .   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,8]     .    .    .    .    .    .    .    .    .    .    .    .    .    .    . DeE.   .   vfma.f16	q0, q2, q1
+# CHECK-NEXT: [1,9]     .    .    .    .    .    .    .    .    .    .    .    .    .    .    .  DE.   .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,10]    .    .    .    .    .    .    .    .    .    .    .    .    .    .    .   DeE  .   vmullb.s16	q0, q2, q1
+# CHECK-NEXT: [1,11]    .    .    .    .    .    .    .    .    .    .    .    .    .    .    .    DE  .   vstrb.8	q0, [r0]
+# CHECK-NEXT: [1,12]    .    .    .    .    .    .    .    .    .    .    .    .    .    .    .    .DeE.   vcvtt.f32.f16	q0, q2
+# CHECK-NEXT: [1,13]    .    .    .    .    .    .    .    .    .    .    .    .    .    .    .    . DE.   vstrb.8	q0, [r0]
+# CHECK-NEXT: Truncated display due to cycle limit
+
+# CHECK:      Average Wait times (based on the timeline view):
+# CHECK-NEXT: [0]: Executions
+# CHECK-NEXT: [1]: Average time spent waiting in a scheduler's queue
+# CHECK-NEXT: [2]: Average time spent waiting in a scheduler's queue while ready
+# CHECK-NEXT: [3]: Average time elapsed from WB until retire stage
+
+# CHECK:            [0]    [1]    [2]    [3]
+# CHECK-NEXT: 0.     10    0.0    0.0    0.0       vadd.i8	q0, q2, q1
+# CHECK-NEXT: 1.     10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 2.     10    0.0    0.0    0.0       vadd.f32	q0, q2, q1
+# CHECK-NEXT: 3.     10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 4.     10    0.0    0.0    0.0       vmul.i8	q0, q2, q1
+# CHECK-NEXT: 5.     10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 6.     10    0.0    0.0    0.0       vmlas.i32	q0, q2, r0
+# CHECK-NEXT: 7.     10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 8.     10    0.0    0.0    0.0       vfma.f16	q0, q2, q1
+# CHECK-NEXT: 9.     10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 10.    10    0.0    0.0    0.0       vmullb.s16	q0, q2, q1
+# CHECK-NEXT: 11.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 12.    10    0.0    0.0    0.0       vcvtt.f32.f16	q0, q2
+# CHECK-NEXT: 13.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 14.    10    0.0    0.0    0.0       vcvtb.f32.f16	q0, q2
+# CHECK-NEXT: 15.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 16.    10    0.0    0.0    0.0       vmovnt.i16	q0, q2
+# CHECK-NEXT: 17.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 18.    10    0.0    0.0    0.0       vmovnb.i32	q0, q2
+# CHECK-NEXT: 19.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 20.    10    0.0    0.0    0.0       vqmovnt.s32	q0, q2
+# CHECK-NEXT: 21.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 22.    10    0.0    0.0    0.0       vqmovnb.u16	q0, q2
+# CHECK-NEXT: 23.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 24.    10    0.0    0.0    0.0       vshrnt.i32	q0, q2, #1
+# CHECK-NEXT: 25.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 26.    10    0.0    0.0    0.0       vshrnb.i16	q0, q2, #1
+# CHECK-NEXT: 27.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 28.    10    0.0    0.0    0.0       vqshrnt.s16	q0, q2, #1
+# CHECK-NEXT: 29.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 30.    10    0.0    0.0    0.0       vqshrnb.u32	q0, q2, #1
+# CHECK-NEXT: 31.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 32.    10    0.0    0.0    0.0       vrshrnt.i16	q0, q2, #1
+# CHECK-NEXT: 33.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 34.    10    0.0    0.0    0.0       vrshrnb.i16	q0, q2, #1
+# CHECK-NEXT: 35.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 36.    10    0.0    0.0    0.0       vcvtt.f16.f32	q0, q2
+# CHECK-NEXT: 37.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT: 38.    10    0.0    0.0    0.0       vcvtb.f16.f32	q0, q2
+# CHECK-NEXT: 39.    10    0.0    0.0    0.0       vstrb.8	q0, [r0]
+# CHECK-NEXT:        10    0.0    0.0    0.0       <total>


        


More information about the llvm-commits mailing list