[llvm] r291254 - [AArch64] Reduce vector insert/extract cost for Falkor.
Chad Rosier via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 6 10:03:27 PST 2017
Author: mcrosier
Date: Fri Jan 6 12:03:26 2017
New Revision: 291254
URL: http://llvm.org/viewvc/llvm-project?rev=291254&view=rev
Log:
[AArch64] Reduce vector insert/extract cost for Falkor.
Differential Revision: https://reviews.llvm.org/D28403
Added:
llvm/trunk/test/Analysis/CostModel/AArch64/falkor.ll
Modified:
llvm/trunk/lib/Target/AArch64/AArch64Subtarget.cpp
Modified: llvm/trunk/lib/Target/AArch64/AArch64Subtarget.cpp
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/AArch64/AArch64Subtarget.cpp?rev=291254&r1=291253&r2=291254&view=diff
==============================================================================
--- llvm/trunk/lib/Target/AArch64/AArch64Subtarget.cpp (original)
+++ llvm/trunk/lib/Target/AArch64/AArch64Subtarget.cpp Fri Jan 6 12:03:26 2017
@@ -71,6 +71,7 @@ void AArch64Subtarget::initializePropert
break;
case Falkor:
MaxInterleaveFactor = 4;
+ VectorInsertExtractBaseCost = 2;
break;
case Kryo:
MaxInterleaveFactor = 4;
Added: llvm/trunk/test/Analysis/CostModel/AArch64/falkor.ll
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/test/Analysis/CostModel/AArch64/falkor.ll?rev=291254&view=auto
==============================================================================
--- llvm/trunk/test/Analysis/CostModel/AArch64/falkor.ll (added)
+++ llvm/trunk/test/Analysis/CostModel/AArch64/falkor.ll Fri Jan 6 12:03:26 2017
@@ -0,0 +1,26 @@
+; RUN: opt < %s -cost-model -analyze -mcpu=falkor | FileCheck %s
+
+target datalayout = "e-m:e-i64:64-i128:128-n32:64-S128"
+target triple = "aarch64--linux-gnu"
+
+; CHECK-LABEL: vectorInstrCost
+define void @vectorInstrCost() {
+
+ ; Vector extracts - extracting the first element should have a zero cost;
+ ; all other elements should have a cost of two.
+ ;
+ ; CHECK: cost of 0 {{.*}} extractelement <2 x i64> undef, i32 0
+ ; CHECK: cost of 2 {{.*}} extractelement <2 x i64> undef, i32 1
+ %t1 = extractelement <2 x i64> undef, i32 0
+ %t2 = extractelement <2 x i64> undef, i32 1
+
+ ; Vector inserts - inserting the first element should have a zero cost; all
+ ; other elements should have a cost of two.
+ ;
+ ; CHECK: cost of 0 {{.*}} insertelement <2 x i64> undef, i64 undef, i32 0
+ ; CHECK: cost of 2 {{.*}} insertelement <2 x i64> undef, i64 undef, i32 1
+ %t3 = insertelement <2 x i64> undef, i64 undef, i32 0
+ %t4 = insertelement <2 x i64> undef, i64 undef, i32 1
+
+ ret void
+}
More information about the llvm-commits
mailing list