[PATCH] D36775: Increase tail dup threshold for -O3 from 3 to 4
Richard Smith - zygoloid via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Aug 15 16:47:30 PDT 2017
rsmith created this revision.
Herald added subscribers: sdardis, sanjoy.
We see a modest performance improvement from this slightly higher tail dup threshold (~0.1% geomean across the suite).
Repository:
rL LLVM
https://reviews.llvm.org/D36775
Files:
lib/CodeGen/MachineBlockPlacement.cpp
test/CodeGen/Mips/brconge.ll
test/CodeGen/Mips/brconle.ll
test/CodeGen/X86/tail-dup-repeat.ll
Index: test/CodeGen/X86/tail-dup-repeat.ll
===================================================================
--- test/CodeGen/X86/tail-dup-repeat.ll
+++ test/CodeGen/X86/tail-dup-repeat.ll
@@ -1,4 +1,4 @@
-; RUN: llc -O3 -tail-dup-placement-threshold=4 -o - %s | FileCheck %s
+; RUN: llc -O3 -o - %s | FileCheck %s
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"
Index: test/CodeGen/Mips/brconle.ll
===================================================================
--- test/CodeGen/Mips/brconle.ll
+++ test/CodeGen/Mips/brconle.ll
@@ -1,4 +1,4 @@
-; RUN: llc -march=mipsel -mattr=mips16 -relocation-model=pic -O3 < %s | FileCheck %s -check-prefix=16
+; RUN: llc -march=mipsel -mattr=mips16 -relocation-model=pic -O2 < %s | FileCheck %s -check-prefix=16
@i = global i32 -5, align 4
@j = global i32 10, align 4
Index: test/CodeGen/Mips/brconge.ll
===================================================================
--- test/CodeGen/Mips/brconge.ll
+++ test/CodeGen/Mips/brconge.ll
@@ -1,4 +1,4 @@
-; RUN: llc -march=mipsel -mattr=mips16 -relocation-model=pic -O3 < %s | FileCheck %s -check-prefix=16
+; RUN: llc -march=mipsel -mattr=mips16 -relocation-model=pic -O2 < %s | FileCheck %s -check-prefix=16
@i = global i32 5, align 4
@j = global i32 10, align 4
Index: lib/CodeGen/MachineBlockPlacement.cpp
===================================================================
--- lib/CodeGen/MachineBlockPlacement.cpp
+++ lib/CodeGen/MachineBlockPlacement.cpp
@@ -143,7 +143,7 @@
"tail-dup-placement-aggressive-threshold",
cl::desc("Instruction cutoff for aggressive tail duplication during "
"layout. Used at -O3. Tail merging during layout is forced to "
- "have a threshold that won't conflict."), cl::init(3),
+ "have a threshold that won't conflict."), cl::init(4),
cl::Hidden);
// Heuristic for tail duplication.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D36775.111283.patch
Type: text/x-patch
Size: 1943 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170815/11298d7c/attachment.bin>
More information about the llvm-commits
mailing list