[PATCH] D21405: [PGO] IRPGO pre-cleanup pass changes

Rong Xu via llvm-commits llvm-commits at lists.llvm.org
Wed Jun 29 15:02:24 PDT 2016


On Wed, Jun 29, 2016 at 2:56 PM, Xinliang David Li <davidxl at google.com>
wrote:

>
>
> On Wed, Jun 29, 2016 at 2:51 PM, Rong Xu <xur at google.com> wrote:
>
>> xur updated this revision to Diff 62287.
>> xur added a comment.
>>
>> ere is the updated patch. Changes are:
>> (1) Removed options preinline as suggested by mehdi.
>> (2) Updated the cleanup passes list.
>>
>> I used google internal benchmarks to measure the performance. The cleanup
>> pass have ~2% geo-mean on the profile-generate performance. The largest
>> saving is 17%.
>>
>
>
> You may want to clarify it is the effect of those passes other than
> pre-inline.
>

Yes. Thanks for pointing this out: This numbers are measured with
pre-intrumentation inline always on, just on/off these cleanup passes.


>
> David
>
>>
>> David's suggestion on using SROA and EarlyCSE is good. It has positive
>> impacts on a number of programs (largest ~10%)
>>
>> Jumpthreading actually helps a few programs also (~3%).
>>
>> In this patch, I propose to use
>> SROA  + EArlyCSE + CFGSimplification + InstructionCombining
>> and remove the rest.
>>
>>
>> http://reviews.llvm.org/D21405
>>
>> Files:
>>   lib/Transforms/IPO/PassManagerBuilder.cpp
>>   test/Transforms/PGOProfile/preinline.ll
>>
>> Index: test/Transforms/PGOProfile/preinline.ll
>> ===================================================================
>> --- /dev/null
>> +++ test/Transforms/PGOProfile/preinline.ll
>> @@ -0,0 +1,20 @@
>> +; RUN: opt < %s -O2 -profile-generate=default.profraw -S | FileCheck %s
>> --check-prefix=GEN
>> +target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
>> +target triple = "x86_64-unknown-linux-gnu"
>> +
>> +define i32 @foo(i32 %i) {
>> +entry:
>> +; GEN: %pgocount = load i64, i64* getelementptr inbounds ([1 x i64], [1
>> x i64]* @__profc_foo
>> +; GEN-NOT: %pgocount.i = load i64, i64* getelementptr inbounds ([1 x
>> i64], [1 x i64]* @__profc__stdin__bar
>> +  %call = call i32 @bar()
>> +  %add = add nsw i32 %i, %call
>> +  ret i32 %add
>> +}
>> +
>> +define internal i32 @bar() {
>> +entry:
>> +  %call = call i32 (...) @bar1()
>> +  ret i32 %call
>> +}
>> +
>> +declare i32 @bar1(...)
>> Index: lib/Transforms/IPO/PassManagerBuilder.cpp
>> ===================================================================
>> --- lib/Transforms/IPO/PassManagerBuilder.cpp
>> +++ lib/Transforms/IPO/PassManagerBuilder.cpp
>> @@ -114,6 +114,10 @@
>>      "enable-loop-versioning-licm", cl::init(false), cl::Hidden,
>>      cl::desc("Enable the experimental Loop Versioning LICM pass"));
>>
>> +static cl::opt<bool>
>> +    DisablePreInliner("disable-preinline", cl::init(false), cl::Hidden,
>> +                      cl::desc("Disable pre-instrumentation inliner"));
>> +
>>  PassManagerBuilder::PassManagerBuilder() {
>>      OptLevel = 2;
>>      SizeLevel = 0;
>> @@ -202,8 +206,30 @@
>>    FPM.add(createLowerExpectIntrinsicPass());
>>  }
>>
>> +// Get the inline threshold for pre-instrumentation inline.
>> +static int computePreInlineThresholdFromOptLevels(unsigned OptLevel,
>> +                                                  unsigned SizeOptLevel)
>> {
>> +  if (SizeOptLevel == 1) // -Os
>> +    return 75;
>> +  if (SizeOptLevel == 2) // -Oz
>> +    return 25;
>> +  return 75;
>> +}
>> +
>>  // Do PGO instrumentation generation or use pass as the option specified.
>>  void PassManagerBuilder::addPGOInstrPasses(legacy::PassManagerBase &MPM)
>> {
>> +  if (PGOInstrGen.empty() && PGOInstrUse.empty())
>> +    return;
>> +  if (OptLevel > 0 && !DisablePreInliner) {
>> +    // Create preinline pass.
>> +    MPM.add(createFunctionInliningPass(
>> +        computePreInlineThresholdFromOptLevels(OptLevel, SizeLevel)));
>> +    MPM.add(createSROAPass());
>> +    MPM.add(createEarlyCSEPass());             // Catch trivial
>> redundancies
>> +    MPM.add(createCFGSimplificationPass());    // Merge & remove BBs
>> +    MPM.add(createInstructionCombiningPass()); // Combine silly seq's
>> +    addExtensionsToPM(EP_Peephole, MPM);
>> +  }
>>    if (!PGOInstrGen.empty()) {
>>      MPM.add(createPGOInstrumentationGenLegacyPass());
>>      // Add the profile lowering pass.
>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160629/c561a127/attachment.html>


More information about the llvm-commits mailing list