r337456 - [CodeGen] Disable aggressive structor optimizations at -O0, take 3

Chandler Carruth via cfe-commits cfe-commits at lists.llvm.org
Sat Jul 28 02:26:32 PDT 2018


On Thu, Jul 19, 2018 at 7:10 AM Pavel Labath via cfe-commits <
cfe-commits at lists.llvm.org> wrote:

> Author: labath
> Date: Thu Jul 19 07:05:22 2018
> New Revision: 337456
>
> URL: http://llvm.org/viewvc/llvm-project?rev=337456&view=rev
> Log:
> [CodeGen] Disable aggressive structor optimizations at -O0, take 3
>

I'm really sorry to do this, but I need to revert this patch.... Let me try
to explain....

We're seeing at least two issues with it that are causing *serious* issues
for our users, and due to numerous other things, we really need to get
top-of-tree into usable shape sooner rather than later. As this is
improving a long-standing deficiency in Clang, I think it is reasonable to
revert temporarily while we sort it out. I'm CC-ing Richard Smith and Eric
Christopher directly as I'm going to ask them to make sure we get
satisfactory answers to why this patch causes us so many problems and how
we can make progress here.

I don't have a test case at the moment, and I want to *very clearly* call
out that it is on us to find a test case or otherwise clearly explain what
problem this patch causes and how it can be addressed, or else this patch
should be re-landed.

To at least give some idea of what is going wrong here......

First, this patch does increase object code size. This isn't really
unexpected based on the nature of the patch, and it does so ostensibly in
order to gain material fidelity improvements to debug information. Despite
the fact that the increased object size causes us problems (it made a few
hundered of our binaries' inputs too large to fit into the quota for the
input files to our internal distributed build system) we tried to soldier
onward...

But it also causes the Gold linker at least to use considerably more memory
than it used to. This has resulted in over 400 failures to link executables
due to running out of available memory on the system.

There are a number of possible causes for both the input size issues and
the linker memory issue:
- An unexpected side-effect of this change causes lots of redundant
sections to be output with -ffunction-sections, all of which merge away
into nothing, but this takes huge amounts of object file and linker
resources.
- The object file size increease is expected and unavoidable, but there is
some bug in the linker being triggered?
- Something else

It is going to take us some time to investigate these issues, and sadly,
all of these issues are breaking builds actively for us.

So while we will work very hard on getting to a resolution, if you also
really need this functionality to land sooner rather than later, what I
would request is to add it back under a flag. In fact, if you need that and
request it, we can do the actual work of adding it back under a flag. I
think that's the least we can do here.

Anyways, again, sorry to revert with relatively little warning and with
relatively little info.

Feel free to reach out to myself or Richard Smith, or Eric Christopher who
will all be looking at this first thing next week to understand exactly
what is going on here and what the root causes is and whether any of these
issues are due to bugs in some tool or another, or whether these are all
due to some particular issue with our source code.

-Chandler


> The previous version of this patch (r332839) was reverted because it was
> causing "definition with same mangled name as another definition" errors
> in some module builds. This was caused by an unrelated bug in module
> importing which it exposed. The importing problem was fixed in r336240,
> so this recommits the original patch (r332839).
>
> Differential Revision: https://reviews.llvm.org/D46685
>
> Modified:
>     cfe/trunk/lib/CodeGen/ItaniumCXXABI.cpp
>     cfe/trunk/test/CodeGenCXX/ctor-dtor-alias.cpp
>     cfe/trunk/test/CodeGenCXX/float16-declarations.cpp
>
> Modified: cfe/trunk/lib/CodeGen/ItaniumCXXABI.cpp
> URL:
> http://llvm.org/viewvc/llvm-project/cfe/trunk/lib/CodeGen/ItaniumCXXABI.cpp?rev=337456&r1=337455&r2=337456&view=diff
>
> ==============================================================================
> --- cfe/trunk/lib/CodeGen/ItaniumCXXABI.cpp (original)
> +++ cfe/trunk/lib/CodeGen/ItaniumCXXABI.cpp Thu Jul 19 07:05:22 2018
> @@ -3737,12 +3737,22 @@ static StructorCodegen getCodegenToUse(C
>    }
>    llvm::GlobalValue::LinkageTypes Linkage =
> CGM.getFunctionLinkage(AliasDecl);
>
> -  if (llvm::GlobalValue::isDiscardableIfUnused(Linkage))
> -    return StructorCodegen::RAUW;
> +  // All discardable structors can be RAUWed, but we don't want to do
> that in
> +  // unoptimized code, as that makes complete structor symbol disappear
> +  // completely, which degrades debugging experience.
> +  // Symbols with private linkage can be safely aliased, so we special
> case them
> +  // here.
> +  if (llvm::GlobalValue::isLocalLinkage(Linkage))
> +    return CGM.getCodeGenOpts().OptimizationLevel > 0 ?
> StructorCodegen::RAUW
> +                                                      :
> StructorCodegen::Alias;
>
> +  // Linkonce structors cannot be aliased nor placed in a comdat, so
> these need
> +  // to be emitted separately.
>    // FIXME: Should we allow available_externally aliases?
> -  if (!llvm::GlobalAlias::isValidLinkage(Linkage))
> -    return StructorCodegen::RAUW;
> +  if (llvm::GlobalValue::isDiscardableIfUnused(Linkage) ||
> +      !llvm::GlobalAlias::isValidLinkage(Linkage))
> +    return CGM.getCodeGenOpts().OptimizationLevel > 0 ?
> StructorCodegen::RAUW
> +                                                      :
> StructorCodegen::Emit;
>
>    if (llvm::GlobalValue::isWeakForLinker(Linkage)) {
>      // Only ELF and wasm support COMDATs with arbitrary names (C5/D5).
>
> Modified: cfe/trunk/test/CodeGenCXX/ctor-dtor-alias.cpp
> URL:
> http://llvm.org/viewvc/llvm-project/cfe/trunk/test/CodeGenCXX/ctor-dtor-alias.cpp?rev=337456&r1=337455&r2=337456&view=diff
>
> ==============================================================================
> --- cfe/trunk/test/CodeGenCXX/ctor-dtor-alias.cpp (original)
> +++ cfe/trunk/test/CodeGenCXX/ctor-dtor-alias.cpp Thu Jul 19 07:05:22 2018
> @@ -1,5 +1,7 @@
> -// RUN: %clang_cc1 %s -triple i686-linux -emit-llvm -o -
> -mconstructor-aliases | FileCheck --check-prefix=NOOPT %s
> -
> +// RUN: %clang_cc1 %s -triple i686-linux -emit-llvm -o -
> -mconstructor-aliases > %t
> +// RUN: FileCheck --check-prefix=NOOPT1 --input-file=%t %s
> +// RUN: FileCheck --check-prefix=NOOPT2 --input-file=%t %s
> +// RUN: FileCheck --check-prefix=NOOPT3 --input-file=%t %s
>  // RUN: %clang_cc1 %s -triple i686-linux -emit-llvm -o -
> -mconstructor-aliases -O1 -disable-llvm-passes > %t
>  // RUN: FileCheck --check-prefix=CHECK1 --input-file=%t %s
>  // RUN: FileCheck --check-prefix=CHECK2 --input-file=%t %s
> @@ -21,6 +23,13 @@ namespace test1 {
>  // CHECK1: define weak_odr void @_ZN5test16foobarIvED0Ev({{.*}}
> comdat($_ZN5test16foobarIvED5Ev)
>  // CHECK1-NOT: comdat
>
> +// This should happen regardless of the opt level.
> +// NOOPT1: @_ZN5test16foobarIvEC1Ev = weak_odr unnamed_addr alias void
> {{.*}} @_ZN5test16foobarIvEC2Ev
> +// NOOPT1: @_ZN5test16foobarIvED1Ev = weak_odr unnamed_addr alias void
> (%"struct.test1::foobar"*), void (%"struct.test1::foobar"*)*
> @_ZN5test16foobarIvED2Ev
> +// NOOPT1: define weak_odr void @_ZN5test16foobarIvEC2Ev({{.*}}
> comdat($_ZN5test16foobarIvEC5Ev)
> +// NOOPT1: define weak_odr void @_ZN5test16foobarIvED2Ev({{.*}}
> comdat($_ZN5test16foobarIvED5Ev)
> +// NOOPT1: define weak_odr void @_ZN5test16foobarIvED0Ev({{.*}}
> comdat($_ZN5test16foobarIvED5Ev)
> +
>  // COFF doesn't support comdats with arbitrary names (C5/D5).
>  // COFF: define weak_odr {{.*}} void @_ZN5test16foobarIvEC2Ev({{.*}}
> comdat align
>  // COFF: define weak_odr {{.*}} void @_ZN5test16foobarIvEC1Ev({{.*}}
> comdat align
> @@ -37,12 +46,17 @@ template struct foobar<void>;
>  }
>
>  namespace test2 {
> -// test that when the destrucor is linkonce_odr we just replace every use
> of
> +// test that when the destructor is linkonce_odr we just replace every
> use of
>  // C1 with C2.
>
>  // CHECK1: define internal void @__cxx_global_var_init()
>  // CHECK1: call void @_ZN5test26foobarIvEC2Ev
>  // CHECK1: define linkonce_odr void @_ZN5test26foobarIvEC2Ev({{.*}}
> comdat align
> +
> +// At -O0, we should still emit the complete constructor.
> +// NOOPT1: define internal void @__cxx_global_var_init()
> +// NOOPT1: call void @_ZN5test26foobarIvEC1Ev
> +// NOOPT1: define linkonce_odr void @_ZN5test26foobarIvEC1Ev({{.*}}
> comdat align
>  void g();
>  template <typename T> struct foobar {
>    foobar() { g(); }
> @@ -57,6 +71,11 @@ namespace test3 {
>  // CHECK1: define internal void @__cxx_global_var_init.1()
>  // CHECK1: call i32 @__cxa_atexit{{.*}}_ZN5test312_GLOBAL__N_11AD2Ev
>  // CHECK1: define internal void @_ZN5test312_GLOBAL__N_11AD2Ev(
> +
> +// We can use an alias for internal symbols at -O0.
> +// NOOPT2: _ZN5test312_GLOBAL__N_11BD1Ev = internal unnamed_addr alias
> void {{.*}} @_ZN5test312_GLOBAL__N_11BD2Ev
> +// NOOPT2: define internal void @__cxx_global_var_init.1()
> +// NOOPT2: call i32 @__cxa_atexit{{.*}}_ZN5test312_GLOBAL__N_11BD1Ev
>  namespace {
>  struct A {
>    ~A() {}
> @@ -77,11 +96,12 @@ namespace test4 {
>    // CHECK1: call i32 @__cxa_atexit{{.*}}_ZN5test41AD2Ev
>    // CHECK1: define linkonce_odr void @_ZN5test41AD2Ev({{.*}} comdat align
>
> -  // test that we don't do this optimization at -O0 so that the debugger
> can
> -  // see both destructors.
> -  // NOOPT: define internal void @__cxx_global_var_init.2()
> -  // NOOPT: call i32 @__cxa_atexit{{.*}}@_ZN5test41BD2Ev
> -  // NOOPT: define linkonce_odr void @_ZN5test41BD2Ev({{.*}} comdat align
> +  // Test that we don't do this optimization at -O0 and call the complete
> +  // destructor for B instead. This enables the debugger to see both
> +  // destructors.
> +  // NOOPT2: define internal void @__cxx_global_var_init.2()
> +  // NOOPT2: call i32 @__cxa_atexit{{.*}}@_ZN5test41BD1Ev
> +  // NOOPT2: define linkonce_odr void @_ZN5test41BD1Ev({{.*}} comdat align
>    struct A {
>      virtual ~A() {}
>    };
> @@ -129,6 +149,11 @@ namespace test7 {
>    // out if we should).
>    // pr17875.
>    // CHECK3: define void @_ZN5test71BD2Ev
> +
> +  // At -O0, we should emit both destructors, the complete can be an
> alias to
> +  // the base one.
> +  // NOOPT3: @_ZN5test71BD1Ev = unnamed_addr alias void {{.*}}
> @_ZN5test71BD2Ev
> +  // NOOPT3: define void @_ZN5test71BD2Ev
>    template <typename> struct A {
>      ~A() {}
>    };
>
> Modified: cfe/trunk/test/CodeGenCXX/float16-declarations.cpp
> URL:
> http://llvm.org/viewvc/llvm-project/cfe/trunk/test/CodeGenCXX/float16-declarations.cpp?rev=337456&r1=337455&r2=337456&view=diff
>
> ==============================================================================
> --- cfe/trunk/test/CodeGenCXX/float16-declarations.cpp (original)
> +++ cfe/trunk/test/CodeGenCXX/float16-declarations.cpp Thu Jul 19 07:05:22
> 2018
> @@ -103,7 +103,7 @@ int main(void) {
>
>    C1 c1(f1l);
>  // CHECK-DAG:  [[F1L:%[a-z0-9]+]] = load half, half* %{{.*}}, align 2
> -// CHECK-DAG:  call void @_ZN2C1C2EDF16_(%class.C1* %{{.*}}, half %{{.*}})
> +// CHECK-DAG:  call void @_ZN2C1C1EDF16_(%class.C1* %{{.*}}, half %{{.*}})
>
>    S1<_Float16> s1 = { 132.f16 };
>  // CHECK-DAG: @_ZZ4mainE2s1 = private unnamed_addr constant %struct.S1 {
> half 0xH5820 }, align 2
>
>
> _______________________________________________
> cfe-commits mailing list
> cfe-commits at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-commits/attachments/20180728/31277dd4/attachment-0001.html>


More information about the cfe-commits mailing list