<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Feb 26, 2016 at 1:07 PM, David Blaikie <span dir="ltr"><<a href="mailto:dblaikie@gmail.com" target="_blank">dblaikie@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote"><span class="">On Fri, Feb 26, 2016 at 12:32 PM, David Li via llvm-commits <span dir="ltr"><<a href="mailto:llvm-commits@lists.llvm.org" target="_blank">llvm-commits@lists.llvm.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">davidxl created this revision.<br>
davidxl added a reviewer: vsk.<br>
davidxl added a subscriber: llvm-commits.<br>
<br>
The per function data and counter variables for available externally functions are created with linkonce linkage (to prevent compiler from dropping counter vars). However on ELF based systems, linkonce symbols are created as weak symbols and the duplicated entries are not removed by the linker. </blockquote><div><br></div></span><div>That ^ seems surprising/strange/confusing/problematic - any idea what's going on there? Is that a bug in the compiler? linker? specification of some kind?</div></div></div></div></blockquote><div><br></div><div>I am not sure what you questions is. Duplicate definitions of weak symbols are allowed so linker won't discard the duplicate copies. The symbol reference will be resolved the strong definition by the linker.</div><div><br></div><div>David</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div><div class="h5">For bad consequences it causes, see comments in the code. One example, the profile counts for _ZNKSs7_M_dataEv method is duplicated 655 times in raw profile data of clang. In the merged indexed profile, the counter values are magnified 655x -- totally dwarfed other functions -- this also distorted profile summary a lot leading to not useful profile data.<br>
<br>
<a href="http://reviews.llvm.org/D17654" rel="noreferrer" target="_blank">http://reviews.llvm.org/D17654</a><br>
<br>
Files:<br>
lib/Transforms/Instrumentation/InstrProfiling.cpp<br>
test/Instrumentation/InstrProfiling/linkage.ll<br>
<br>
Index: test/Instrumentation/InstrProfiling/linkage.ll<br>
===================================================================<br>
--- test/Instrumentation/InstrProfiling/linkage.ll<br>
+++ test/Instrumentation/InstrProfiling/linkage.ll<br>
@@ -7,6 +7,7 @@<br>
@__profn_foo_weak = weak hidden constant [8 x i8] c"foo_weak"<br>
@"__profn_linkage.ll:foo_internal" = internal constant [23 x i8] c"linkage.ll:foo_internal"<br>
@__profn_foo_inline = linkonce_odr hidden constant [10 x i8] c"foo_inline"<br>
+@__profn_foo_extern = linkonce_odr hidden constant [10 x i8] c"foo_extern"<br>
<br>
; COMMON: @__profc_foo = hidden global<br>
; COMMON: @__profd_foo = hidden global<br>
@@ -36,6 +37,15 @@<br>
ret void<br>
}<br>
<br>
+; LINUX: @__profc_foo_extern = linkonce_odr hidden global {{.*}}section "__llvm_prf_cnts", comdat($__profv_foo_extern), align 8<br>
+; LINUX: @__profd_foo_extern = linkonce_odr hidden global {{.*}}section "__llvm_prf_data", comdat($__profv_foo_extern), align 8<br>
+; OTHER: @__profc_foo_extern = linkonce_odr hidden global<br>
+; OTHER: @__profd_foo_extern = linkonce_odr hidden global<br>
+define available_externally void @foo_extern() {<br>
+ call void @llvm.instrprof.increment(i8* getelementptr inbounds ([10 x i8], [10 x i8]* @__profn_foo_extern, i32 0, i32 0), i64 0, i32 1, i32 0)<br>
+ ret void<br>
+}<br>
+<br>
declare void @llvm.instrprof.increment(i8*, i64, i32, i32)<br>
<br>
; OTHER: @__llvm_profile_runtime = external global i32<br>
Index: lib/Transforms/Instrumentation/InstrProfiling.cpp<br>
===================================================================<br>
--- lib/Transforms/Instrumentation/InstrProfiling.cpp<br>
+++ lib/Transforms/Instrumentation/InstrProfiling.cpp<br>
@@ -286,8 +286,38 @@<br>
return F->hasAddressTaken();<br>
}<br>
<br>
-static inline Comdat *getOrCreateProfileComdat(Module &M,<br>
+static inline bool needsComdatForCounter(Function &F, Module &M) {<br>
+<br>
+ if (F.hasComdat())<br>
+ return true;<br>
+<br>
+ Triple TT(M.getTargetTriple());<br>
+ if (!TT.isOSBinFormatELF())<br>
+ return false;<br>
+<br>
+ // See createPGOFuncNameVar for more details. To avoid link errors, profile<br>
+ // counters for function with available_externally linkage needs to be changed<br>
+ // to linkonce linkage. On ELF based systems, this leads to weak symbols to be<br>
+ // created. Without using comdat, duplicate entries won't be removed by the<br>
+ // linker leading to increased data segement size and raw profile size. Even<br>
+ // worse, since the referenced counter from profile per-function data object<br>
+ // will be resolved to the common strong definition, the profile counts for<br>
+ // available_externally functions will end up being duplicated in raw profile<br>
+ // data. This can result in distorted profile as the counts of those dups<br>
+ // will be accumulated by the profile merger.<br>
+ GlobalValue::LinkageTypes Linkage = F.getLinkage();<br>
+ if (Linkage != GlobalValue::ExternalWeakLinkage &&<br>
+ Linkage != GlobalValue::AvailableExternallyLinkage)<br>
+ return false;<br>
+<br>
+ return true;<br>
+}<br>
+<br>
+static inline Comdat *getOrCreateProfileComdat(Module &M, Function &F,<br>
InstrProfIncrementInst *Inc) {<br>
+ if (!needsComdatForCounter(F, M))<br>
+ return nullptr;<br>
+<br>
// COFF format requires a COMDAT section to have a key symbol with the same<br>
// name. The linker targeting COFF also requires that the COMDAT<br>
// a section is associated to must precede the associating section. For this<br>
@@ -315,8 +345,7 @@<br>
// linking.<br>
Function *Fn = Inc->getParent()->getParent();<br>
Comdat *ProfileVarsComdat = nullptr;<br>
- if (Fn->hasComdat())<br>
- ProfileVarsComdat = getOrCreateProfileComdat(*M, Inc);<br>
+ ProfileVarsComdat = getOrCreateProfileComdat(*M, *Fn, Inc);<br>
<br>
uint64_t NumCounters = Inc->getNumCounters()->getZExtValue();<br>
LLVMContext &Ctx = M->getContext();<br>
<br>
<br>
<br></div></div>_______________________________________________<br>
llvm-commits mailing list<br>
<a href="mailto:llvm-commits@lists.llvm.org" target="_blank">llvm-commits@lists.llvm.org</a><br>
<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits</a><br>
<br></blockquote></div><br></div></div>
</blockquote></div><br></div></div>