<div dir="ltr"><div dir="ltr">On Wed, 20 Jan 2021 at 09:14, Ali Shuja Siddiqui (alissidd) via cfe-dev <<a href="mailto:cfe-dev@lists.llvm.org">cfe-dev@lists.llvm.org</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-US" style="overflow-wrap: break-word;">
<div class="gmail-m_-373191350763611235WordSection1">
<p class="MsoNormal">Hello,<u></u><u></u></p>
<p class="MsoNormal">I am interested in learning about clang’s -Wsometimes-uninitialized implementation and how is it different than gcc’s -Wmaybe-uninitialized. For example, if we consider this code:<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">#include <stdlib.h><u></u><u></u></p>
<p class="MsoNormal">int test(int random, int cond1, int cond2) {<u></u><u></u></p>
<p class="MsoNormal"> volatile int *ptr;<u></u><u></u></p>
<p class="MsoNormal"> if (cond1) { //<<<< cond1<u></u><u></u></p>
<p class="MsoNormal"> ptr = (int *)calloc(1, sizeof(int));<u></u><u></u></p>
<p class="MsoNormal"> }<u></u><u></u></p>
<p class="MsoNormal"> if (cond2) { //<<<< cond2 , independent of cond1<u></u><u></u></p>
<p class="MsoNormal"> *ptr = 1;<u></u><u></u></p>
<p class="MsoNormal"> }<u></u><u></u></p>
<p class="MsoNormal"> return 0;<u></u><u></u></p>
<p class="MsoNormal">}<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">With gcc 8.3.1, I get the following output:<u></u><u></u></p>
<p class="MsoNormal"><b>$</b> gcc -g -c func.c -O1 -Wuninitialized<u></u><u></u></p>
<p class="MsoNormal">func.c: In function ‘test’:<u></u><u></u></p>
<p class="MsoNormal">func.c:8:10: warning: ‘ptr’ may be used uninitialized in this function [-Wmaybe-uninitialized]<u></u><u></u></p>
<p class="MsoNormal"> *ptr = 1;<u></u><u></u></p>
<p class="MsoNormal">(debug-llvm-11) /nobackup/alissidd/CSCvv46785$ vim func.c<u></u><u></u></p>
<p class="MsoNormal">(debug-llvm-11) /nobackup/alissidd/CSCvv46785$ gcc -O1 -Wuninitialized -Wmaybe-uninitialized -c func.c<u></u><u></u></p>
<p class="MsoNormal">func.c: In function ‘test’:<u></u><u></u></p>
<p class="MsoNormal">func.c:8:10: warning: ‘ptr’ may be used uninitialized in this function [-Wmaybe-uninitialized]<u></u><u></u></p>
<p class="MsoNormal"> *ptr = 1;<u></u><u></u></p>
<p class="MsoNormal"> ~~~~~^~~<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">However, with clang 11.0.0 I don’t get any warnings:<u></u><u></u></p>
<p class="MsoNormal"><b>$</b> clang -O1 -Wuninitialized -Wsometimes-uninitialized -c func.c<u></u><u></u></p>
<p class="MsoNormal"><b>$<u></u><u></u></b></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">It was detected by the static analyzer though:<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Scan-build clang -O1 -Wuninitialized -Wsometimes-uninitialized -c func.c<u></u><u></u></p>
<p class="MsoNormal">scan-build: Using 'clang-11' for static analysis<u></u><u></u></p>
<p class="MsoNormal">func.c:8:10: warning: Dereference of undefined pointer value [core.NullDereference]<u></u><u></u></p>
<p class="MsoNormal"> *ptr = 1;<u></u><u></u></p>
<p class="MsoNormal"> ~~~~~^~~<u></u><u></u></p>
<p class="MsoNormal">func.c:10:10: warning: Potential leak of memory pointed to by 'ptr' [unix.Malloc]<u></u><u></u></p>
<p class="MsoNormal"> return 0;<u></u><u></u></p>
<p class="MsoNormal"> ^<u></u><u></u></p>
<p class="MsoNormal">2 warnings generated.<u></u><u></u></p>
<p class="MsoNormal">scan-build: Analysis run complete.<u></u><u></u></p>
<p class="MsoNormal">scan-build: 2 bugs found.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">I did see the -Wsometimes-uninitialized flag working, if for example, I move the use of the
<b>ptr</b> variable outside of a condition:<u></u><u></u></p>
<p class="MsoNormal">#include <stdlib.h><u></u><u></u></p>
<p class="MsoNormal">int test(int random, int cond1, int cond2) {<u></u><u></u></p>
<p class="MsoNormal"> volatile int *ptr;<u></u><u></u></p>
<p class="MsoNormal"> if (cond1) { //<<<< cond1<u></u><u></u></p>
<p class="MsoNormal"> ptr = (int *)calloc(1, sizeof(int));<u></u><u></u></p>
<p class="MsoNormal"> }<u></u><u></u></p>
<p class="MsoNormal"> if (cond2) { //<<<< cond2 , independent of cond1<u></u><u></u></p>
<p class="MsoNormal"> *ptr = 1;<u></u><u></u></p>
<p class="MsoNormal"> }<u></u><u></u></p>
<p class="MsoNormal"> <b>return ptr</b>;<u></u><u></u></p>
<p class="MsoNormal">}<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"><b>$</b> clang -O1 -Wuninitialized -Wsometimes-uninitialized -c func.c<u></u><u></u></p>
<p class="MsoNormal">func.c:10:10: warning: incompatible pointer to integer conversion returning 'volatile int *' from a function with result type 'int' [-Wint-conversion]<u></u><u></u></p>
<p class="MsoNormal"> return ptr;<u></u><u></u></p>
<p class="MsoNormal"> ^~~<u></u><u></u></p>
<p class="MsoNormal">func.c:4:7: warning: variable 'ptr' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized]<u></u><u></u></p>
<p class="MsoNormal"> if (cond1) { //<<<< cond1<u></u><u></u></p>
<p class="MsoNormal"> ^~~~~<u></u><u></u></p>
<p class="MsoNormal">func.c:10:10: note: uninitialized use occurs here<u></u><u></u></p>
<p class="MsoNormal"> return ptr;<u></u><u></u></p>
<p class="MsoNormal"> ^~~<u></u><u></u></p>
<p class="MsoNormal">func.c:4:3: note: remove the 'if' if its condition is always true<u></u><u></u></p>
<p class="MsoNormal"> if (cond1) { //<<<< cond1<u></u><u></u></p>
<p class="MsoNormal"> ^~~~~~~~~~~<u></u><u></u></p>
<p class="MsoNormal">func.c:3:20: note: initialize the variable 'ptr' to silence this warning<u></u><u></u></p>
<p class="MsoNormal"> volatile int *ptr;<u></u><u></u></p>
<p class="MsoNormal"> ^<u></u><u></u></p>
<p class="MsoNormal"> = NULL<u></u><u></u></p>
<p class="MsoNormal">2 warnings generated.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">I’m new to clang development and would appreciate the help with the following questions:<u></u><u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_-373191350763611235MsoListParagraph" style="margin-left:0in">Are clang’s -Wsometimes-uninitialized and gcc’s -Wmaybe-unintialized comparable?</li></ul></div></div></blockquote><div>No. GCC's -Wmaybe-uninitialized is analogous to Clang's -Wconditional-uninitialized. Both warnings diagnose cases where the compiler cannot prove that the variable is initialized at each use. (That is: both warnings aim to avoid false negatives, at the expense of having false positives in cases where the compiler couldn't prove whether the variable was initialized or not.) GCC's -Wmaybe-uninitialized is a check performed after some optimizations (so depends on optimization level) whereas Clang's -Wconditional-uninitialized is a check performed by a simplistic static analysis on the original unoptimized program, so they have different strengths and weaknesses -- Clang's warning is stable in the face of changes to the optimizer and optimization level, whereas GCC's warning can see through the things the optimizer can see through and can eliminate more false positives as a result. (Clang's warning also only works on scalar variables, whereas GCC's can reason about struct members.)</div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div lang="EN-US" style="overflow-wrap: break-word;"><div class="gmail-m_-373191350763611235WordSection1"><ul style="margin-top:0in" type="disc"><li class="gmail-m_-373191350763611235MsoListParagraph" style="margin-left:0in"><u></u><u></u></li><li class="gmail-m_-373191350763611235MsoListParagraph" style="margin-left:0in">As is shown that a use of an uninitialized variable within a condition could not be detected by the -Wuninitialized/-Wsometimes-uninitialized flag, is this behavior by design?</li></ul></div></div></blockquote><div>Yes. The -Wsometimes-uninitialized flag is intended to diagnose cases where there is definitely a bug -- where we can prove the variable can be used while uninitialized -- on the assumption that the program contains no dead code. In your first test case, we cannot be sure the program contains a bug: it could be the case that cond2 is only nonzero when cond1 is also nonzero, in which case the program is correct. But in the second test case, the program definitely contains a bug, assuming the `if (cond1)` is not redundant: if it's possible that that `if` branch is not taken (if there is no dead code), then there is a live execution path through the function that results in the variable being used uninitialized.<br></div><div><br></div><div>Hence the naming: "conditional-uninitialized" / "maybe-uninitialized" warnings appear where the compiler is not sure whether there is an uninitialized use. "sometimes-uninitialized" warnings appear when the compiler is sure that there's an uninitialized use for some real executions of the function. And "uninitialized" warnings appear when the compiler is sure that there's an uninitialized use on every execution of the function. (-Wuninitialized also enables -Wsometimes-uninitialized because in practice people use -Wuninitialized when they want no false positives, and -Wsometimes-uninitialized is intended to have no false positives.)</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div lang="EN-US" style="overflow-wrap: break-word;"><div class="gmail-m_-373191350763611235WordSection1"><ul style="margin-top:0in" type="disc"><li class="gmail-m_-373191350763611235MsoListParagraph" style="margin-left:0in"><u></u><u></u></li><li class="gmail-m_-373191350763611235MsoListParagraph" style="margin-left:0in">I’m trying to understand the source code of the -Wuninitialized and -Wsometimes-initialized in particular and I was able to single out the files that maybe used to implement it:<u></u><u></u></li><ul style="margin-top:0in" type="circle">
<li class="MsoNormal" style="vertical-align:middle">clang/lib/Analysis/UninitializedValues.cpp<u></u><u></u></li><li class="MsoNormal" style="vertical-align:middle">clang/include/clang/Analysis/Analyses/UninitializedValues.h<u></u><u></u></li><li class="MsoNormal" style="vertical-align:middle">clang/lib/Sema/AnalysisBasedWarnings.cpp<u></u><u></u></li></ul>
</ul>
<p class="MsoNormal" style="margin-left:0.5in">Any pointers about how should I go about reading and understanding it will be very helpful.</p></div></div></blockquote><div><br></div><div>The implementation is written following the traditional structure of a data-flow analysis (as an iterated application of a transfer function along a control flow graph), so reading up about data-flow analyses should give you a better idea of what it's doing and how.</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div lang="EN-US" style="overflow-wrap: break-word;"><div class="gmail-m_-373191350763611235WordSection1">
<p class="MsoNormal">Thank you,<u></u><u></u></p>
<p class="MsoNormal">Ali<u></u><u></u></p>
</div>
</div>
_______________________________________________<br>
cfe-dev mailing list<br>
<a href="mailto:cfe-dev@lists.llvm.org" target="_blank">cfe-dev@lists.llvm.org</a><br>
<a href="https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev" rel="noreferrer" target="_blank">https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev</a><br>
</blockquote></div></div>