<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">On 30 June 2015 at 14:30, Nuno Lopes <span dir="ltr"><<a href="mailto:nunoplopes@sapo.pt" target="_blank">nunoplopes@sapo.pt</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class=""><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

Interesting.  Could you give an example why knowing a function will<br>

halt is<br>

essential for the heap-to-stack conversion?<br>

</blockquote>

<br>

The key situation you need to establish in order to do heap-to-stack conversion, is that you can see the calls to free (or realloc, etc.) along all control-flow paths. If you can, then you can perform the conversion (because any additional calls to free that you can't observe would lead to a double free, and thus undefined behavior). Thus, if we have this situation:<br>

<br>

void bar(int *a);<br>

void foo() {<br>

  int *a = (int*) malloc(sizeof(int)*40);<br>

  bar(a);<br>

  free(a);<br>

}<br>

<br>

we can perform heap-to-stack conversion iff we know that bar(int *) always returns normally. If it never returns (perhaps by looping indefinitely) then it might capture the pointer, pass it off to some other thread, and that other thread might call free() (or it might just call free() itself before looping indefinitely). In short, we need to know whether the call to free() after the call to bar() is dead. If we know that it is reached, then we can perform heap-to-stack conversion. Also worth noting is that because we unconditionally free(a) after the call to bar(a), it would not be legal for bar(a) to call realloc on a (because if realloc did reallocate the buffer we'd end up freeing it twice when bar(a) did eventually return).<br>

</blockquote>

<br></span>

I see, thanks!<br>

Your argument is that knowing that bar returns implies that 'a' cannot be captured or reallocated, otherwise it would be UB.  Makes sense, yes.<br></blockquote><div><br></div><div>I'm afraid it's worse than that.</div><div><br></div><div>void caller() {</div><div>  int *ptr = malloc(sizeof(int));</div><div>  callee(ptr);</div><div>  free(ptr);<br>}</div><div><br></div><div>void callee(int *ptr) {</div><div>  if (...) {</div><div>    free(ptr);</div><div>    log("get critical log message out to the humans, maybe my final message will be the last hint they need to finally resolve the bugs inside myself");</div><div>  }</div><div>}<br></div><div><br></div><div>C and C++ do permit undefined behaviour to have interactions backwards in time. In essence, knowing that UB must happen later can cause impossible things to happen now. However, LLVM offers an implementation-defined guarantee that no UB occurs until the earliest instruction where it occurs.</div><div><br></div><div>Your reasoning that we can't have a free in the callee because we'd hit a double-free (and hence UB) in the caller is not sufficient to perform heap-to-stack transform with that guarantee, because it will move the UB sooner to the free() in the callee. That violates our implementation guarantee that the log message will be emitted (because we'll have already entered UB-land).</div><div><br></div><div>The alternatives are to define double-free as not entering full UB (similar to poison, but we also have problems to solve with load, store, call, branch on value computed through signed overflow, etc.), or to remove the guarantee that the log message will be emitted.</div><div><br></div><div>Nick<br></div></div></div></div>