<html><head><meta http-equiv="Content-Type" content="text/html; charset=us-ascii"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><div class="">On May 8, 2018, at 9:55 AM, Adrian Prantl <<a href="mailto:aprantl@apple.com" class="">aprantl@apple.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div class=""><br class=""><br class=""><blockquote type="cite" class="">On May 7, 2018, at 11:29 PM, Jonas Paulsson via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org" class="">llvm-dev@lists.llvm.org</a>> wrote:<br class=""><br class="">Hi, (Resent with proper subject line)<br class=""><br class="">I recently found myself in trouble because the crash I had disappeared<br class="">with -g, so I could not debug the program. This happened because the<br class="">optimizer did not remember to consider DBG_VALUEs instruction so it<br class="">changed its behavior, and the bug went hiding.<br class=""></blockquote><br class="">[Here are some (very high-level) thoughts.]<br class=""><br class="">As you suspect, that's clearly a bug. LLVM's ideal behavior is that the presence of debug informations shall not affect the contents of the .text section. As you also noticed, we're not quite there yet.<br class=""><br class=""><blockquote type="cite" class=""><br class="">I then started discussing this on <a href="https://reviews.llvm.org/D45878" class="">https://reviews.llvm.org/D45878</a>, and<br class="">since this is something that should be handled by all different machine<br class="">passes, the immediate question is now what utility functions should be<br class="">made available for common use.<br class=""><br class="">1. A pass such as MachineSink.cpp must first of all remember to consider<br class="">DBG_VALUEs when iterating over instructions, so that it does not e.g.<br class="">"break if next instruction is not X" and therefore change its results<br class="">with -g.<br class=""></blockquote><br class="">We started a similar discussion at the LLVM IR level very recently. As a result of this we added new iterators to that make it easy to skip of debug intrinsics when doing analysis passes...<br class=""><br class=""><blockquote type="cite" class=""><br class="">2. At a second priority,<br class=""><br class="">a) it should handle DBG_VALUEs when moving / erasing MachineInstrs.<br class=""></blockquote><br class="">... but, skipping them during analysis is not enough, they also must move when the value they are describing moves.<br class=""></div></div></blockquote><div><br class=""></div>A starting point for diagnosing issues which arise when MI's aren't moved when needed might be a use-before-def verifier at the MIR level (related: <a href="https://reviews.llvm.org/D46100" class="">https://reviews.llvm.org/D46100</a>).</div><div><br class=""></div><div><br class=""><blockquote type="cite" class=""><div class=""><div class=""><blockquote type="cite" class="">b) it should " // Merge or erase debug location to ensure<br class="">consistent stepping in profilers and debuggers." (MachineSink.cpp).<br class=""><br class="">Well, this is in fact the first point here: I think this should be<br class="">documented somewhere by a comment so that it is clear exactly how these<br class="">things should be handled. My personal understanding right now is that 2<br class="">is not vital but should be done on a best-effort basis. 1 however is<br class="">really bad if it happens (which it currently does).<br class=""></blockquote><br class="">Loosing DBG_VALUEs is bad because local variables will become unavailable in the debugger. This is completely unacceptable when compiling unoptimized code. In optimized code the expectations are somewhat lower, but there is a lot of low-hanging fruit left to pick where LLVM clearly could do a better job of preserving DBG_VALUEs.<br class=""><br class="">Incorrect debug locations are also unacceptable in unoptimized code, since they will cause the debugger's single-stepping to behave unpredictably, thus destroying the debugging experience. In optimized code, the expectations are a bit lower, but *incorrect* debug locations are terrible since they will cause misleading crash traces and otherwise undebuggable code. Due to the nature of what an optimizing compiler does, debug information is not always salvageable, but the correct thing to do in this case is to remove the debug information (e.g., by inserting line: 0 locations) instead of providing incorrect or partially correct information.<br class=""><br class=""><blockquote type="cite" class=""><br class="">We are discussing where to place these utility functions, and if<br class="">splice() should optionally be made to do this also.</blockquote></div></div></blockquote><div><br class=""></div>This sounds like a good idea to me.</div><div><br class=""></div><div><br class=""><blockquote type="cite" class=""><div class=""><div class=""><blockquote type="cite" class=""> It seems that<br class="">collectDebugValues() in MachineSink.cpp is a good starting point, but we<br class="">probably want to do an even better search than just looking at the first<br class="">def operand.</blockquote></div></div></blockquote><div><br class=""></div><div>thanks,</div><div>vedant</div><div><br class=""></div><br class=""><blockquote type="cite" class=""><div class=""><div class=""><blockquote type="cite" class=""><br class="">It would be nice to get some general feedback from the community at this<br class="">point so we know which direction to take...<br class=""></blockquote><br class="">Thank you for starting this discussion! We should do what we can to make easy for pass authors to do the right thing. We should also do what we can to force pass authors to have to thing about debug information and how to update it correctly, even if that is annoying :-)<br class=""><br class="">Would creating a MIR version of the debugify IR pass be helpful in finding more of the existing bugs?<br class=""><br class="">-- adrian<br class=""><br class=""><blockquote type="cite" class="">/Jonas<br class=""><br class="">_______________________________________________<br class="">LLVM Developers mailing list<br class=""><a href="mailto:llvm-dev@lists.llvm.org" class="">llvm-dev@lists.llvm.org</a><br class="">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev<br class=""></blockquote><br class=""></div></div></blockquote></div><br class=""></body></html>