<div dir="ltr"><div dir="ltr">On Tue, Sep 24, 2019 at 4:06 PM Vitaly Buka <<a href="mailto:vitalybuka@google.com">vitalybuka@google.com</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div>For code line this:</div><div>void used(void *);<br>union unmatched { char c[2]; char i[7]; };<br>void test1() {<br> union unmatched U = {1};<br> used(&U);<br>}<br></div></div><div><br></div><div>tryEmitAbstractForInitializer already emits <br></div><div>@__const.test1.U = private unnamed_addr constant { [2 x i8], [5 x i8] } { [2 x i8] c"\01\00", [5 x i8] undef }, align 1<br></div><div><br></div><div>Then we replace all undefs with pattern.</div><div>So I believe I need to fix tryEmitAbstractForInitializer to emit zeroes. However it breaks a bunch of tests.</div><div><br></div><div>Does this make sense?</div></div></blockquote><div><br></div><div>Yes. We'll presumably need to track a bit on APValue's struct and union representations to model whether padding is zeroed or undefined. We already track on CXXConstructExpr whether the padding is zeroed, but don't preserve properly everywhere else.</div><div><br></div><div>This is <a href="http://llvm.org/PR11742">http://llvm.org/PR11742</a> by the way. =)</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Sep 23, 2019 at 4:10 PM Vitaly Buka <<a href="mailto:vitalybuka@google.com" target="_blank">vitalybuka@google.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Thanks.<div>Somehow I read the same text but interpreted it differently. Now it makes sense, it must be zeroed and trivial-auto-var-init needs improvement.<div>I'll will create a patch.<br><div><br></div></div></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Sep 23, 2019 at 2:36 PM JF Bastien <<a href="mailto:jfbastien@apple.com" target="_blank">jfbastien@apple.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><div>Hi Vitaly,</div><div><br></div>Indeed: there are cases where clang happened to honor the standard without intending to (through the magic of memcpy’ing globals), and auto-init=pattern broke this. I got a report about it a while ago and agree it’s an issue, I haven’t had time to fix it. It’s related to what C calls “designated initializers” to initialize structs. The relevant standardese from C17 are in 6.7.9 Initialization (the grammar has “designator” being either “ [ constant-expression ] ” or “ . identifier ”).<div><br>The relevant rules are:<br><br>“””<br>Except where explicitly stated otherwise, for the purposes of this subclause unnamed members of objects of structure and union type do not participate in initialization. Unnamed members of structure objects have indeterminate value even after initialization.<br><br>If an object that has automatic storage duration is not initialized explicitly, its value is indeterminate. If an object that has static or thread storage duration is not initialized explicitly, then:<br>— if it has pointer type, it is initialized to a null pointer;<br>— if it has arithmetic type, it is initialized to (positive or unsigned) zero;<br>— if it is an aggregate, every member is initialized (recursively) according to these rules, and any padding is initialized to zero bits;<br>— if it is a union, the first named member is initialized (recursively) according to these rules, and any padding is initialized to zero bits;<br><br>The initialization shall occur in initializer list order, each initializer provided for a particular subobject overriding any previously listed initializer for the same subobject; all subobjects that are not initialized explicitly shall be initialized implicitly the same as objects that have static storage duration.<br><br>If there are fewer initializers in a brace-enclosed list than there are elements or members of an aggregate, or fewer characters in a string literal used to initialize an array of known size than there are elements in the array, the remainder of the aggregate shall be initialized implicitly the same as objects that have static storage duration.<br>“””<br><br>I had fixed something in <a href="https://reviews.llvm.org/D61280" target="_blank">https://reviews.llvm.org/D61280</a> but it’s not sufficient.</div><div><br></div><div><br></div><div>I don’t think fixing this is giving up: it’s doing what the standard mandates.</div><div><br></div><div>Are you interested in fixing it?</div><div><br><div><br></div><div><br><blockquote type="cite"><div>On Sep 23, 2019, at 2:20 PM, Vitaly Buka <<a href="mailto:vitalybuka@google.com" target="_blank">vitalybuka@google.com</a>> wrote:</div><br><div><div dir="ltr"><div style="background-color:rgb(255,255,254)"><div>Hi everyone,</div><div><br></div><div>I am trying to enable -ftrivial-auto-var-init=pattern on various code and noticed inconvenient inconsistency with code like [1].</div><div><br></div><div>According standard, as I understand, only the first member of the union must be initialized and the tail of the second member can stay uninitialized. If we use -ftrivial-auto-var-init=pattern it fills the tail using our pattern. However by default GCC, clang (without -ftrivial-auto-var-init), msvc all initialize entire union with 0s. Not sure if it's just coincidence or guaranteed feature.</div><div><br>So -ftrivial-auto-var-init=pattern breaks such code. Especially bad if you don't know that U is a union and ={} looks good there.<br></div><div><br></div><div>Should we consider giving up here and using zeroes for union tails even with -ftrivial-auto-var-init=pattern?</div><div><br></div><div>1. Example:<br></div><div><span style="color:rgb(0,0,255)">union</span> U {</div><div> <span style="color:rgb(0,0,255)"> char</span> small[<span style="color:rgb(9,136,90)">2</span>];</div><div> <span style="color:rgb(0,0,255)"> char</span> large[<span style="color:rgb(9,136,90)">100</span>]; </div><div>};</div><div><span style="color:rgb(0,0,255)">void</span> f(<span style="color:rgb(0,0,255)">void</span>*);</div><div><span style="color:rgb(0,0,255)">void</span> test() {<br></div><div> <span style="color:rgb(0,0,255)"> union</span> U u = {};</div><div> f(&u);</div><div>}<br></div></div></div>
</div></blockquote></div><br></div></div></blockquote></div>
</blockquote></div>
</blockquote></div></div>