<div dir="ltr"><div class="gmail_quote"><div dir="ltr">On Wed, Aug 9, 2017 at 9:51 PM Hal Finkel <<a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div bgcolor="#FFFFFF" text="#000000">

    <p><br>

    </p>

    <div class="m_5578066034655955596moz-cite-prefix">On 08/09/2017 11:03 PM, Chandler

      Carruth wrote:<br>

    </div>

    <blockquote type="cite">

      <div dir="ltr">Hal already answered much of this, just continuing

        this part of the discussion...

        <div><br>

          <div class="gmail_quote">

            <div dir="ltr">On Wed, Aug 9, 2017 at 8:56 PM Xinliang David

              Li via llvm-commits <<a href="mailto:llvm-commits@lists.llvm.org" target="_blank">llvm-commits@lists.llvm.org</a>>

              wrote:<br>

            </div>

            <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

              <div dir="ltr">

                <div class="gmail_extra">

                  <div class="gmail_quote">On Wed, Aug 9, 2017 at 8:37

                    PM, Hal Finkel <span dir="ltr"><<a href="mailto:hfinkel@anl.gov" target="_blank">hfinkel@anl.gov</a>></span>

                    wrote:<br>

                    <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

                      <div bgcolor="#FFFFFF" text="#000000"><span>

                          <p><br>

                          </p>

                          <div class="m_5578066034655955596m_-2021915036012865282m_-1753555538924036895moz-cite-prefix">On

                            08/09/2017 10:14 PM, Xinliang David Li via

                            llvm-commits wrote:<br>

                          </div>

                          <blockquote type="cite">

                            <div dir="ltr">

                              <div class="gmail_extra">

                                <div class="gmail_quote">

                                  <div> Can you elaborate here too? If

                                    there were missed optimization that

                                    later got fixed, there should be

                                    regression tests for them, right? 

                                    And what information is missing?</div>

                                </div>

                              </div>

                            </div>

                          </blockquote>

                          <br>

                        </span> To make a general statement, if we load

                        (a, i8) and (a+2, i16), for example, and these

                        came from some structure, we've lost the

                        information that the load (a+1, i8) would have

                        been legal (i.e. is known to be deferenceable).

                        This is not specific to bit fields, but the fact

                        that we lose information on the dereferenceable

                        byte ranges around memory access turns into a

                        problem when we later can't legally widen. There

                        may be a better way to keep this information

                        other than producing wide loads (which is an

                        imperfect mechanism, especially the way we do it

                        by restricting to legal integer types),</div>

                    </blockquote>

                  </div>

                </div>

              </div>

            </blockquote>

            <div><br>

            </div>

            <div>I don't think we have such a restriction? Maybe I'm

              missing something. When I originally added this logic, it

              definitely was not restricted to legal integer types.</div>

          </div>

        </div>

      </div>

    </blockquote>

    <br></div><div bgcolor="#FFFFFF" text="#000000">

    I believe you're right for bitfields. For general structures,

    however, we certainly load individual fields instead of loading the

    whole structure with some wide integer in order to preserve

    dereferenceability information.</div></blockquote><div><br></div><div>I don't believe structures provide that information. See below.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div bgcolor="#FFFFFF" text="#000000"><br>

    <br>

    <blockquote type="cite">

      <div dir="ltr">

        <div>

          <div class="gmail_quote">

            <div><br>

            </div>

            <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

              <div dir="ltr">

                <div class="gmail_extra">

                  <div class="gmail_quote">

                    <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

                      <div bgcolor="#FFFFFF" text="#000000"> but at the

                        moment, we don't have anything better.<br>

                      </div>

                    </blockquote>

                    <div><br>

                    </div>

                  </div>

                </div>

              </div>

              <div dir="ltr">

                <div class="gmail_extra">

                  <div class="gmail_quote">

                    <div>Ok, as you mentioned, widening looks like a

                      workaround to paper over the weakness in IR to

                      annotate the information.  More importantly, my

                      question is whether this is a just theoretical

                      concern.</div>

                  </div>

                </div>

              </div>

            </blockquote>

            <div><br>

            </div>

            <div>I really disagree with this being a workaround.</div>

            <div><br>

            </div>

            <div>I think it is very fundamentally the correct model --

              the semantics are that this is a single, wide memory

              operation that a narrow data type is extracted from.</div>

          </div>

        </div>

      </div>

    </blockquote>

    <br></div><div bgcolor="#FFFFFF" text="#000000">

    That is one option. We do need to preserve this information (maybe

    we can do this with TBAA, or similar, or maybe using some other

    mechanism entirely). However, we do try harder to do this with

    bitfields than with other aggregates. If I have struct { int a, b,

    c, d; } S; and I load S.d, we don't do this by loading a 128-bit

    integer and then extracting some part of it. Should we? Probably

    not.</div></blockquote><div><br></div><div>We cannot, it isn't allowed (I'm pretty sure...)</div><div><br></div><div>1) It violates C++ (and C) memory model -- another thread could be writing to the other variables.</div><div><br></div><div>2) Related to #1, there are applications that rely on this memory model, for example structures where entire regions of the structure live in protected pages and cannot be correctly accessed.</div><div><br></div><div>3) Again related to #1, there are applications that rely on the memory model when doing memory-mapped IO to avoid reading or writing regions that are being updated by the OS or other processes.</div><div><br></div><div>Bitfields are the only place where we have specific license to widen access in the C++ memory model (that I'm aware of)....</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div bgcolor="#FFFFFF" text="#000000"> I suspect having better support for aggregate memory access

    would be a better solution. Or, as noted, using metadata or some

    other secondary mechanism.<br></div></blockquote><div><br></div><div>FWIW, I actually agree that if we want to do more of this, we would be better served by a different IR, but I strongly suspect it would look more like first class aggregates rather than metadata so that we could reason about it more fundamentally in terms of SSA.</div><div><br></div><div>But bitfields are (IMO) an importantly different problem in that they are mergeable in interesting and important ways due to being integers and often times sub-byte integers. This is why a single large integer combined with late narrowing seems like a particularly desirable way to represent the fundamental information of the semantic constraints of the program.</div><div><br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div bgcolor="#FFFFFF" text="#000000">Maybe more aggressively preserving this information for bit fields

    is the right answer, empirically. I can believe that's true. The

    more-general problem still exists, however.</div></blockquote><div><br></div><div>For other languages / semantics, yes. Increasingly I think a (better designed / integrated / spec'ed, etc) system like FCAs would work particularly well at making this easy to express and reason about. But it would be a pretty significant change.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div bgcolor="#FFFFFF" text="#000000"><br>The thing that appeals to me about the IR-transformation approach is

    the ability to handle "hand coded" bit fields as effectively as

    language-level bit fields. I've certainly seen my share of these,

    and they're definitely important. Moreover, this is true regardless

    of what we think about the underlying optimal model for preserving

    aggregate derefereceability in general.<br></div></blockquote><div><br></div><div>Completely agree. Teaching LLVM to handle wide integer accesses will be beneficial no matter what decisions are made here.</div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div bgcolor="#FFFFFF" text="#000000">

  </div></blockquote></div></div>