<div dir="ltr">It's hard to say.<div>What you've described sounds close to a neutral type system implemented in metadata.</div><div>In particular, ". It also defines a set of language-neutral formal rules that LLVM codegen follows to determine whether a given pair of accesses are allowed to overlap by rules of the input language. "</div><div>and "the base type followed by field descriptors"</div><div>etc</div><div><br></div><div>Despite the name, our current TBAA does not require or represent types.  It represents a translation of language access rules into a language of hierarchical sets, that are represented by a  tree with weighted edges.</div><div><br></div><div>If you are actually attempting to represent neutral types, certainly, the approach can work, but probably not represent exact semantics for all languages.</div><div><br></div><div>Most LLVM metadata also tries to avoid understanding the language, instead modeling the effects.</div><div><br></div><div>For example, it's unlikely we'd use metadata to say "This is a struct field access to a, and this is one to b" and use that in analysis.  Because it requires the semantics be at the LLVM level, and understand something about the language.</div><div><br></div><div>Instead, we'd usually say "this is an access to offset 0 of memory, with size 4, and this is an  access to offset 4 of memory, with size 8", with the with the semantic that accesses tagged in such a manner can only overlap if the offset, size ranges overlap.  That semantic is language independent.</div><div><br></div><div>But again, this is all very theoretical. I'd be very interested to see what you came up with.</div><div><br></div><div><br></div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Aug 14, 2017 at 10:10 AM, Ivan A. Kosarev <span dir="ltr"><<a href="mailto:ivan@kosarev.info" target="_blank">ivan@kosarev.info</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

  <div text="#000000" bgcolor="#FFFFFF">

    Sure, I will provide those. I just wanted to make sure this doesn't

    sound like what you know will not work for some reasons I'm not

    aware of.<div><div class="h5"><br>

    <br>

    <div class="m_-8445920500341695595moz-cite-prefix">On 14/08/17 20:04, Daniel Berlin wrote:<br>

    </div>

    <blockquote type="cite">

      <div dir="ltr">Do you have a formal description of your approach

        with examples?

        <div>I have a bit of trouble visualizing exactly what your

          approach does.</div>

      </div>

      <div class="gmail_extra"><br>

        <div class="gmail_quote">On Mon, Aug 14, 2017 at 9:58 AM, Ivan

          A. Kosarev via llvm-dev <span dir="ltr"><<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>></span>

          wrote:<br>

          <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hello

            Steven, Hal and Daniel,<br>

            <br>

            Thanks a lot for your discussion; it really helps with

            summarizing current TBAA issues and ways to resolve them.<br>

            <br>

            Do you guys know anything of the current status of the

            proposed change? Steven, will you please let us know if the

            work is in progress and if there is any ETA you can share?<br>

            <br>

            I'm asking because we are working on an alternative approach

            that not only supports accesses to union members, bit

            fields, fields of aggregate and union types, but also allows

            to represent accesses to aggregates and unions the same way

            we do it for scalars so that !tbaa.struct is replaced with

            plain !tbaa, meaning TBAA information can be propagated

            uniformly regardless of types of accessed objects. As a

            consequence, it supports identification of user types

            defined in different translation units, even if some of them

            are written in C and others are in C++. It also defines a

            set of language-neutral formal rules that LLVM codegen

            follows to determine whether a given pair of accesses are

            allowed to overlap by rules of the input language. As of

            today, we know this implementation covers all currently

            supported TBAA functionality reflected in the test suites

            and to test the new functionality we have SROA improved to

            preserve TBAA information.<br>

            <br>

            The point is, our approach does not try to describe accesses

            as (type, offset) pairs and instead represents access

            sequences explicitly beginning from the base type followed

            by field descriptors, which is what makes the approach so

            flexible. TypeBasedAAResult::Aliases() and

            MDNode::getMostGenericTBAA() are a bit more complex than

            they used to be (they actually use the same internal

            function), but rely exclusively on linear scans of access

            sequences unless we have a situation when have to check if

            one of the accessed types is the type of a member of the

            other one, in which case it seems we just have to traverse

            through fields recursively no matter what.<br>

            <br>

            So, I wonder if this or similar approaches have ever been

            considered before and what are the cons, if there are any

            sounded. Do you think it is worth to consider it now?<br>

            <br>

            Thanks again,<span class="m_-8445920500341695595HOEnZb"><font color="#888888"><br>

                <br>

                -- <br>

              </font></span>

            <div class="m_-8445920500341695595HOEnZb">

              <div class="m_-8445920500341695595h5">

                <br>

                ______________________________<wbr>_________________<br>

                LLVM Developers mailing list<br>

                <a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a><br>

                <a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/<wbr>mailman/listinfo/llvm-dev</a><br>

              </div>

            </div>

          </blockquote>

        </div>

        <br>

      </div>

    </blockquote>

    <br>

  </div></div></div>

</blockquote></div><br></div>