<html>

  <head>

    <meta http-equiv="content-type" content="text/html; charset=utf-8">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    <br>

    <div class="moz-cite-prefix">On 1/6/2015 1:01 PM, Chandler Carruth

      wrote:<br>

    </div>

    <blockquote class=" cite"

id="mid_CAGCO0KjE2tyiAG_bCoELJgK6MpM4B4nQHaYyT__i5W33BdZ8jA_mail_gmail_com"

cite="mid:CAGCO0KjE2tyiAG=bCoELJgK6MpM4B4nQHaYyT=-i5W33BdZ8jA@mail.gmail.com"

      type="cite">

      <meta http-equiv="Content-Type" content="text/html; charset=utf-8">

      <div dir="ltr">

        <div class="gmail_extra"><br>

          <div class="gmail_quote">On Mon, Jan 5, 2015 at 10:51 PM, Owen

            Anderson <span dir="ltr"><<a moz-do-not-send="true"

                href="mailto:resistor@mac.com" target="_blank">resistor@mac.com</a>></span>

            wrote:<br>

            <blockquote id="Cite_2050134" class="gmail_quote cite"

              style="margin:0 0 0 .8ex;border-left:1px #ccc

              solid;padding-left:1ex">

              <div id=":619" class="a3s" style="overflow:hidden">Hi

                Sameer,<br>

                <span class=""><br>

                  > On Jan 5, 2015, at 4:51 AM, Sahasrabuddhe, Sameer

                  <<a moz-do-not-send="true"

                    href="mailto:Sameer.Sahasrabuddhe@amd.com">Sameer.Sahasrabuddhe@amd.com</a>>

                  wrote:<br>

                  ><br>

                  > Right. The second version of my patches fixes the

                  bitcode encoding. But now I see another potential

                  problem with future bitcode if we require an ordering

                  on the scopes. What happens when a backend later

                  introduces a new scope that goes into the middle of

                  the order? If they renumber the scopes to accomodate

                  this, then existing bitcode for that backend will no

                  longer work. The bitcode reader/writer cannot

                  compensate for this since the values are

                  backend-specific. If we agree that this problem is

                  real, then we cannot force an ordering on the scope

                  numbers.<br>

                  <br>

                </span>That’s an interesting consideration, and

                something I hadn’t thought of.  I’m unsure offhand of

                how much it matters in practice.  The alternative, I

                suppose, is having something like string-named scopes,

                but then we can’t do much with them at the IR level.<br>

              </div>

            </blockquote>

            <div><br>

            </div>

            <div>This has me somewhat non-plussed as well.</div>

          </div>

        </div>

      </div>

    </blockquote>

    <br>

    That really depends on what we want to do at the IR level. Scopes do

    not affect transformations that move non-atomic accesses around

    atomic accesses. The scope on the atomic access should not matter to

    the non-atomic accesses. The interesting case is when the compiler

    tries to optimize atomic accesses with respect to each other, and

    their scopes do not match. But it might be sufficient to leave such

    transformations to the target, since quite possibly, other

    target-specific information might be necessary to make them work or

    even to say whether they are beneficial. <br>

    <br>

    <blockquote class=" cite"

id="mid_CAGCO0KjE2tyiAG_bCoELJgK6MpM4B4nQHaYyT__i5W33BdZ8jA_mail_gmail_com"

cite="mid:CAGCO0KjE2tyiAG=bCoELJgK6MpM4B4nQHaYyT=-i5W33BdZ8jA@mail.gmail.com"

      type="cite">

      <div dir="ltr">

        <div class="gmail_extra">

          <div class="gmail_quote">

            <div> </div>

            <blockquote id="Cite_9202132" class="gmail_quote cite"

              style="margin:0 0 0 .8ex;border-left:1px #ccc

              solid;padding-left:1ex">

              <div id=":619" class="a3s" style="overflow:hidden">

                <span class=""><br>

                  > So far, I have refrained from proposing a keyword

                  for cross thread scope in the text format, because (a)

                  there never was one and (b) it is not strictly needed

                  since it is the default anyway. I am fine either way,

                  but we will first have to decide what the new keyword

                  should be. I find "allthreads" to be a decent

                  counterpart for "singlethread" ... "crossthread" is

                  not good enough since intermediate scopes have

                  multiple threads too.<br>

                  <br>

                </span>This actually raises another question.  In

                principle, the “most visible” scope ought to be

                something like “system” or “device”, meaning a

                completely uncached memory access that is visible to all

                peripherals in a heterogeneous system.  However, this is

                almost certainly not what we want to have for typical

                memory accesses.<br>

                <br>

                To summarize, a prototypical scope nest, from most to

                least visible (aka least to most cacheable) might look

                like:<br>

                <br>

                System  —>  AllThreads  —>  Various

                target-specific local scopes —> SingleThread<br>

                <br>

                If we wanted to go really gonzo, there could be a

                Network scope at the beginning for large-scale HPC

                systems, but I’m not sure how important that is to

                anyone.<br>

              </div>

            </blockquote>

            <div><br>

            </div>

            <div>I probably *should* be in a position to be very

              interested in such a concept.... but honestly, I'm not. If

              I ever wanted to do something like this, I would just

              define the large-scale HPC system as the "system" and a

              single machine/node as some "local" scope.</div>

          </div>

        </div>

      </div>

    </blockquote>

    <br>

    I agree. The most accurate description of the highest scope is

    "address space scope", i.e., all threads that can access the address

    space being accessed. From that view, it does not matter if the

    threads are local, remote or situated on different devices, or such.

    It makes sense to not specify any keyword for this scope, and just

    say that "synchscope(0)" is default and need not be specified. Any

    other scope is an explicit optimization over a narrower set of

    threads.<br>

    <br>

    <blockquote class=" cite"

id="mid_CAGCO0KjE2tyiAG_bCoELJgK6MpM4B4nQHaYyT__i5W33BdZ8jA_mail_gmail_com"

cite="mid:CAGCO0KjE2tyiAG=bCoELJgK6MpM4B4nQHaYyT=-i5W33BdZ8jA@mail.gmail.com"

      type="cite">

      <div dir="ltr">

        <div class="gmail_extra">

          <div class="gmail_quote">

            <div> </div>

            <blockquote id="Cite_6388414" class="gmail_quote cite"

              style="margin:0 0 0 .8ex;border-left:1px #ccc

              solid;padding-left:1ex">

              <div id=":619" class="a3s" style="overflow:hidden">

                As a related question, do we actually need the local

                scopes to be target specific?  Are there systems, real

                or planned, that *aren’t* captured by:<br>

                <br>

                [Network —> ] System  —>  AllThreads  —> 

                ThreadGroup —> SingleThread ?</div>

            </blockquote>

          </div>

          <br>

          Sadly, I don't think this will work. In particular, there are

          real-world accelerators with multiple tiers of thread groups

          that are visible in the cache hierarchy subsystem.</div>

      </div>

    </blockquote>

    <br>

    The HSAIL 1.0 provisional spec has the following scopes: workitem,

    wavefront, workgroup, component, system. A component is anything

    that supports the HSAIL instruction set and can execute commands

    dispatched to it. I am not an authority on this, but to me, it is

    conceivable that there could be other scopes later, analogous to

    things such as one "die" or one "chip" or one "board" or one node in

    a cloud.<br>

    <br>

    <blockquote class=" cite"

id="mid_CAGCO0KjE2tyiAG_bCoELJgK6MpM4B4nQHaYyT__i5W33BdZ8jA_mail_gmail_com"

cite="mid:CAGCO0KjE2tyiAG=bCoELJgK6MpM4B4nQHaYyT=-i5W33BdZ8jA@mail.gmail.com"

      type="cite">

      <div dir="ltr">

        <div class="gmail_extra"><br>

        </div>

        <div class="gmail_extra">I'm starting to think we might actually

          need to let the target define acceptable strings for memory

          scopes and a strict weak ordering over them.... That's really

          complex and heavy weight, but I'm not really confident that

          we're safe committing to something more limited. The good side

          is that we can add the SWO-stuff lazily as needed...</div>

        <div class="gmail_extra"><br>

        </div>

        <div class="gmail_extra">Dunno, thoughts?</div>

      </div>

    </blockquote>

    <br>

    Just the thought of using strings in the IR smells like over-design

    to me. Going back to the original point, are target-independent

    optimizations on scoped atomic operations really so attractive? <br>

    <br>

    But while the topic is wide open, here's another possibly whacky

    approach: we let the scopes be integers, and add a "scope layout"

    string similar to data-layout. The string encodes the ordering of

    the integers. If it is empty, then simple numerical comparisons are

    sufficient. Else the string spells out the exact ordering to be

    used. Any known current target will be happy with the first option.

    If some target inserts an intermediate scope in the future, then

    that version switches from empty to a fully specified string. The

    best part is that we don't even need to do this right now, and only

    come up with a "scope layout" spec when we really hit the problem

    for some future target.<br>

    <br>

    Sameer.<br>

    <br>

  </body>

</html>