<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr">
<div id="divtagdefaultwrapper" style="font-size:12pt;color:#000000;font-family:Calibri,Helvetica,sans-serif;" dir="ltr">
<p style="margin-top:0;margin-bottom:0">Makes sense to me. Different namespace means ODR issues generally go away.</p>
<p style="margin-top:0;margin-bottom:0"><br>
</p>
<p style="margin-top:0;margin-bottom:0">For atomic specifically, it may make sense, long term, to standardize something along the lines of a shared_memory_atomic<T>. On some platforms, that may just be an alias of atomic<T>, but that would allow coexistence
without as severe of an ABI break.</p>
<br>
<br>
<div style="color: rgb(0, 0, 0);">
<hr style="display:inline-block;width:98%" tabindex="-1">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" style="font-size:11pt" color="#000000"><b>From:</b> Olivier Giroux <OGiroux@nvidia.com><br>
<b>Sent:</b> Tuesday, January 1, 2019 10:19 AM<br>
<b>To:</b> Ben Craig; libcxx-dev@lists.llvm.org<br>
<b>Subject:</b> Re: [libcxx-dev] proposal: decoupling Freestanding atomic<T> from libatomic.a</font>
<div> </div>
</div>
<div>
<div>
<div>
<div>
<div style="direction:ltr">Summarizing my earlier mail about our intentions:</div>
<div style="direction:ltr">a. Device-only isn’t interesting. It’s got to be host+device or it doesn’t do enough to help programmers write the programs they want to write.</div>
<div style="direction:ltr">b. We will create a Freestanding C++ library that conforms in every way except the namespace will be decorated. It won’t be std::, it might be cuda::std::, so you can still use your Hosted library as you used to. We’ll verify that
ODR issues are addressed by this and other engineering we’ll do.</div>
<div style="direction:ltr">c. Someday, in the far future when this has worked out, we can talk about Enabling our paths for Hosted and/or in std::, but IMHO that requires the nuclear option of creating new ABI triples like « x86-64-cuda-linux » only compatible
with « x86-64-linux » at the C boundary. It’s not for 2019 to say the least, except maybe as a toy.</div>
<div><br>
</div>
<div style="direction:ltr">Finally, I’m volunteering (myself and others) to contribute maintenance and enhancements but our delivery will have some ugly bits we don’t upstream. I’m trying to be most useful to upstream though, as possible.</div>
<div><br>
</div>
<div style="direction:ltr">Happy New Year,</div>
<div><br>
</div>
<div style="direction:ltr">Olivier</div>
<div><br>
</div>
<div><br>
</div>
</div>
<div><br>
</div>
<div class="x_ms-outlook-ios-signature"></div>
</div>
<div> </div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="x_divRplyFwdMsg" dir="dir="ltr""><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> Ben Craig <ben.craig@ni.com><br>
<b>Sent:</b> Tuesday, January 1, 2019 10:55 AM<br>
<b>To:</b> Olivier Giroux; libcxx-dev@lists.llvm.org<br>
<b>Subject:</b> Re: [libcxx-dev] proposal: decoupling Freestanding atomic from libatomic.a
<div> </div>
</font></div>
<meta content="text/html; charset=Windows-1252">
<div id="x_divtagdefaultwrapper" dir="ltr" style="font-size:12pt; color:#000000; font-family:Calibri,Helvetica,sans-serif">
<p style="margin-top:0; margin-bottom:0">Are you hoping to enable this just on the device side, or also on the host side? If it is just on the device side, then the ABI is yours to choose and/or break. If it's on the host side, that gets trickier. I'm also
wondering if you are trying to make it possible for the layout of std::atomic to match between host and device to enable cross domain synchronization.</p>
<p style="margin-top:0; margin-bottom:0"><br>
</p>
<p style="margin-top:0; margin-bottom:0">If your work extends to the host side of things, then it seems like you are falling into the camp that wants to mix freestanding and hosted in the same application, and that feels like an ODR trap waiting to spring.</p>
<br>
<br>
<div style="color:rgb(0,0,0)">
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="x_divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> Olivier Giroux <OGiroux@nvidia.com><br>
<b>Sent:</b> Monday, December 31, 2018 8:53 PM<br>
<b>To:</b> Ben Craig; libcxx-dev@lists.llvm.org<br>
<b>Subject:</b> Re: [libcxx-dev] proposal: decoupling Freestanding atomic<T> from libatomic.a</font>
<div> </div>
</div>
<div lang="EN-US">
<div class="x_x_WordSection1">
<p class="x_x_MsoNormal">The best reason I can see to layer libcxx atomics on the C11 atomics is that there’s probably no other way to ensure interoperability between C1* and C++1* for toolchains that want to do that. I agree with your assessment that a toolchain
using embeddeds lock for C1* _Atomic(T) will just work with libcxx – I haven’t any doubt.</p>
<p class="x_x_MsoNormal"> </p>
<p class="x_x_MsoNormal">There are at least two issues RE: _Atomic(T) where my situation is concerned:</p>
<ol start="1" type="1" style="margin-top:0in">
<li class="x_x_MsoListParagraph" style="margin-left:0in">We’re not interested in implementing C1* at the moment, and it’s not actually a requirement to implement C++1* either. Hence we’re more interested in the __atomic_* backend than the __c11_atomic_* backend
to libcxx atomic.
<ol start="1" type="a" style="margin-top:0in">
<li class="x_x_MsoListParagraph" style="margin-left:0in">This is a great reason for libcxx to retain this backend: implementations of C++ that don’t depend on C.</li></ol>
</li><li class="x_x_MsoListParagraph" style="margin-left:0in">We don’t control what host CPU compilers will do for _Atomic(T) and even if we did, we’re locked into platform ABI choices for host CPUs that have largely shied from embedded locks (except on Windows)
and we’d still have to match.
<ol start="1" type="a" style="margin-top:0in">
<li class="x_x_MsoListParagraph" style="margin-left:0in">Given our C++ centric focus, it’s much simpler to address this in the library layer.</li></ol>
</li></ol>
<p class="x_x_MsoNormal"> </p>
<p class="x_x_MsoNormal">This being said, my proposed change makes it possible to turn this path on or off regardless of whether the __c11_atomic_* or the __atomic_* backend is used. At the moment I’m poposing to turn it on merely because LIBCXX_FREESTANDING
is set, but we could be more selective than that, it could be part of what __config selects when in a Freestanding configuration. We do need to be careful that there exist at least one public configuration that uses it; I don’t want to put this path in there
and only turn it on in a private configuration.</p>
<p class="x_x_MsoNormal"> </p>
<p class="x_x_MsoNormal">Olivier</p>
<p class="x_x_MsoNormal"> </p>
<div style="border:none; border-top:solid #B5C4DF 1.0pt; padding:3.0pt 0in 0in 0in">
<p class="x_x_MsoNormal"><b><span style="font-size:12.0pt; color:black">From: </span>
</b><span style="font-size:12.0pt; color:black">Ben Craig <ben.craig@ni.com><br>
<b>Date: </b>Monday, December 31, 2018 at 6:16 PM<br>
<b>To: </b>Olivier Giroux <OGiroux@nvidia.com>, "libcxx-dev@lists.llvm.org" <libcxx-dev@lists.llvm.org><br>
<b>Subject: </b>Re: [libcxx-dev] proposal: decoupling Freestanding atomic<T> from libatomic.a</span></p>
</div>
<div>
<p class="x_x_MsoNormal"> </p>
</div>
<div id="x_x_divtagdefaultwrapper">
<p><span style="font-size:12.0pt; color:black">I think the most robust solution is for your compiler to implement C11 _Atomic, and for that type to be sufficiently large to hold your value and your spin lock. I think that most of the libcxx implementation
will "just work" at that point, although there are possibly some hidden bugs where something mistakenly assumes that sizeof(T) = = sizeof(_Atomic(T)).</span></p>
<p><span style="font-size:12.0pt; color:black"> </span></p>
<p><span style="font-size:12.0pt; color:black">Making the compiler generate _Atomic correctly also gives you a lot of C compatibility.</span></p>
<p><span style="font-size:12.0pt; color:black"> </span></p>
<p><span style="font-size:12.0pt; color:black">If C11 compatibility isn't a big concern, and / or getting that feature into the compilers that you need isn't an option, then your approach sounds fine to me. I don't make the decisions around here though :)</span></p>
<p><span style="font-size:12.0pt; color:black"> </span></p>
<p><span style="font-size:12.0pt; color:black">For the binary compatibility concerns... I don't have great answers. On the one hand, I want to believe that the choice of freestanding or hosted is a platform decision, and that you shouldn't ever mix freestanding
object files with hosted object files. On the other hand, an awful lot of people that aren't doing embedded, drivers, or GPU programming seem interested in freestanding, and I'm afraid that they may very well want to build one chunk of code with a freestanding
flag as a style checker / optimizer, and another part of their code with hosted. I'm not sure if it make sense to try and support that second group.</span></p>
<p><span style="font-size:12.0pt; color:black"> </span></p>
<p><span style="font-size:12.0pt; color:black">Another binary compatibility note... my search skills are failing me, but I recall seeing some blog post or documentation that basically states that, when in doubt, you should generate a library call. You can
migrate from a library call to inline code over time, but you can't migrate the other direction without breaking compatibility. I don't think that affects the specific design questions you had though.</span></p>
</div>
</div>
<div>
<hr>
</div>
<div>This email message is for the sole use of the intended recipient(s) and may contain confidential information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by
reply email and destroy all copies of the original message. </div>
<div>
<hr>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</body>
</html>