<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><div class="">On Jun 11, 2019, at 6:27 AM, Guillaume Chatelet <<a href="mailto:gchatelet@google.com" class="">gchatelet@google.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class="">I spent some time reading the <a href="https://web.archive.org/web/20181230041359if_/http://www.open-std.org/jtc1/sc22/wg14/www/abq/c17_updated_proposed_fdis.pdf" class="">C standard</a>:<div class=""><br class=""></div><blockquote style="margin:0 0 0 40px;border:none;padding:0px" class=""><div class="">5.1.2.3 Program execution</div><div class="">2. Accessing a volatile object, modifying an object, modifying a file, or calling a function that does any
of those operations are all side effects, which are changes in the state of the execution environment...</div></blockquote><div class=""><br class=""></div><blockquote style="margin:0 0 0 40px;border:none;padding:0px" class=""><div class="">6.7.3 Type qualifiers</div><div class="">8. An object that has volatile-qualified type may be modified in ways unknown to the implementation
or have other unknown side effects. Therefore any expression referring to such an object shall be
evaluated strictly according to the rules of the abstract machine, as described in 5.1.2.3. Furthermore,
at every sequence point the value last stored in the object shall agree with that prescribed by the
abstract machine, except as modified by the unknown factors mentioned previously. What
constitutes an access to an object that has volatile-qualified type is implementation-defined.</div></blockquote><div class=""><br class=""></div><div class=""><div class="">My intuition is that it's unreasonable to do the overlap because it may create side effects in the middle of the read. At the same time, "accessing a volatile object" is implementation defined...</div><div class="">As such "volatile" does not provide a lot of guarantees. I guess you have to resort on assembly if you want to really control what's going on.<br class=""></div><div class=""><br class=""></div><div class="">gcc, icc, msvc do not implement the overlap trick. They either read the memory region sequentially or use rep movs.</div><div class="">Let's note that rep movs does not provide any guarantees on the memory access patterns either.</div><div class=""></div><div class=""></div></div><div class=""><br class=""><div class="">Besides I'm not even sure that volatile in the context of @llvm.memcpy intrinsics really relates to the C volatile semantic.</div></div><div class=""><br class=""></div><div class="">Now, I see several ways to move forward:</div><div class="">1. Do nothing and call it implementation defined</div><div class="">2. Fix the code that generates the loads/stores for the isVolatile case so no overlap can occur.</div><div class="">3. Remove "volatile" argument from @llvm.memcpy, @llvm.memmove and @llvm.memset and generates the code in the front end using volatile load/stores.</div><div class=""><br class=""></div><div class=""></div><div class="">3 is probably controversial and involves a lot a changes, it would move some complexity from backend to frontends.</div><div class=""><br class=""></div><div class="">I'm interested in thoughts from other developers here.</div></div></div></blockquote><div><br class=""></div><div>Volatile isn’t specified in any decent normative way. There’s substantial documentation of intent, but C and C++ standards are lacking. We should therefore implement something that’s sensible and works kind-of as expected.</div><div><br class=""></div><div>I’ve documented specification and intent here: <a href="http://wg21.link/P1152R0" class="">wg21.link/P1152R0</a></div><div>If you want to follow-on (without all that documentation): <a href="http://wg21.link/P1152R1" class="">wg21.link/P1152R1</a></div><div><br class=""></div><div>I think we want option 2.: keep volatile memcpy, and implement it as touching each byte exactly once. That’s unlikely to be particularly useful for every direct-to-hardware uses, but it behaves intuitively enough that I think it’s desirable.</div><div><br class=""></div><br class=""><blockquote type="cite" class=""><div class=""><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Jun 10, 2019 at 10:51 PM John Regehr via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org" class="">llvm-dev@lists.llvm.org</a>> wrote:<br class=""></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">I agree, this is a bug.<br class="">
<br class="">
John<br class="">
<br class="">
<br class="">
On 6/7/19 11:48 AM, JF Bastien via llvm-dev wrote:<br class="">
> <br class="">
> <br class="">
>> On Jun 5, 2019, at 2:28 PM, Tim Northover via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class="">llvm-dev@lists.llvm.org</a>> wrote:<br class="">
>><br class="">
>> On Wed, 5 Jun 2019 at 13:49, Eli Friedman via llvm-dev<br class="">
>> <<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class="">llvm-dev@lists.llvm.org</a>> wrote:<br class="">
>>> I don’t see any particular reason to guarantee that a volatile memcpy will access each byte exactly once. How is that useful?<br class="">
>><br class="">
>> I agree it's probably not that useful, but I think the non-duplicating<br class="">
>> property of volatile is ingrained strongly enough that viewing a<br class="">
>> memcpy as a single load and store to each unit (in an unspecified<br class="">
>> order) should be legitimate; so I think this actually is a bug.<br class="">
>><br class="">
>> As the documentation says though, it's unwise to depend on the<br class="">
>> behaviour of a volatile memcpy.<br class="">
> <br class="">
> I agree with Tim, this seems like a bug. My expectation is that volatile touch each memory location exactly once, unless absolutely impossible (e.g. bitfields on most ISAs).<br class="">
> _______________________________________________<br class="">
> LLVM Developers mailing list<br class="">
> <a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class="">llvm-dev@lists.llvm.org</a><br class="">
> <a href="https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank" class="">https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br class="">
> <br class="">
_______________________________________________<br class="">
LLVM Developers mailing list<br class="">
<a href="mailto:llvm-dev@lists.llvm.org" target="_blank" class="">llvm-dev@lists.llvm.org</a><br class="">
<a href="https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank" class="">https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br class="">
</blockquote></div>
</div></blockquote></div><br class=""></body></html>