<div dir="ltr"><div class="gmail_quote"><div dir="ltr">On Wed, Sep 2, 2015 at 1:42 PM Sanjoy Das <<a href="mailto:sanjoy@playingwithpointers.com">sanjoy@playingwithpointers.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Chandler,<br>
<br>
Thanks for replying!<br>
<br>
> First, as I think Philip already said, I think it is important that a<br>
> readonly or a readnone attribute on a call is absolute. Optimizations<br>
> shouldn't have to go look for an operand bundle. Instead, we should prevent<br>
> the call-side attributes from being added.<br>
<br>
I think Philip's concern was more about the *difference* between the<br>
call side attributes and attributes on the function.<br>
<br>
Say you have<br>
<br>
define i32 @f() {<br>
ret i32 42<br>
}<br>
<br>
define void @g() {<br>
call i32 @f() [ "foo"(i32 100) ]<br>
ret void<br>
}<br>
<br>
Now I think we all agree that the call to `@f` cannot be marked as<br>
`readnone` if it is to have deopt semantics. We can (I suspect without too<br>
much churn) make sure LLVM does not take such a `call` and mark it as<br>
`readnone`.<br>
<br>
However, `-functionattrs` (and related passes) are still allowed to<br>
mark the *function* (`@f`) as `readnone`, and I think it would be very<br>
weird if we disallowed that (since we'll have to iterate through all<br>
of `@f`'s uses).<br>
<br>
This brings us to the weird situation where we can have a<br>
not-`readnone` call to a function that's marked `readnone`. This was<br>
Philip's concern -- the semantics of the call is no longer the most<br>
precise that can be deduced by looking at both the call and function<br>
attributes. We'd possibly have issues with passes that looked at the<br>
`CS.getCalledFunction()`'s attributes and decided to do an illegal<br>
reordering because the function was marked `readnone`.<br></blockquote><div><br></div><div>While I'm still mulling it over, I think that if we want something like operand bundles, we really need to move to the point where the *only* valid set of attributes to query, when trying to understand the semantics of a call instruction, is the call's own attributes. I actually like this model better. It cleanly separates two things: a particular call instruction's semantics are modeled by that call instruction's attribute set, and a particular function's semantics are modeled by *its* attribute set. Depending on the nature of the query, you should look at one or the other.</div><div><br></div><div>Historically, getting this wrong only manifested as missed optimizations. With the ability to add extra functionality to call instructions (outside of the called function), we inherently turn this into a correctness issue. I think we'll have to carefully audit the optimizer here, but I'm not (yet) too worried about the ramifications.</div><div><br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
> I think there may be a separate way of specifying all of this that makes<br>
> things clearer. Operand bundles imply that when lowering, the call may be<br>
> wrapped with a call to an external function before and/or after the called<br>
> function, with the bundled operands escaped into those external functions<br>
> which may capture, etc.<br>
><br>
> This both gives you the escape semantics, and it gives you something else;<br>
> the runtime function might not return! That should (I think) exactly capture<br>
> the semantic issue you were worried about with deopt. Because control may<br>
> never reach the called function, or may never return to the caller even if<br>
> the callee returns, code motion of side-effects would be clearly prohibited.<br>
<br>
This is sort of what I was getting at when I said<br>
<br>
"As a meta point, I think the right way to view operand bundles is as<br>
something that *happens* before and after a call / invoke, not as a<br>
set of values being passed around."<br>
<br>
But with this scheme, the issue with a function's attributes being out<br>
of sync with its actual semantics at a call site still exists.<br>
<br>
I think a reasonable specification is to add a function attribute<br>
`may_deopt_caller`[1]. Only functions that are marked<br>
`may_deopt_caller` can actually access the operand bundles that were<br>
passed to the function at a call site, and `may_deopt_caller` implies<br>
all of the reordering restrictions we are interested in.<br>
`-functionattrs` is not allowed to mark a `may_deopt_caller` function<br>
as `readnone` (say) because it isn't. If we wanted to be really<br>
clever, we could even DCE deopt operand bundles in calls to functions<br>
that are not marked `may_deopt_caller`.<br><br>
This does bring up the semantic issue of whether `may_deopt_caller` is<br>
truly a property of the callee, or whether I am just trying to come up with<br>
arbitrary conservative attributes to sweep a complex issue under the<br>
carpet. I'll have to spend some time thinking about this, but at this<br>
time I think it is the former (otherwise I wouldn't be writing this<br>
:)) -- typically a callee has to *do* something to deopt its caller,<br>
and that's usually a call to the runtime. `may_deopt_caller` in this<br>
case is a conservative attribute stating that the callee may execute<br>
such a deopting call. The most similar existing attribute I can find<br>
is `returns_twice`.<br></blockquote><div><br></div><div>I really think this just happens to be the special case of deopt, and that it is a mistake to design the IR extension based solely on that use case.</div><div><br></div><div>Consider many of the other decorator patterns that have been discussed as uses of this IR functionality. If the runtime logic invoked before or after the function can read or write memory other than what the callee does, we are moving to a point where the call instruction's annotations (attributes + operand bundles) introduce a *more restrictive* semantic model than the function attributes alone.</div><div><br></div><div>I'm actually much more comfortable with the highly generic approach and eating the cost of teaching the optimizer about this distinction.</div></div></div>
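<div><br></div><div>To make the call-versus-function distinction concrete, here is a toy sketch in Python. It is not LLVM's API: `Function`, `Call`, `call_is_readnone`, and `infer_call_attrs` are hypothetical names, and the rule "never copy callee attributes onto a call that carries an operand bundle" is just one conservative assumption about how the generic model could be enforced.</div>

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Function:
    """Toy stand-in for a function and its own attribute set."""
    name: str
    attrs: frozenset = frozenset()


@dataclass(frozen=True)
class Call:
    """Toy stand-in for a call instruction (hypothetical, not llvm::CallInst)."""
    callee: Function
    attrs: frozenset = frozenset()  # call-site attributes
    bundles: tuple = ()             # operand bundle tags, e.g. ("foo",)


def call_is_readnone(call):
    # Queries about the call instruction consult only the call's attributes,
    # never the callee's.
    return "readnone" in call.attrs


def infer_call_attrs(call):
    # Assumed conservative rule: an operand bundle may add behavior around the
    # callee, so callee attributes are not copied onto a bundled call.
    if call.bundles:
        return call
    return Call(call.callee, call.attrs | call.callee.attrs, call.bundles)


f = Function("f", frozenset({"readnone"}))
plain = infer_call_attrs(Call(f))
bundled = infer_call_attrs(Call(f, bundles=("foo",)))
```

<div>Under these assumptions `plain` ends up `readnone` while `bundled` does not, even though both call the same `readnone` function. A pass that asked about `bundled.callee.attrs` instead of the call would get the answer that is right for the function but wrong for the call.</div>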