<div dir="ltr">I think this will have a much higher cost than my proposal to constrain how we deduce function attributes (which still fixes Sanjoy's latest example).<div><br></div><div>Specifically, I think this will force us to constrain far too many transformations for the sake of code size in functions that we won't inline. Even if we were never going to deduce function attributes for anything in the function (because its big and reads and writes everything), we'll still have to constrain our transformations just because we *might* later deduce a function attribute that triggers these kinds of bugs.</div><div><br></div><div>Essentially, you're proposing to limit intraprocedural optimization to when we can successfully to interprocedural optimization ("privatization"), where I'm suggesting we limit interprocedural optimization to leave intraprocedural optimization unconstrained. Given the ratio of our optimizations (almost all are intra, very few are inter), I'm much more comfortable with the latter.<br></div></div><br><div class="gmail_quote"><div dir="ltr">On Fri, Feb 26, 2016 at 6:10 PM Hal Finkel <<a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Sanjoy,<br>

<br>

These are both very interesting examples, and demonstrate that the problems extends beyond function attributes (encompassing dead-argument elimination, etc.).<br>

<br>

I'm beginning to think that the best solution, at least when optimizing for speed, is the one that David Li suggested: we need to internalize functions that have been optimized in certain ways (e.g. instructions with potential side effects are removed). The trick here may be to be as intelligent about this as possible to minimize the effect on code size. Maybe this is as easy as checking whether isSafeToSpeculativelyExecute returns false on the deleted instruction? Perhaps when optimizing for size, we need to forbid such deletions.<br>

<br>

Thanks again,<br>

Hal<br>

<br>

----- Original Message -----<br>

> From: "Sanjoy Das" <<a href="mailto:sanjoy@playingwithpointers.com" target="_blank">sanjoy@playingwithpointers.com</a>><br>

> To: "Hal Finkel" <<a href="mailto:hfinkel@anl.gov" target="_blank">hfinkel@anl.gov</a>><br>

> Cc: "Chandler Carruth" <<a href="mailto:chandlerc@google.com" target="_blank">chandlerc@google.com</a>>, "llvm-dev" <<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>>, "Philip Reames"<br>

> <<a href="mailto:listmail@philipreames.com" target="_blank">listmail@philipreames.com</a>>, "Duncan P. N. Exon Smith" <<a href="mailto:dexonsmith@apple.com" target="_blank">dexonsmith@apple.com</a>><br>

> Sent: Thursday, February 25, 2016 11:59:27 AM<br>

> Subject: Re: [llvm-dev] Possible soundness issue with available_externally (split from "RFC: Add guard intrinsics")<br>

><br>

> Couple of other examples:<br>

><br>

>   void @foo(i32* %ptr) available_externally {<br>

>     %discard = load i32, i32* %ptr<br>

>   }<br>

>   void bar() {<br>

>     call @foo(i32* %x)<br>

>   }<br>

><br>

> ==><br>

><br>

>   void @foo(i32* %ptr) available_externally {<br>

>   }<br>

>   void bar() {<br>

>     call @foo(i32* %x)<br>

>   }<br>

><br>

> ==><br>

><br>

>   void @foo(i32* %ptr) available_externally {<br>

>   }<br>

>   void bar() {<br>

>     call @foo(i32* undef) ;; non optimized @foo will crash<br>

>   }<br>

><br>

>   ;; Similar example if @foo was dividing something by an integer<br>

>   ;; argument<br>

><br>

> We've actually seen the above in our VM (though back then we<br>

> didn't realize that the problem was more general than the one<br>

> case above).<br>

><br>

> Another one involving `undef` (semantically same as "folding undef",<br>

> but different enough to state separately):<br>

><br>

>   void @foo(i32* %ptr) available_externally {<br>

>     store i32 undef, i32* %ptr<br>

>   }<br>

>   void bar() {<br>

>     %val = load i32, i32* %x<br>

>     call @foo(i32* %x)<br>

>   }<br>

><br>

> ==><br>

><br>

>   void @foo(i32* %ptr) readonly available_externally {<br>

>   }<br>

>   void bar() {<br>

>     %val = load i32, i32* %x<br>

>     call @foo(i32* %x)<br>

>   }<br>

><br>

> ==><br>

><br>

>   void @foo(i32* %ptr) readonly available_externally {<br>

>   }<br>

>   void bar() {<br>

>     call @foo(i32* %x)<br>

>     %val = load i32, i32* %x<br>

>   }<br>

><br>

> With a non-optimized @foo, %val can be garbage.<br>

><br>

><br>

> I'll also note we've not really had bug reports (that I'm aware of)<br>

> around this issue.  Given that, it is possible that this is a purely<br>

> theoretical problem.<br>

><br>

> -- Sanjoy<br>

><br>

<br>

--<br>

Hal Finkel<br>

Assistant Computational Scientist<br>

Leadership Computing Facility<br>

Argonne National Laboratory<br>

</blockquote></div>