[cfe-dev] Speculative load optimization

Tue Oct 10 11:57:15 PDT 2017

> On Oct 10, 2017, at 12:12 PM, Hal Finkel via cfe-dev <cfe-dev at lists.llvm.org> wrote:
> [+Richard, Chandler]
> 
> On 10/09/2017 07:00 PM, Hans Wennborg via cfe-dev wrote:
>> I am not a language lawyer, but I'll atttempt to answer anyway.
>> 
>> On Mon, Oct 9, 2017 at 2:16 PM, Kreitzer, David L via cfe-dev
>> <cfe-dev at lists.llvm.org> <mailto:cfe-dev at lists.llvm.org> wrote:
>>> This llvm patch, https://reviews.llvm.org/D37289 <https://reviews.llvm.org/D37289>, attempts to do an optimization
>>> that involves speculating loads. The patch itself needs some work regardless,
>>> but we are questioning the legality of the optimization, which is currently
>>> performed by both gcc and icc. The desired transformation looks like this:
>>> 
>>>     typedef struct S {
>>>       char padding[4088];
>>>       struct S *p1;
>>>       struct S *p2;
>>>     } S;
>>> 
>>>     struct S* f1(struct S *s, int x)
>>>     {
>>>       S *r;
>>>       if (x)
>>>         r = s->p1;
>>>       else
>>>         r = s->p2;
>>>       return r;
>>>     }
>>> 
>>> TO
>>> 
>>>     struct S* f1(struct S *s, int x)
>>>     {
>>>       return (x) ? s->p1 : s->p2;
>>>     }
>>> 
>>> The fundamental question seems to be whether loading one member of struct S
>>> makes it valid to load other members of the same struct.
>> Yes, I believe that's true. If we're dereferencing s on both paths, it
>> must point to a struct S object, and then loading from any member of
>> that object should be fine.
> 
> I also believe that this is correct.
> 
> I think that Chandler summarized some things to be careful about in this regard here:
>   http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20170807/477944.html <http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20170807/477944.html>
> 
> Of the three points highlighted here, the second clearly might apply:
>> 2) Related to #1, there are applications that rely on this memory model,
>> for example structures where entire regions of the structure live in
>> protected pages and cannot be correctly accessed.
> 
> This, however, is clearly an extension to the standard memory model, and I see no reason to support this by default. Speculatively loading cache lines under contention from other cores might not be a good thing to do for performance reasons, but that's not a correctness issue.

I agree that this optimization is legal under the C and C++ specs.

John.

> 
>  -Hal
> 
>> 
>>> Both gcc & icc
>>> seem to think so and make a distinction between this case and a similar case
>>> where one field is accessed through an lvalue of a different type:
>>> 
>>>     typedef struct T {
>>>       char padding[4088];
>>>       struct S *p1;
>>>     } T;
>>> 
>>>     struct S* f2(struct S *s, int x)
>>>     {
>>>       S *r;
>>>       if (x)
>>>         r = ((T*)s)->p1;
>>>       else
>>>         r = s->p2;
>>>       return r;
>>>     }
>>> 
>>> Neither compiler will transform this case.
>> I suspect it would be within the compiler's rights, but my language
>> knowledge is too weak. Does the cast imply that s points to a valid
>> struct S object (or null, but then we couldn't dereference it)? I'm
>> curious to find out :-)
>> 
>>  - Hans
>> _______________________________________________
>> cfe-dev mailing list
>> cfe-dev at lists.llvm.org <mailto:cfe-dev at lists.llvm.org>
>> http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev <http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev>
> 
> -- 
> Hal Finkel
> Lead, Compiler Technology and Programming Languages
> Leadership Computing Facility
> Argonne National Laboratory
> _______________________________________________
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20171010/c0611ac0/attachment.html>