[llvm-commits] [llvm] r66366 - in /llvm/trunk: lib/Transforms/Scalar/ScalarReplAggregates.cpp test/Transforms/ScalarRepl/2008-06-22-LargeArray.ll test/Transforms/ScalarRepl/vector_memcpy.ll

Sun Mar 8 23:35:20 PDT 2009

On Mar 8, 2009, at 1:09 PM, Eli Friedman wrote:

> On Sun, Mar 8, 2009 at 11:54 AM, Chris Lattner <clattner at apple.com>  
> wrote:
>> Do you have an example that would illustrate this problem?
>
> Compile the following with "clang -S -O2 -mattr=-sse".
>
> #include <string.h>
> #include <stdio.h>
> __attribute__((noinline)) void a(float* x, float* y) {
> float z;
> memcpy(&z, y, 4);
> memcpy(x, &z, 4);
> }
>
> int main() {
>  unsigned x = 2139095041, y;
>  a((float*)&y,(float*)&x);
>  if (y != 2139095041) {
>    printf("y corrupted!\n");
>    return 1;
>  }
>  return 0;
> }

Ok, this is pretty insane, and I'm sure that there are cases where  
SROA would get this "wrong" before too.  I think that the only  
reasonable course of action is to make the IR well defined w.r.t. load  
and store and treat this as an x86 backend bug.  Note that this has  
already been recognized to be a real performance issue (PR3560) as well.

-Chris