[LLVMbugs] [Bug 22306] New: wrong vectorization in the presence of C++11 atomics

Fri Jan 23 02:04:10 PST 2015

http://llvm.org/bugs/show_bug.cgi?id=22306

            Bug ID: 22306
           Summary: wrong vectorization in the presence of C++11 atomics
           Product: new-bugs
           Version: unspecified
          Hardware: PC
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P
         Component: new bugs
          Assignee: unassignedbugs at nondot.org
          Reporter: sohachak at mpi-sws.org
                CC: llvmbugs at cs.uiuc.edu
    Classification: Unclassified

Created attachment 13726
  --> http://llvm.org/bugs/attachment.cgi?id=13726&action=edit
contains testcase .cpp, .ll, .opt.bc files

Hi,

LLVM introduces data race in the following compilation in SLP vectorization
phase.

Source
----------
atomic<int> x[4];
int a[4];

void writeA() {
   for(int i=0;i<4;i++) {
     a[i] = 0;
     x[i].store(i,memory_order_release);
   }
}

Compilation command 
---------------------
clang++ -std=c++11 -emit-llvm -pthread <filename>.cpp -S;opt -O3 <filename>.ll
-o <filename>.opt.bc -S

<filename>.opt.bc - optimized code
-------- 
define void @_Z6writeAv() #3 {
entry:
  store atomic i32 0, i32* getelementptr inbounds ([4 x %"struct.std::atomic"]*
@x, i64 0, i64 0, i32 0, i32 0) release, align 16
  store atomic i32 1, i32* getelementptr inbounds ([4 x %"struct.std::atomic"]*
@x, i64 0, i64 1, i32 0, i32 0) release, align 4
  store atomic i32 2, i32* getelementptr inbounds ([4 x %"struct.std::atomic"]*
@x, i64 0, i64 2, i32 0, i32 0) release, align 8
  store <4 x i32> zeroinitializer, <4 x i32>* bitcast ([4 x i32]* @a to <4 x
i32>*), align 16
  store atomic i32 3, i32* getelementptr inbounds ([4 x %"struct.std::atomic"]*
@x, i64 0, i64 3, i32 0, i32 0) release, align 4
  ret void
}

The transformation steps are as follows

for(int i=0;i<4;i++) {a[i] = 0; x[i] = i;}
1. loop unrolling => a[0] = 0; x[0].store(0,release); a[1] = 0;
x[1].store(1,release); a[2] = 0; x[2].store(2,release); a[3] = 0;
x[3].store(3,release);   
2. statement reordering => x[0].store(0,release); x[1].store(1,release);
x[2].store(2,release); a[0] = 0; a[1] = 0; a[2] = 0; a[3] = 0;
x[3].store(3,release); // wrong reordering
3. SLP vectorization => x[0].store(0,release); x[1].store(1,release);
x[2].store(2,release); a[0:3] = 0; x[3].store(3,release);

Movement of non-atomic shared variable a[] after the release write of atomic
variable x is unsafe in step 2. 

Consider the following thread is running in parallel

int readA() {
  int r=0; 
  if(x[2].load(memory_order_acquire) == 2){
    r = a[2];
  }    
  return r;
}

The source program does not have any data race as the write(a[2])
happens-before read(a[2]) due to the synchronization  between
(x[2].store(2,release) , x[2].load(acquire)).

But the target program is racy since the write(a[2]); x[2].store(2,release) is
reordered.   

The reordering in the SLP vectorization is introducing the error.  

Attached are the testcase cpp, and LLVM IR files.

Regards,
soham

-- 
You are receiving this mail because:
You are on the CC list for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-bugs/attachments/20150123/b2eeb6db/attachment.html>