[llvm-bugs] [Bug 52111] New: [LV] Loop vectorizer wrongly propagate 'nuw' flag

Fri Oct 8 00:24:25 PDT 2021

https://bugs.llvm.org/show_bug.cgi?id=52111

            Bug ID: 52111
           Summary: [LV] Loop vectorizer wrongly propagate 'nuw' flag
           Product: libraries
           Version: trunk
          Hardware: All
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P
         Component: Loop Optimizer
          Assignee: unassignedbugs at nondot.org
          Reporter: diegocaballero at google.com
                CC: llvm-bugs at lists.llvm.org

Created attachment 25337
  --> https://bugs.llvm.org/attachment.cgi?id=25337&action=edit
Reproducer

Hello,

I think we found a correctness issue in the loop vectorizer related to the
incorrect propagation of the ‘nuw’ flag for instructions guarded by a
condition. The following pseudo-code illustrates the problem:

for (i = 0; i<4; ++i) {
  if(i > 0) {
    … = a[i-1]
  } 
}

In this example, the computation of `i-1` could be flagged with ‘nuw’ because
the guarding condition `i > 0` guarantees that the unsigned result will always
be within the range [0 - 2] and, therefore, will never wrap around. However,
when the loop is vectorized and predication flattens the control flow, the
previous property is no longer true:

for (i = 0; i<4; i+=4) {
  mask = {1, 1, 1, 0};
  … = masked_load(&a[i-1], mask); 
}

In this case, the computation of `i-1` is no longer guarded by the original
condition and the unsigned result will wrap-around for i=0. The vectorizer
currently preserves the original ‘nuw’ value for `i-1`, which leads to a poison
value that is later optimized away, resulting in an incorrect vector code.

Please, find attached a test case. The problematic code before vectorization
is:

concat_index_from_operand_id1:                    ; preds =
%concatenate.pivot.1.4
  %20 = add nsw i64 %fusion.indvar.dim.32, -1
  %21 = mul nuw nsw i64 %fusion.indvar.dim.24, 3
  %22 = add nuw nsw i64 %20, %21
  %23 = getelementptr inbounds [1 x [6 x float]], [1 x [6 x float]]* %11, i64
0, i64 0, i64 %22

After vectorization (opt before_lv.ll -scoped-noalias-aa -loop-vectorize
-ipsccp -mcpu=skx -S -o after_lv.ll):

vector.body:                                      ; preds = %vector.ph
  %14 = mul nuw nsw i64 %fusion.indvar.dim.24, 3
  %15 = add nuw nsw i64 -1, %14  // nuw & -1 -> Poison!
  %16 = getelementptr inbounds [1 x [6 x float]], [1 x [6 x float]]* %11, i64
0, i64 0, i64 %15

I haven’t thought too much about the solution. Maybe we could drop the
‘nuw’/’nsw’ flags of uniform operations that are not immediately nested in the
loop body. Perhaps, during predication?

Thanks,
Diego

-- 
You are receiving this mail because:
You are on the CC list for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-bugs/attachments/20211008/f5475f32/attachment.html>