[PATCH] D34726: AMDGPU/SI: Don not insert an instruction into worklist twice in movetovalu

Tue Jun 27 16:39:42 PDT 2017

cfang created this revision.
Herald added subscribers: t-tye, tpr, dstuttard, yaxunl, nhaehnle, wdng, kzhuravl.

In movetovalu, when we process an instruction in the worklist, we may delete/modify the instruction. So putting an 
instruction in the worklist may result in handling a deleted/modified instruction, and thus cause trouble.


https://reviews.llvm.org/D34726

Files:
  lib/Target/AMDGPU/SIInstrInfo.cpp
  test/CodeGen/AMDGPU/move-to-valu-worklist.ll


Index: test/CodeGen/AMDGPU/move-to-valu-worklist.ll
===================================================================

--- /dev/null
+++ test/CodeGen/AMDGPU/move-to-valu-worklist.ll
@@ -0,0 +1,31 @@
+; RUN: llc -march=amdgcn -mcpu=fiji -verify-machineinstrs < %s | FileCheck %s
+
+; CHECK-LABEL: {{^}}in_worklist_once:
+; CHECK: buffer_load_dword
+; CHECK: BB0_1:
+; CHECK: s_branch BB0_1
+define amdgpu_kernel void @in_worklist_once() #0 {
+bb:
+  %tmp = load i64, i64* undef
+  br label %bb1
+
+bb1:                                              ; preds = %bb1, %bb
+  %tmp2 = phi i64 [ undef, %bb ], [ %tmp16, %bb1 ]
+  %tmp3 = phi i64 [ %tmp, %bb ], [ undef, %bb1 ]
+  %tmp4 = xor i64 0, %tmp2
+  %tmp5 = xor i64 %tmp4, 0
+  %tmp6 = xor i64 %tmp5, 0
+  %tmp7 = xor i64 %tmp6, 0
+  %tmp8 = xor i64 0, %tmp7
+  %tmp9 = xor i64 0, %tmp3
+  %tmp10 = xor i64 0, %tmp8
+  %tmp11 = shl i64 %tmp10, 14
+  %tmp12 = lshr i64 %tmp10, 50
+  %tmp13 = or i64 %tmp11, %tmp12
+  %tmp14 = xor i64 0, %tmp9
+  %tmp15 = and i64 %tmp9, %tmp13
+  %tmp16 = xor i64 %tmp15, %tmp14
+  br label %bb1
+}
+
+attributes #0 = { nounwind }
Index: lib/Target/AMDGPU/SIInstrInfo.cpp
===================================================================
--- lib/Target/AMDGPU/SIInstrInfo.cpp
+++ lib/Target/AMDGPU/SIInstrInfo.cpp
@@ -3856,7 +3856,9 @@
          E = MRI.use_end(); I != E;) {
     MachineInstr &UseMI = *I->getParent();
     if (!canReadVGPR(UseMI, I.getOperandNo())) {
-      Worklist.push_back(&UseMI);
+      // Do not add to worklist twice!
+      if(Worklist.end() == llvm::find(Worklist, &MI))
+        Worklist.push_back(&UseMI);
 
       do {
         ++I;
@@ -3941,7 +3943,9 @@
       return;
 
     if (MI.findRegisterUseOperandIdx(AMDGPU::SCC) != -1)
-      Worklist.push_back(&MI);
+      // Do not add to worklist twice!
+      if(Worklist.end() == llvm::find(Worklist, &MI))
+        Worklist.push_back(&MI);
   }
 }
 


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D34726.104310.patch
Type: text/x-patch
Size: 1918 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170627/29750283/attachment.bin>