[PATCH] D87744: [RegisterCoalescer] passs Undefs to extendToIndices()
Ruiling, Song via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Sep 22 23:01:29 PDT 2020
ruiling updated this revision to Diff 293650.
ruiling added a comment.
further simplify .mir test
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D87744/new/
https://reviews.llvm.org/D87744
Files:
llvm/lib/CodeGen/RegisterCoalescer.cpp
llvm/test/CodeGen/AMDGPU/coalescer-removepartial-extend-undef-subrange.mir
Index: llvm/test/CodeGen/AMDGPU/coalescer-removepartial-extend-undef-subrange.mir
===================================================================
--- /dev/null
+++ llvm/test/CodeGen/AMDGPU/coalescer-removepartial-extend-undef-subrange.mir
@@ -0,0 +1,51 @@
+# RUN: llc -march=amdgcn -run-pass simple-register-coalescing -verify-machineinstrs -o - %s | FileCheck %s
+#
+# CHECK-LABEL: bb.1:
+# CHECK-NOT: COPY
+# CHECK-LABEL: bb.2:
+#
+# The failure occurs when the coalescer tries to removePartialRedundency() on the
+# "%2:vreg_64 = COPY %3" in bb.1. The coalescer tries to prune and extend each
+# subrange of %2, the subrange for %2.sub1 has a def location (in bb.2) in the
+# predecessor path 2->3->1. But for another predecessor path 0->4->1,
+# the subrange has only one undef location in bb.0. If we don't compute Undef set,
+# it will fail to find the reaching def for %2.sub1 in predecessor bb.4 and bb.0
+# and crash with error message:
+# "Use of $noreg does not have a corresponding definition on every path
+# LLVM ERROR: Use not jointly dominated by defs"
+
+---
+name: _amdgpu_ps_main
+alignment: 1
+tracksRegLiveness: true
+body: |
+ bb.0:
+ liveins: $sgpr2, $sgpr3, $vgpr3
+
+ %0:sgpr_32 = COPY $sgpr2
+ undef %1.sub0:vreg_64 = COPY %0
+ undef %2.sub0:vreg_64 = COPY %0
+ S_CBRANCH_VCCNZ %bb.2, implicit undef $vcc
+ S_BRANCH %bb.4
+
+ bb.1:
+ %2:vreg_64 = COPY %3
+ S_NOP 0, implicit %2.sub0
+
+ bb.2:
+ successors: %bb.3(0x04000000), %bb.2(0x7c000000)
+
+ %3:vreg_64 = COPY %2
+ %1.sub0:vreg_64 = COPY %3.sub0
+ %2:vreg_64 = COPY %1
+ S_CBRANCH_EXECNZ %bb.2, implicit undef $exec
+ S_BRANCH %bb.3
+
+ bb.3:
+ S_BRANCH %bb.1
+
+ bb.4:
+ %3:vreg_64 = COPY %2
+ S_BRANCH %bb.1
+
+...
Index: llvm/lib/CodeGen/RegisterCoalescer.cpp
===================================================================
--- llvm/lib/CodeGen/RegisterCoalescer.cpp
+++ llvm/lib/CodeGen/RegisterCoalescer.cpp
@@ -1212,7 +1212,10 @@
}
++I;
}
- LIS->extendToIndices(SR, EndPoints);
+ SmallVector<SlotIndex, 8> Undefs;
+ IntB.computeSubRangeUndefs(Undefs, SR.LaneMask, *MRI,
+ *LIS->getSlotIndexes());
+ LIS->extendToIndices(SR, EndPoints, Undefs);
}
// If any dead defs were extended, truncate them.
shrinkToUses(&IntB);
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D87744.293650.patch
Type: text/x-patch
Size: 2373 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200923/2961602a/attachment.bin>
More information about the llvm-commits
mailing list