[PATCH] D94778: [AArch64] Further restricts when a dup(*ext) can be rearranged
Nicholas Guy via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 15 06:51:21 PST 2021
NickGuy created this revision.
NickGuy added reviewers: SjoerdMeijer, dmgreen, mstorsjo.
NickGuy added a project: LLVM.
Herald added subscribers: danielkiss, hiraditya, kristof.beyls.
NickGuy requested review of this revision.
In most cases, the dup(*ext) pattern can be rearranged to perform
the extension on the vector side, allowing for further vector-specific
optimisations to be made. However the initial checks for this conversion
were insufficient, allowing invalid encodings to be attempted (causing
compilation to fail).
Fixes the crash reported in D91271 <https://reviews.llvm.org/D91271>
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D94778
Files:
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
llvm/test/CodeGen/AArch64/aarch64-dup-ext-crash.ll
Index: llvm/test/CodeGen/AArch64/aarch64-dup-ext-crash.ll
===================================================================
--- /dev/null
+++ llvm/test/CodeGen/AArch64/aarch64-dup-ext-crash.ll
@@ -0,0 +1,33 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
+; RUN: llc < %s -o -| FileCheck %s
+
+target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
+target triple = "aarch64-unknown-linux-gnu"
+
+; This test covers a case where an AArch64 DUP instruction is generated with an
+; invalid encoding, resulting in a crash. We don't care about the specific output
+; here, only that this case no longer causes said crash.
+define dso_local i32 @dupext_crashtest(i32 %e) local_unnamed_addr {
+; CHECK-LABEL: dupext_crashtest:
+for.body.lr.ph:
+ %conv314 = zext i32 %e to i64
+ br label %vector.memcheck
+
+vector.memcheck: ; preds = %for.body.lr.ph
+ br label %vector.ph
+
+vector.ph: ; preds = %vector.memcheck
+ %broadcast.splatinsert = insertelement <2 x i64> poison, i64 %conv314, i32 0
+ %broadcast.splat = shufflevector <2 x i64> %broadcast.splatinsert, <2 x i64> poison, <2 x i32> zeroinitializer
+ br label %vector.body
+
+vector.body: ; preds = %vector.body, %vector.ph
+ %wide.load = load <2 x i32>, <2 x i32>* undef, align 4
+ %0 = zext <2 x i32> %wide.load to <2 x i64>
+ %1 = mul nuw <2 x i64> %broadcast.splat, %0
+ %2 = trunc <2 x i64> %1 to <2 x i32>
+ %3 = select <2 x i1> undef, <2 x i32> undef, <2 x i32> %2
+ %4 = bitcast i32* undef to <2 x i32>*
+ store <2 x i32> %3, <2 x i32>* %4, align 4
+ br label %vector.body
+}
\ No newline at end of file
Index: llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
===================================================================
--- llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
+++ llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
@@ -11831,6 +11831,11 @@
PreExtendType != MVT::i32)
return SDValue();
+ bool IsAssert =
+ ExtendOpcode == ISD::AssertSext || ExtendOpcode == ISD::AssertZext;
+ if (IsAssert && Extend.getValueType() == MVT::i64)
+ return SDValue();
+
EVT PreExtendVT = TargetType.changeVectorElementType(PreExtendType);
if (PreExtendVT.getVectorElementCount() != TargetType.getVectorElementCount())
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D94778.316938.patch
Type: text/x-patch
Size: 2380 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20210115/fac2ac54/attachment.bin>
More information about the llvm-commits
mailing list