[PATCH] D151754: [LoadStoreVectorizer] Fix index width != pointer width case
Krzysztof Drewniak via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue May 30 14:19:08 PDT 2023
krzysz00 created this revision.
krzysz00 added reviewers: jlebar, foad.
Herald added subscribers: StephenFan, kerbowa, arphaman, hiraditya, jvesely.
Herald added a project: All.
krzysz00 requested review of this revision.
Herald added subscribers: llvm-commits, pcwang-thead.
Herald added a project: LLVM.
Fixes https://github.com/llvm/llvm-project/issues/62856
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D151754
Files:
llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp
llvm/test/Transforms/LoadStoreVectorizer/AMDGPU/addrspace-7.ll
Index: llvm/test/Transforms/LoadStoreVectorizer/AMDGPU/addrspace-7.ll
===================================================================
--- llvm/test/Transforms/LoadStoreVectorizer/AMDGPU/addrspace-7.ll
+++ llvm/test/Transforms/LoadStoreVectorizer/AMDGPU/addrspace-7.ll
@@ -1,10 +1,18 @@
-; REQUIRES: asserts
-; RUN: not --crash opt -mtriple=amdgcn-amd-amdhsa -passes=load-store-vectorizer -S -o - %s
-; RUN: not --crash opt -mtriple=amdgcn-amd-amdhsa -aa-pipeline=basic-aa -passes='function(load-store-vectorizer)' -S -o - %s
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 2
+; RUN: opt -mtriple=amdgcn-amd-amdhsa -passes=load-store-vectorizer -S -o - %s | FileCheck %s
+; RUN: opt -mtriple=amdgcn-amd-amdhsa -aa-pipeline=basic-aa -passes='function(load-store-vectorizer)' -S -o - %s | FileCheck %s
target datalayout = "e-p:64:64-p1:64:64-p2:32:32-p3:32:32-p4:64:64-p5:32:32-p6:32:32-p7:160:256:256:32-p8:128:128-i64:64-v16:16-v24:32-v32:32-v48:64-v96:128-v192:256-v256:256-v512:512-v1024:1024-v2048:2048-n32:64-S32-A5"
define { float, float } @f() {
+; CHECK-LABEL: define { float, float } @f() {
+; CHECK-NEXT: bb:
+; CHECK-NEXT: [[L1:%.*]] = load float, ptr addrspace(7) null, align 4
+; CHECK-NEXT: [[L2:%.*]] = load float, ptr addrspace(7) getelementptr (i8, ptr addrspace(7) null, i64 24), align 4
+; CHECK-NEXT: [[IV1:%.*]] = insertvalue { float, float } zeroinitializer, float [[L1]], 0
+; CHECK-NEXT: [[IV2:%.*]] = insertvalue { float, float } [[IV1]], float [[L2]], 1
+; CHECK-NEXT: ret { float, float } [[IV2]]
+;
bb:
%l1 = load float, ptr addrspace(7) null
%l2 = load float, ptr addrspace(7) getelementptr (i8, ptr addrspace(7) null, i64 24)
Index: llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp
===================================================================
--- llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp
+++ llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp
@@ -1501,9 +1501,12 @@
if (DistScev != SE.getCouldNotCompute()) {
LLVM_DEBUG(dbgs() << "LSV: SCEV PtrB - PtrA =" << *DistScev << "\n");
ConstantRange DistRange = SE.getSignedRange(DistScev);
- if (DistRange.isSingleElement())
- return (OffsetB - OffsetA + *DistRange.getSingleElement())
- .sextOrTrunc(OrigBitWidth);
+ if (DistRange.isSingleElement()) {
+ // Handle index width (the width of Dist) != pointer width (the width of
+ // the Offset*s at this point).
+ APInt Dist = DistRange.getSingleElement()->sextOrTrunc(NewPtrBitWidth);
+ return (OffsetB - OffsetA + Dist).sextOrTrunc(OrigBitWidth);
+ }
}
std::optional<APInt> Diff =
getConstantOffsetComplexAddrs(PtrA, PtrB, ContextInst, Depth);
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D151754.526792.patch
Type: text/x-patch
Size: 2745 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230530/fa443db8/attachment.bin>
More information about the llvm-commits
mailing list