[llvm] [AMDGPU] Skip handling of non-byte types in promote alloca. (PR #128769)
Sumanth Gundapaneni via llvm-commits
llvm-commits at lists.llvm.org
Thu Feb 27 08:22:20 PST 2025
================
@@ -0,0 +1,21 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 5
+; RUN: opt -S -mtriple=amdgcn-unknown-amdhsa -passes=amdgpu-promote-alloca < %s | FileCheck %s
+
+; Verify that we do not crash and not promote non-byte alloca types.
+define <8 x i1> @non_byte_alloca_type() {
+; CHECK-LABEL: define <8 x i1> @non_byte_alloca_type() {
+; CHECK-NEXT: [[ENTRY:.*:]]
+; CHECK-NEXT: [[C:%.*]] = icmp ugt <16 x i1> zeroinitializer, zeroinitializer
+; CHECK-NEXT: [[RP:%.*]] = alloca <8 x i1>, align 1
+; CHECK-NEXT: [[TMP0:%.*]] = load <8 x i1>, ptr [[RP]], align 1
+; CHECK-NEXT: store <16 x i1> [[C]], ptr [[RP]], align 2
+; CHECK-NEXT: ret <8 x i1> [[TMP0]]
+;
+entry:
+ %C = icmp ugt <16 x i1> zeroinitializer, zeroinitializer
+ %RP = alloca <8 x i1>, align 1
+ %0 = load <8 x i1>, ptr %RP, align 1
+ store <16 x i1> %C, ptr %RP, align 2
+ ret <8 x i1> %0
+}
+
----------------
sgundapa wrote:
The assertion trigered here is due to subvector being <2 x i1> and the access type being <16 x i1>
The access size for < 16 x i1> is 2 and the computation to derive the subvector relies on this access size and ended with a <2xi1> that triggered the assert due to mismatch in storage size.
assert(DL.getTypeStoreSize(SubVecTy) == DL.getTypeStoreSize(AccessTy));
https://github.com/llvm/llvm-project/pull/128769
More information about the llvm-commits
mailing list