[llvm] [AMDGPU] Skip handling of non-byte types in promote alloca. (PR #128769)

Sumanth Gundapaneni via llvm-commits llvm-commits at lists.llvm.org
Thu Feb 27 08:24:01 PST 2025


================
@@ -0,0 +1,21 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 5
+; RUN: opt -S -mtriple=amdgcn-unknown-amdhsa -passes=amdgpu-promote-alloca < %s | FileCheck %s
+
+; Verify that we do not crash and not promote non-byte alloca types.
+define <8 x i1> @non_byte_alloca_type() {
+; CHECK-LABEL: define <8 x i1> @non_byte_alloca_type() {
+; CHECK-NEXT:  [[ENTRY:.*:]]
+; CHECK-NEXT:    [[C:%.*]] = icmp ugt <16 x i1> zeroinitializer, zeroinitializer
+; CHECK-NEXT:    [[RP:%.*]] = alloca <8 x i1>, align 1
+; CHECK-NEXT:    [[TMP0:%.*]] = load <8 x i1>, ptr [[RP]], align 1
+; CHECK-NEXT:    store <16 x i1> [[C]], ptr [[RP]], align 2
+; CHECK-NEXT:    ret <8 x i1> [[TMP0]]
+;
+entry:
+  %C = icmp ugt <16 x i1> zeroinitializer, zeroinitializer
+  %RP = alloca <8 x i1>, align 1
----------------
sgundapa wrote:

Thats correct. Here is an example that might trigger an UB

@G = global <8 x float> <float 4.200000e+01, float 4.200000e+01, float 4.200000e+01, float 4.200000e+01, float 4.200000e+01, float 4.200000e+01, float 4.200000e+01, float 4.200000e+01>

define <8 x i1> @f(float %0, i32 %1, i16 %2) {
BB:
  %LGV = load <8 x float>, ptr @G, align 32
  %RP = alloca <8 x i1>, align 1
  %L = load <8 x float>, ptr %RP, align 32
  %C = fcmp olt <8 x float> %L, %LGV
  ret <8 x i1> %C
}


https://github.com/llvm/llvm-project/pull/128769


More information about the llvm-commits mailing list