[PATCH] D152440: [AMDGPU] Trim trailing undefs from the end of image and buffer store

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jun 8 07:24:58 PDT 2023


arsenm added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp:398
                             &IC.getDominatorTree());
-    if (KnownFPClass.KnownFPClasses != fcPosZero)
+    if (KnownFPClass.KnownFPClasses != fcPosZero && !isa<UndefValue>(Elt))
       break;
----------------
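You can skip the whole computeKnownFPClass call in this case: check isa<UndefValue>(Elt) first, and only compute the known FP class when the element is not undef.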
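
To illustrate the ordering suggested here, below is a minimal standalone sketch, not the code from the patch. It models the trailing-element scan with hypothetical stand-ins: EltKind, isKnownPosZero (in place of the computeKnownFPClass query), and activeElements are made-up names for illustration, and none of them are LLVM API. Always keeping element 0 is also an assumption of this sketch. The point is only that the cheap undef check short-circuits before the analysis runs.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the kinds of vector elements the scan can see.
enum class EltKind { Undef, PosZero, Other };

// Counts how often the (assumed expensive) analysis is invoked.
static int AnalysisCalls = 0;

// Hypothetical stand-in for a computeKnownFPClass-style query:
// returns true when the element is known to be +0.0.
static bool isKnownPosZero(EltKind Elt) {
  ++AnalysisCalls;
  return Elt == EltKind::PosZero;
}

// Returns how many leading elements remain after trimming trailing
// undef / +0.0 elements. Element 0 is never trimmed (sketch assumption).
static std::size_t activeElements(const std::vector<EltKind> &Elts) {
  std::size_t Active = Elts.size();
  while (Active > 1) {
    EltKind Elt = Elts[Active - 1];
    // Cheap check first: an undef element needs no analysis at all,
    // so isKnownPosZero is only reached for non-undef elements.
    if (Elt != EltKind::Undef && !isKnownPosZero(Elt))
      break;
    --Active;
  }
  return Active;
}
```

With this ordering, a vector whose tail is entirely undef is trimmed without a single analysis query; the query runs only for the non-undef elements the scan reaches.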


================
Comment at: llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-simplify-image-buffer-stores.ll:93
+;
+  %newvdata1 = insertelement <4 x float> undef, float %vdata1, i32 0
+  %newvdata2 = insertelement <4 x float> %newvdata1, float 1.0, i32 1
----------------
The test should use poison in place of undef here, e.g. insertelement <4 x float> poison, float %vdata1, i32 0.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D152440/new/

https://reviews.llvm.org/D152440


