[PATCH] D140546: [AMDGPU] Remove permlane discard vdst_in optimization from isel

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Dec 22 06:48:00 PST 2022


foad added a subscriber: nlopes.
foad added inline comments.


================
Comment at: llvm/test/CodeGen/AMDGPU/llvm.amdgcn.permlane.ll:853
   %tidx = call i32 @llvm.amdgcn.workitem.id.x()
-  %v = call i32 @llvm.amdgcn.permlane16(i32 12345, i32 %tidx, i32 %src1, i32 %src2, i1 1, i1 0)
+  %v = call i32 @llvm.amdgcn.permlane16(i32 undef, i32 %tidx, i32 %src1, i32 %src2, i1 1, i1 0)
   store i32 %v, ptr addrspace(1) %out
----------------
arsenm wrote:
> foad wrote:
> > arsenm wrote:
> > > Should use poison
> > Really? I read that "Most instructions return ‘poison’ when one of their arguments is ‘poison’" and that's not what I want here. Besides I am just mimicking InstCombine which still uses undef for this optimization.
> undef is soft deprecated. Undef should be freeze poison
+ @nlopes So is "freeze poison" the best option here? I guess just "poison" would work, since nothing is actually going to remove the whole intrinsic in a codegen test, but it seems conceptually wrong.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D140546/new/

https://reviews.llvm.org/D140546



More information about the llvm-commits mailing list