[PATCH] D79176: [AMDGPU] Remove unnecessary s_waitcnt between VMEM loads

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Apr 30 08:29:46 PDT 2020


foad created this revision.
foad added reviewers: nhaehnle, arsenm, rampitec.
Herald added subscribers: llvm-commits, kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, wdng, jvesely, kzhuravl.
Herald added a project: LLVM.

VMEM loads of the same type (sampler vs no sampler) are guaranteed to
write their result registers in order, so there is no need for an
s_waitcnt even if they write to overlapping vgprs.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D79176

Files:
  llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
  llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands.ll
  llvm/test/CodeGen/AMDGPU/shl.ll
  llvm/test/CodeGen/AMDGPU/wait.ll
  llvm/test/CodeGen/AMDGPU/waitcnt-vmem-waw.mir

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D79176.261233.patch
Type: text/x-patch
Size: 15993 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200430/d0a0f735/attachment.bin>


More information about the llvm-commits mailing list