[PATCH] D75976: [AMDGPU] Optimize AtomicOptimizer

Sebastian Neubauer via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 11 02:19:26 PDT 2020


Flakebi created this revision.
Flakebi added reviewers: foad, arsenm, nhaehnle.
Herald added subscribers: llvm-commits, kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, wdng, jvesely, kzhuravl.
Herald added a project: LLVM.
Flakebi added a parent revision: D65088: [AMDGPU][RFC] New llvm.amdgcn.ballot intrinsic.

Mark the ctpop as convergent so it does not get moved into the
single-lane basic block. This saves us currently one instruction.
Another way to save this instruction is reusing the saved exec register
from the inserted control flow (output of s_saveexec). This is
currently hard to do though it might work when GlobalISel gets used.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D75976

Files:
  llvm/lib/Target/AMDGPU/AMDGPUAtomicOptimizer.cpp
  llvm/test/CodeGen/AMDGPU/atomic_optimizations_buffer.ll
  llvm/test/CodeGen/AMDGPU/atomic_optimizations_global_pointer.ll
  llvm/test/CodeGen/AMDGPU/atomic_optimizations_local_pointer.ll
  llvm/test/CodeGen/AMDGPU/atomic_optimizations_pixelshader.ll
  llvm/test/CodeGen/AMDGPU/atomic_optimizations_raw_buffer.ll
  llvm/test/CodeGen/AMDGPU/atomic_optimizations_struct_buffer.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D75976.249564.patch
Type: text/x-patch
Size: 67453 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200311/85ea0048/attachment.bin>


More information about the llvm-commits mailing list