[PATCH] D17530: AMDGPU: More bits of frame index are known to be zero

Mon Feb 22 18:15:29 PST 2016

arsenm created this revision.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
Herald added a subscriber: arsenm.

The maximum private allocation for the whole GPU is 4G,
so the maximum possible frame index for a single workitem is
that maximum size divided by the smallest granularity for a dispatch.

This increases the number of known-zero high bits, which
enables more offset folding. With this change the maximum private
size per workitem is 128M, though the real limit may be smaller still.
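
As a rough illustration of the arithmetic above (a sketch, not code from
the patch): 4G divided by 128M is 32, so a 32-bit frame index bounded
this way has its top 5 bits known to be zero. The helper names below are
hypothetical, and the divisor of 32 is an assumption inferred from the
4G and 128M figures rather than taken from the diff. The last helper shows
one plausible reason extra known-zero bits help folding: the larger the
known-zero prefix of the base, the larger the constant offset that can be
added without wrapping.

```cpp
#include <cstdint>

// Illustrative helpers only; none of these names come from the patch.

// Number of bits needed to represent v (0 for v == 0).
constexpr unsigned bitWidthOf(std::uint64_t v) {
  unsigned n = 0;
  for (; v != 0; v >>= 1)
    ++n;
  return n;
}

// High bits of a 32-bit frame index that are known zero when the index
// lies in [0, totalBytes / minGranularity).
constexpr unsigned knownZeroHighBits(std::uint64_t totalBytes,
                                     std::uint64_t minGranularity) {
  return 32 - bitWidthOf(totalBytes / minGranularity - 1);
}

// True if base + offset cannot wrap in 32 bits, given only that the top
// `zeroBits` bits of base are known to be zero.
constexpr bool addCannotWrap(unsigned zeroBits, std::uint32_t offset) {
  std::uint64_t maxBase = (std::uint64_t{1} << (32 - zeroBits)) - 1;
  return maxBase + offset <= UINT32_MAX;
}

constexpr std::uint64_t kTotalPrivate = std::uint64_t{1} << 32; // 4G
constexpr std::uint64_t kAssumedGranularity = 32; // assumption: 4G / 128M

static_assert(knownZeroHighBits(kTotalPrivate, kAssumedGranularity) == 5,
              "128M per workitem leaves 5 known-zero high bits");
```

With 5 known-zero high bits, addCannotWrap(5, offset) holds for any
offset up to 0xF8000000, whereas with a single known-zero bit it only
holds for offsets up to 0x80000000.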

http://reviews.llvm.org/D17530

Files:
  lib/Target/AMDGPU/AMDGPU.td
  lib/Target/AMDGPU/AMDGPUSubtarget.cpp
  lib/Target/AMDGPU/AMDGPUSubtarget.h
  lib/Target/AMDGPU/SIISelLowering.cpp
  test/CodeGen/AMDGPU/private-element-size.ll
  test/CodeGen/AMDGPU/scratch-buffer.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D17530.48763.patch
Type: text/x-patch
Size: 8806 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160223/8cc1653f/attachment.bin>