[PATCH] D17530: AMDGPU: More bits of frame index are known to be zero
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Feb 22 18:15:29 PST 2016
arsenm created this revision.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
Herald added a subscriber: arsenm.
The maximum private allocation for the whole GPU is 4G,
so the maximum possible index for a single workitem is the
maximum size divided by the smallest granularity for a dispatch.
This increases the number of known zero high bits, which
enables more offset folding. The maximum private size per
workitem with this is 128M but may be smaller still.
http://reviews.llvm.org/D17530
Files:
lib/Target/AMDGPU/AMDGPU.td
lib/Target/AMDGPU/AMDGPUSubtarget.cpp
lib/Target/AMDGPU/AMDGPUSubtarget.h
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/private-element-size.ll
test/CodeGen/AMDGPU/scratch-buffer.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D17530.48763.patch
Type: text/x-patch
Size: 8806 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160223/8cc1653f/attachment.bin>
More information about the llvm-commits
mailing list