[PATCH] D17530: AMDGPU: More bits of frame index are known to be zero
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Feb 22 18:15:29 PST 2016
arsenm created this revision.
arsenm added a reviewer: tstellarAMD.
arsenm added a subscriber: llvm-commits.
Herald added a subscriber: arsenm.
The maximum private allocation for the whole GPU is 4G,
so the maximum possible index for a single workitem is the
maximum size divided by the smallest granularity for a dispatch.
This increases the number of known zero high bits, which
enables more offset folding. The maximum private size per
workitem with this is 128M but may be smaller still.
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 8806 bytes
Desc: not available
More information about the llvm-commits