[PATCH] D31412: Boost unroll threshold for loops reading local memory
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 27 17:21:18 PDT 2017
rampitec created this revision.
Herald added subscribers: nhaehnle, wdng.
This is less important than increase threshold for private memory,
but still brings performance improvements in a wide range of tests.
Unrolling more for local memory serves three purposes: it allows
to combine ds operations if offset becomes static, saves registers
used for offsets in case of static offsets, and allows better lds
latency hiding.
Repository:
rL LLVM
https://reviews.llvm.org/D31412
Files:
lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
test/CodeGen/AMDGPU/unroll.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D31412.93194.patch
Type: text/x-patch
Size: 7186 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170328/2fa98eab/attachment.bin>
More information about the llvm-commits
mailing list