[PATCH] D85530: [AMDGPU] Don't cluster stores
Carl Ritson via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sat Aug 8 18:25:22 PDT 2020
critson added a comment.
I tried this change with game traces on GFX10.
I could not convince myself that there was any statically significant changes in performance.
Some small gains, some small losses, nothing outside the range of variance.
I do however wonder if this would be better as a tuning option?
The same could also be said for load clustering as essentially you are trading VGPR pressure for VMEM access efficiency (e.g. stalls).
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D85530/new/
https://reviews.llvm.org/D85530
More information about the llvm-commits
mailing list