[PATCH] D105920: [AMDGPU] Init scratch only if necessary
Tony Tye via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jul 13 17:32:53 PDT 2021
t-tye added a comment.
It is also not necessary to request the scratch SRD to be set up in the User SGPRs. There is a significant performance cost for small kernels to requesting this unneeded set of 4 SGPRs to be initialized by the hardware.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D105920/new/
https://reviews.llvm.org/D105920
More information about the llvm-commits
mailing list