[PATCH] D123956: [AMDGPU] Refine 64 bit misaligned LDS ops selection
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Apr 21 08:56:35 PDT 2022
rampitec added a comment.
In D123956#3464624 <https://reviews.llvm.org/D123956#3464624>, @arsenm wrote:
> LGTM. Is this benchmark getting permanently added to a suite somewhere?
I wish to, I do not want to write it again when we need to bringup a new target. We can discuss on the next meeting where is the right place.
JFYI I have written same tests for global, mubuf and flat scratch since then. We seem to do a right thing for global, but better to always prefer a single dword access on swizzled memory regardless of alignment.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D123956/new/
https://reviews.llvm.org/D123956
More information about the llvm-commits
mailing list