[PATCH] D123956: [AMDGPU] Refine 64 bit misaligned LDS ops selection

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Apr 21 08:56:35 PDT 2022


rampitec added a comment.

In D123956#3464624 <https://reviews.llvm.org/D123956#3464624>, @arsenm wrote:

> LGTM. Is this benchmark getting permanently added to a suite somewhere?

I would like to; I do not want to write it again when we need to bring up a new target. We can discuss where the right place is at the next meeting.

JFYI, I have written the same tests for global, mubuf and flat scratch since then. We seem to do the right thing for global, but it is better to always prefer a single dword access on swizzled memory regardless of alignment.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D123956/new/

https://reviews.llvm.org/D123956


