[PATCH] D123524: [AMDGPU] Split unaligned 3 DWORD DS operations

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Apr 12 07:55:58 PDT 2022


rampitec added a comment.

In D123524#3444855 <https://reviews.llvm.org/D123524#3444855>, @foad wrote:

> Looks OK to me. But there will always be benchmarks that go faster and slower with any change like this, because the compiler does not have perfect knowledge about the (mis)alignment of all data.

Yes, sure. The point of the patch is to minimize the cost of the mistake. If the data is really aligned it will be slower now, but if it is really misaligned it is way slower with a single instruction before the patch.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D123524/new/

https://reviews.llvm.org/D123524



More information about the llvm-commits mailing list