[PATCH] D123524: [AMDGPU] Split unaligned 3 DWORD DS operations
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 12 07:55:58 PDT 2022
rampitec added a comment.
In D123524#3444855 <https://reviews.llvm.org/D123524#3444855>, @foad wrote:
> Looks OK to me. But there will always be benchmarks that go faster and slower with any change like this, because the compiler does not have perfect knowledge about the (mis)alignment of all data.
Yes, sure. The point of the patch is to minimize the cost of the mistake. If the data is really aligned it will be slower now, but if it is really misaligned it is way slower with a single instruction before the patch.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D123524/new/
https://reviews.llvm.org/D123524
More information about the llvm-commits
mailing list