[PATCH] D90404: [AMDGPU] Add alignment check for v3 to v4 load type promotion

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Oct 30 10:14:10 PDT 2020


foad added a comment.

In D90404#2364971 <https://reviews.llvm.org/D90404#2364971>, @arsenm wrote:

> I'm not sure how I follow that 8 byte alignment is sufficient

If a dwordx3 load is 8-byte aligned then the three words loaded will be at addresses that are:
first word: 8 byte aligned
second word: 8 byte aligned + 4
third word: 8 byte aligned

Adding a fourth word to this load will put it at an address that is 8 byte aligned + 4, which is guaranteed to be in the same page as the third word, assuming your page size is greater than 4 bytes. So it can't cause any new page faults.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D90404/new/

https://reviews.llvm.org/D90404



More information about the llvm-commits mailing list