[PATCH] D44045: [AMDGPU] Adjusted alignment-check for local address space;
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 2 15:38:39 PST 2018
rampitec added a comment.
I think it is simpler than that. If a local symbol must be 64 dword aligned, it should be declared as such, not as 4 byte aligned as we have now.
Such logic should not be needed in the first place; llvm should be able to deduce the proper alignment given proper input.
Although I am not really sure it is true that it is always 64 dword aligned. Consider:
local int x;
local int y;
Do you mean this allocation would take 128 dwords? I highly doubt it.
I suppose only the first symbol is 64 dword aligned, and everything after it is just naturally aligned wrt element type size. So logic that leverages the actual allocation alignment can be useful only after all LDS is allocated and the allocation is flattened into a single LDS memory array.
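
To make the arithmetic concrete, here is a minimal sketch of the layout supposed above. It is only an illustration, not the backend's actual allocator: the names BASE_ALIGN, layout and known_align are hypothetical, and it assumes the LDS base is 64 dword (256 byte) aligned while each symbol is packed at its natural alignment.

```python
# Assumption: the LDS base address is 64-dword aligned (64 * 4 = 256 bytes).
BASE_ALIGN = 64 * 4


def align_up(offset, align):
    """Round offset up to the next multiple of align."""
    return (offset + align - 1) // align * align


def layout(symbols):
    """Pack symbols sequentially, each at its natural alignment.

    symbols: list of (name, size_in_bytes, natural_align_in_bytes).
    Returns (offsets_by_name, total_size_in_bytes).
    """
    offset = 0
    offsets = {}
    for name, size, align in symbols:
        offset = align_up(offset, align)
        offsets[name] = offset
        offset += size
    return offsets, offset


def known_align(offset, base_align=BASE_ALIGN):
    """Alignment provable for an address at `offset` from the aligned base."""
    if offset == 0:
        return base_align
    # Largest power of two dividing the offset, capped by the base alignment.
    return min(base_align, offset & -offset)


# The two-symbol example from the comment: local int x; local int y;
offsets, total = layout([("x", 4, 4), ("y", 4, 4)])
for name, off in offsets.items():
    print(name, "offset", off, "known alignment", known_align(off))
print("total bytes:", total)
```

Under these assumptions the two ints occupy 8 bytes rather than 128 dwords, and only x inherits the base alignment; y can be proven just 4 byte aligned. That per-symbol figure is only available once the offsets in the flattened array are known.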
https://reviews.llvm.org/D44045