[PATCH] D84403: [AMDGPU] Use ds_read/write_b96/b128 when possible for SDag
Marek Olšák via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Sep 9 00:05:02 PDT 2020
mareko added a comment.
In D84403#2254048 <https://reviews.llvm.org/D84403#2254048>, @foad wrote:
> In D84403#2253762 <https://reviews.llvm.org/D84403#2253762>, @mareko wrote:
>
>> In D84403#2251541 <https://reviews.llvm.org/D84403#2251541>, @mbrkusanin wrote:
>>
>>> In D84403#2249309 <https://reviews.llvm.org/D84403#2249309>, @mareko wrote:
>>>
>>>> In D84403#2248880 <https://reviews.llvm.org/D84403#2248880>, @mbrkusanin wrote:
>>>>
>>>>> In D84403#2248536 <https://reviews.llvm.org/D84403#2248536>, @mareko wrote:
>>>>>
>>>>>> This breaks LDS. LLVMSetAlignment(inst, 4) on loads and stores has no effect. The IR says "align 4", yet the backend still selects b128.
>>>>>
>>>>> On what subtargets? GFX9 and 10 should select b128 for align 4. That is the purpose of the patch. Are you saying it selects it for SI, CI or VI?
>>>>
>>>> On GFX10. Apparently b128 with align 4 doesn't work there.
>>>
>>> I've checked a couple Vulkan CTS tests that now produce b128 instructions for SDag and they work fine. I also did not find any regressions on others. Can you give us any more details? Or a test to reproduce the issue?
>>
>> More information:
>>
>> - (gfx9 hasn't been tested)
>> - gfx10.1 has the corruption in WGP mode only (CU mode works)
>> - gfx10.3 works
>>
>> It looks like it's a gfx10.1 hw bug in WGP mode, so a fix or workaround is needed. The driver always uses WGP mode.
>
> Do you know what the driver sets SH_MEM_CONFIG.alignment_mode to?
Unaligned.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D84403/new/
https://reviews.llvm.org/D84403
More information about the llvm-commits
mailing list