[PATCH] D84403: [AMDGPU] Use ds_read/write_b96/b128 when possible for SDag

Marek Olšák via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Sep 2 12:09:42 PDT 2020


mareko added a comment.

In D84403#2251541 <https://reviews.llvm.org/D84403#2251541>, @mbrkusanin wrote:

> In D84403#2249309 <https://reviews.llvm.org/D84403#2249309>, @mareko wrote:
>
>> In D84403#2248880 <https://reviews.llvm.org/D84403#2248880>, @mbrkusanin wrote:
>>
>>> In D84403#2248536 <https://reviews.llvm.org/D84403#2248536>, @mareko wrote:
>>>
>>>> This breaks LDS. LLVMSetAlignment(inst, 4) on loads and stores has no effect. The IR says "align 4", yet the backend still selects b128.
>>>
>>> On what subtargets? GFX9 and 10 should select b128 for align 4. That is the purpose of the patch. Are you saying it selects it for SI, CI or VI?
>>
>> On GFX10. Apparently b128 with align 4 doesn't work there.
>
> I've checked a couple of Vulkan CTS tests that now produce b128 instructions for SDag, and they work fine. I also did not find any regressions in other tests. Can you give us any more details, or a test to reproduce the issue?

The broken application is Unigine Heaven running on Mesa OpenGL. It's a pretty standard app, so most likely all tessellation is broken, and possibly other LDS users too.
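
For illustration only, the kind of access under discussion looks roughly like the sketch below. This is not IR taken from Mesa or from Unigine Heaven: the function name and the surrounding kernel are made up, and it is written in the typed-pointer syntax LLVM used at the time (newer releases spell the pointer type `ptr addrspace(3)`). Address space 3 is LDS on AMDGPU.

```llvm
; Hypothetical sketch, not the Mesa-generated IR.
; A 128-bit LDS load that is only guaranteed to be 4-byte aligned.
define amdgpu_kernel void @lds_v4i32_align4(<4 x i32> addrspace(3)* %in,
                                            <4 x i32> addrspace(1)* %out) {
  ; "align 4" is what LLVMSetAlignment(inst, 4) records on the load.
  %v = load <4 x i32>, <4 x i32> addrspace(3)* %in, align 4
  ; Write the result to global memory so the load is not dead.
  store <4 x i32> %v, <4 x i32> addrspace(1)* %out, align 16
  ret void
}
```

Something like `llc -march=amdgcn -mcpu=gfx1010` on such a file should show which ds_read variant gets selected for the load.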


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D84403/new/

https://reviews.llvm.org/D84403


