[PATCH] D80364: [amdgpu] Teach load widening to handle non-DWORD aligned loads.

Thu May 28 08:45:13 PDT 2020

hliao added a comment.

In D80364#2058794 <https://reviews.llvm.org/D80364#2058794>, @arsenm wrote:

> I'd still like to find a way to avoid a whole extra pass run for this. In the test here, the LoadStoreVectorizer should have vectorized these? Why didn't it?

In D80364#2058794 <https://reviews.llvm.org/D80364#2058794>, @arsenm wrote:

> I'd still like to find a way to avoid a whole extra pass run for this. In the test here, the LoadStoreVectorizer should have vectorized these? Why didn't it?

That's due to the misaligned load after coalescing. This example is written intentionally to skip LSV and verify that common widened load could be CSEd within DAG. In practice, there won't be such input as the 1st i16 load should be properly annotated with align 4.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D80364/new/

https://reviews.llvm.org/D80364