[PATCH] D72348: AMDGPU: Apply i16 add->sub pattern with zext to i32

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jan 7 12:20:17 PST 2020


arsenm added a comment.

In D72348#1808373 <https://reviews.llvm.org/D72348#1808373>, @rampitec wrote:

> In D72348#1808350 <https://reviews.llvm.org/D72348#1808350>, @arsenm wrote:
>
> > In D72348#1808333 <https://reviews.llvm.org/D72348#1808333>, @rampitec wrote:
> >
> > > Will it correctly work with and without sram-ecc? I.e. do we have any assumptions about high 16 content of an i16 value anywhere?
> >
> >
> > That only matters for memory accesses as far as I know. This isn't really a new pattern, and the existing predicates don't check
>
>
> It is more than memory as far as I know, even arithmetic instructions will either zero or preserve the high bits.


This is controlled by a bit starting in gfx9 I think. Eventually we need to split the instruction definitions to add a tied operand for the preserved high case. These are separate problems from this patch anyway


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D72348/new/

https://reviews.llvm.org/D72348





More information about the llvm-commits mailing list