[PATCH] D72348: AMDGPU: Apply i16 add->sub pattern with zext to i32
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jan 7 12:20:17 PST 2020
arsenm added a comment.
In D72348#1808373 <https://reviews.llvm.org/D72348#1808373>, @rampitec wrote:
> In D72348#1808350 <https://reviews.llvm.org/D72348#1808350>, @arsenm wrote:
>
> > In D72348#1808333 <https://reviews.llvm.org/D72348#1808333>, @rampitec wrote:
> >
> > > Will it correctly work with and without sram-ecc? I.e. do we have any assumptions about high 16 content of an i16 value anywhere?
> >
> >
> > That only matters for memory accesses as far as I know. This isn't really a new pattern, and the existing predicates don't check
>
>
> It is more than memory as far as I know, even arithmetic instructions will either zero or preserve the high bits.
This is controlled by a bit starting in gfx9 I think. Eventually we need to split the instruction definitions to add a tied operand for the preserved high case. These are separate problems from this patch anyway
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D72348/new/
https://reviews.llvm.org/D72348
More information about the llvm-commits
mailing list