[PATCH] D125126: [AMDGPU] Enable FLAT LDS DMA on gfx9/10 before gfx940

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sat May 7 02:58:49 PDT 2022


arsenm added a comment.

In D125126#3498084 <https://reviews.llvm.org/D125126#3498084>, @rampitec wrote:

> A potentially better alternative is to use the gfx940 names, with _LDS_ in the mnemonic instead of a modifier. This is logically a different opcode anyway. The only downside is that it is not compatible with the documentation and sp3. But it was not implemented before and therefore not used, so there should be no compatibility problem in practice. Well, it will also be different from MUBUF. Given the difference in both semantics and addressing mode, I personally would prefer it to be different opcodes. At a pseudo level it is certainly easier to have separate ops for this.
>
> Preferences?

It's probably better to have separate opcodes. In general, I think the way we try to force all of these subtarget changes onto the same generic pseudos is more trouble than it's worth. It requires more and more code to verify and make use of the features, and it would be cleaner to move towards separate instruction definitions per subtarget.
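
As a rough illustration of the trade-off, here is a minimal standalone C++ sketch. It is not the actual AMDGPU TableGen or backend code: the names Subtarget, GenericLoad, Opcode, writesRegister and isLegal are all hypothetical, and the rule that only subtarget A accepts the modifier is an assumption made up for the example.

```cpp
// All names below are hypothetical and chosen for illustration only.
enum class Subtarget { A, B };

// Style 1: one generic pseudo; an "lds" modifier bit changes its semantics.
struct GenericLoad {
  bool LDSModifier; // when set, data goes to LDS and no register is written
};

// Every query about the instruction has to inspect the modifier...
bool writesRegister(const GenericLoad &I) { return !I.LDSModifier; }

// ...and a verifier has to reject the modifier where it is unsupported.
// Assumption: only the hypothetical subtarget A accepts the modifier.
bool isLegal(const GenericLoad &I, Subtarget ST) {
  return !I.LDSModifier || ST == Subtarget::A;
}

// Style 2: separate opcodes; the semantics are a fixed property of the opcode.
enum class Opcode { LOAD, LOAD_LDS };

constexpr bool writesRegister(Opcode Op) { return Op == Opcode::LOAD; }
```

With the modifier, each new subtarget variation tends to add another condition to functions like isLegal, while with separate opcodes the question is answered by the opcode alone.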


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D125126/new/

https://reviews.llvm.org/D125126


