[PATCH] D147159: [AMDGPU][GFX940] Simplify definitions of GLOBAL_ATOMIC

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Mar 30 12:02:04 PDT 2023


rampitec added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/FLATInstructions.td:730
 
+defm GLOBAL_ATOMIC_ADD_F64 : FLAT_Global_Atomic_Pseudo<"global_atomic_add_f64", VReg_64, f64>;
+defm GLOBAL_ATOMIC_MIN_F64 : FLAT_Global_Atomic_Pseudo<"global_atomic_min_f64", VReg_64, f64>;
----------------
mariusz-sikora-at-amd wrote:
> rampitec wrote:
> > I do not understand this. These were only added in gfx90a.
> This patch is only removing 'complexity' from gfx940. For gfx90A we still have Real Instruction under SubtargetPredicate = IsGFX90APlus.
> I will add test for gfx908.

But doesn't the pseudo still need a predicate too, not just the real instruction?
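
For illustration only, a rough sketch of the kind of guard in question, assuming the pseudos stay wrapped in a subtarget predicate. This is an untested fragment meant to sit inside FLATInstructions.td, not a standalone file, and the predicate is spelled isGFX90APlus here on the assumption that this is the in-tree name (the quoted comment writes it IsGFX90APlus):

    // Hypothetical placement: gate the pseudos, not only the real encodings,
    // so that selection patterns cannot pick them on gfx908 and older.
    let SubtargetPredicate = isGFX90APlus in {
      defm GLOBAL_ATOMIC_ADD_F64 : FLAT_Global_Atomic_Pseudo<"global_atomic_add_f64", VReg_64, f64>;
      defm GLOBAL_ATOMIC_MIN_F64 : FLAT_Global_Atomic_Pseudo<"global_atomic_min_f64", VReg_64, f64>;
    } // End SubtargetPredicate = isGFX90APlus

A gfx908 test like the one mentioned above would show whether the pseudos can be selected without such a guard.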


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D147159/new/

https://reviews.llvm.org/D147159


