[PATCH] D119696: [AMDGPU] Improve v_cmpx usage on GFX10.3.

Thomas Symalla via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Feb 16 06:58:51 PST 2022


tsymalla marked an inline comment as done.
tsymalla added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp:571
+  // After all s_op_saveexec instructions are inserted,
+  // replace (on GFX10.3)
+  // v_cmp_* SGPR, IMM, VGPR
----------------
arsenm wrote:
> nhaehnle wrote:
> > gfx10.3 and later.
> Why only do this on gfx10.3? Every target has v_cmpx?
We restricted this to GFX10.3 for two reasons. On targets older than GFX10, it is unclear whether this gives any performance advantage. On GFX10.1 / 10.2, an additional s_waitcnt_depctr needs to be inserted for correctness, so there probably won't be any performance gain on those targets either.
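
To make the gating above concrete, here is a minimal standalone sketch of the decision. It is illustrative only: the `Gen` enum, the `CmpxDecision` struct and `shouldUseVCmpx` are hypothetical names made up for this example and are not the subtarget API or the code in the patch.

```cpp
#include <string>

// Hypothetical stand-in for the target generations discussed in this thread.
// This is NOT the real subtarget interface used by SIOptimizeExecMasking.
enum class Gen { PreGFX10, GFX10_1, GFX10_2, GFX10_3 };

// Result of the gating decision plus the reason given in the review.
struct CmpxDecision {
  bool Apply;
  std::string Reason;
};

// Decide whether the v_cmp -> v_cmpx rewrite is applied for a generation,
// following the reasoning stated in this comment.
CmpxDecision shouldUseVCmpx(Gen G) {
  switch (G) {
  case Gen::PreGFX10:
    // Unclear whether the rewrite is a performance win here.
    return {false, "performance advantage unclear"};
  case Gen::GFX10_1:
  case Gen::GFX10_2:
    // An extra s_waitcnt_depctr would be required for correctness,
    // which probably cancels out any gain.
    return {false, "needs additional s_waitcnt_depctr"};
  case Gen::GFX10_3:
    // The only target enabled by the patch as currently written.
    return {true, "enabled by this patch"};
  }
  return {false, "unknown generation"};
}
```

Whether later generations should be included as well (as suggested above) is left open in this sketch.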

@foad might be able to elaborate on this.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D119696/new/

https://reviews.llvm.org/D119696


