[PATCH] D119696: [AMDGPU] Improve v_cmpx usage on GFX10.3.
Thomas Symalla via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Feb 16 06:58:51 PST 2022
tsymalla marked an inline comment as done.
tsymalla added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp:571
+ // After all s_op_saveexec instructions are inserted,
+ // replace (on GFX10.3)
+ // v_cmp_* SGPR, IMM, VGPR
----------------
arsenm wrote:
> nhaehnle wrote:
> > gfx10.3 and later.
> Why only do this on gfx10.3? Every target has v_cmpx?
We decided on this because it is unclear whether the change gives any performance advantage on targets before GFX10, and on GFX10.1 / GFX10.2 an additional s_waitcnt_depctr has to be inserted for correctness, so there probably won't be a performance gain on those targets either.
@foad might be able to elaborate on this.
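As a rough illustration of the reasoning above, the per-target decision could be modeled as below. This is only a sketch and not the code in the patch: the `Gen` enum, the `VCmpxDecision` struct and `decideVCmpx` are hypothetical names made up for this example and are not part of the LLVM subtarget API.

```cpp
// Hypothetical, simplified model of the targets discussed in this comment.
// None of these names exist in LLVM; they only illustrate the gating logic.
enum class Gen { PreGFX10, GFX10_1, GFX10_2, GFX10_3OrLater };

struct VCmpxDecision {
  bool Rewrite;              // apply the v_cmpx rewrite on this target?
  bool WouldNeedWaitDepctr;  // would an extra s_waitcnt_depctr be required?
};

inline VCmpxDecision decideVCmpx(Gen G) {
  switch (G) {
  case Gen::PreGFX10:
    // Unclear whether the rewrite gives any performance advantage here.
    return {false, false};
  case Gen::GFX10_1:
  case Gen::GFX10_2:
    // An additional s_waitcnt_depctr would be needed for correctness,
    // which probably cancels out any gain.
    return {false, true};
  case Gen::GFX10_3OrLater:
    // The only case where the rewrite is enabled.
    return {true, false};
  }
  return {false, false};
}
```

With this model only `Gen::GFX10_3OrLater` enables the rewrite, which matches the "gfx10.3 and later" wording suggested in the quoted comment.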
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D119696/new/
https://reviews.llvm.org/D119696