[llvm] [AMDGPU][SIPreEmitPeephole] pre-commit tests: mustRetainExeczBranch: use a cost model (PR #109816)

Juan Manuel Martinez CaamaƱo via llvm-commits llvm-commits at lists.llvm.org
Tue Sep 24 08:35:47 PDT 2024


================
@@ -0,0 +1,265 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 5
+; RUN: llc -mtriple=amdgcn -mcpu=gfx1010 < %s | FileCheck -check-prefixes=GFX10,GFX1010 %s
+; RUN: llc -mtriple=amdgcn -mcpu=gfx1030 < %s | FileCheck -check-prefixes=GFX10,GFX1030 %s
+
+define void @convergent_cmp_no_metadata(i32 noundef inreg %value, ptr addrspace(8) nocapture writeonly inreg %res, i32 noundef inreg %v_offset, i32 noundef inreg %0, i32 noundef inreg %flag) {
----------------
jmmartinez wrote:

That's weird.

You're right, there is nothing convergent here (it would be different if it was a kernel and not a function)

Would the comparison `%cmp = icmp sgt i32 %flag, 0` generate some other kind of branch (other than `s_cbranch_scc1`) if it was not convergent ?

https://github.com/llvm/llvm-project/pull/109816


More information about the llvm-commits mailing list