[llvm] [AMDGPU] ISel for @llvm.amdgcn.cs.chain intrinsic (PR #68186)
Jay Foad via llvm-commits
llvm-commits at lists.llvm.org
Fri Nov 3 10:47:32 PDT 2023
================
@@ -665,6 +665,50 @@ def : GCNPat<
(SI_TCRETURN_GFX Gfx_CCR_SGPR_64:$src0, (i64 0), i32imm:$fpdiff)
>;
+// Pseudo for the llvm.amdgcn.cs.chain intrinsic.
+// This is essentially a tail call, but it also takes a mask to put in EXEC
+// right before jumping to the callee.
+class SI_CS_CHAIN_TC<
+ ValueType execvt,
+ RegisterOperand execrc = !if(!eq(execvt, i32), SSrc_b32, SSrc_b64)>
+ : SPseudoInstSI <(outs),
+ (ins CCR_SGPR_64:$src0, unknown:$callee, i32imm:$fpdiff, execrc:$exec)> {
+ let FixedSize = 0;
+ let isCall = 1;
+ let isTerminator = 1;
+ let isBarrier = 1;
+ let isReturn = 1;
+ let UseNamedOperandTable = 1;
+ let SchedRW = [WriteBranch];
+ let isConvergent = 1;
+
+ let WaveSizePredicate = !if(!eq(execvt, i32), isWave32, isWave64);
+}
+
+def SI_CS_CHAIN_TC_W32 : SI_CS_CHAIN_TC<i32>;
+def SI_CS_CHAIN_TC_W64 : SI_CS_CHAIN_TC<i64>;
+
+// Handle selecting direct & indirect calls via SI_CS_CHAIN_TC_W32/64
+multiclass si_cs_chain_tc_pattern<
+ dag callee, ValueType execvt, RegisterOperand execrc, Instruction tc> {
+def : GCNPat<
+ (AMDGPUtc_return_chain i64:$src0, callee, (i32 timm:$fpdiff), execvt:$exec),
+ (tc CCR_SGPR_64:$src0, callee, i32imm:$fpdiff, execrc:$exec)
+>;
+}
+
+multiclass si_cs_chain_tc_patterns<
+ ValueType execvt,
+ RegisterOperand execrc = !if(!eq(execvt, i32), SSrc_b32, SSrc_b64),
----------------
jayfoad wrote:
Same.
https://github.com/llvm/llvm-project/pull/68186
More information about the llvm-commits
mailing list