[llvm] True16 Add OpSel when optimizing exec mask (PR #128928)
Brox Chen via llvm-commits
llvm-commits at lists.llvm.org
Wed Feb 26 10:13:34 PST 2025
https://github.com/broxigarchen created https://github.com/llvm/llvm-project/pull/128928
True16 VOPCX have the opsel argument. Add it when we create these instructions in SIOptimizeExecMasking.
>From d46333c06e498f06e8e9761a11a056369c2049fd Mon Sep 17 00:00:00 2001
From: guochen2 <guochen2 at amd.com>
Date: Wed, 26 Feb 2025 13:13:05 -0500
Subject: [PATCH] True16 Add OpSel when optimizing exec mask
True16 VOPCX have the opsel argument. Add it when we create these
instructions in SIOptimizeExecMasking.
---
llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp | 2 ++
1 file changed, 2 insertions(+)
diff --git a/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp b/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp
index 920c3e11e4718..745e4086bc7fe 100644
--- a/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp
+++ b/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp
@@ -632,6 +632,8 @@ bool SIOptimizeExecMasking::optimizeVCMPSaveExecSequence(
TryAddImmediateValueFromNamedOperand(AMDGPU::OpName::clamp);
+ TryAddImmediateValueFromNamedOperand(AMDGPU::OpName::op_sel);
+
// The kill flags may no longer be correct.
if (Src0->isReg())
MRI->clearKillFlags(Src0->getReg());
More information about the llvm-commits
mailing list