[llvm] True16 Add OpSel when optimizing exec mask (PR #128928)

Brox Chen via llvm-commits llvm-commits at lists.llvm.org
Wed Feb 26 10:13:34 PST 2025


https://github.com/broxigarchen created https://github.com/llvm/llvm-project/pull/128928

True16 VOPCX have the opsel argument. Add it when we create these instructions in SIOptimizeExecMasking.

>From d46333c06e498f06e8e9761a11a056369c2049fd Mon Sep 17 00:00:00 2001
From: guochen2 <guochen2 at amd.com>
Date: Wed, 26 Feb 2025 13:13:05 -0500
Subject: [PATCH]  True16 Add OpSel when optimizing exec mask

True16 VOPCX have the opsel argument. Add it when we create these
instructions in SIOptimizeExecMasking.
---
 llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp b/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp
index 920c3e11e4718..745e4086bc7fe 100644
--- a/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp
+++ b/llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp
@@ -632,6 +632,8 @@ bool SIOptimizeExecMasking::optimizeVCMPSaveExecSequence(
 
   TryAddImmediateValueFromNamedOperand(AMDGPU::OpName::clamp);
 
+  TryAddImmediateValueFromNamedOperand(AMDGPU::OpName::op_sel);
+
   // The kill flags may no longer be correct.
   if (Src0->isReg())
     MRI->clearKillFlags(Src0->getReg());



More information about the llvm-commits mailing list