[llvm] [AMDGPU] add s_bitset[10]_b32 optimization for shl+[or, andn2] pattern (PR #134155)
Juan Manuel Martinez CaamaƱo via llvm-commits
llvm-commits at lists.llvm.org
Thu Apr 3 06:22:01 PDT 2025
================
@@ -534,6 +535,56 @@ bool SIShrinkInstructions::shrinkScalarLogicOp(MachineInstr &MI) const {
MachineOperand *SrcReg = Src0;
MachineOperand *SrcImm = Src1;
+ // case 1:
+ // From:
+ // s_lshl_b32 s1, 1, s1
+ // s_or_b32 s0, s0, s1
+ // To:
+ // s_bitset1_b32 s0, s1
+ //
+ // case 2:
+ // s_lshl_b32 s1, 1, s1
+ // s_andn2_b32 s0, s0, s1
+ // To:
+ // s_bitset0_b32 s0, s1
+ if ((MI.getOpcode() == AMDGPU::S_OR_B32 ||
----------------
jmmartinez wrote:
Please move this code into its own function (in the same way as it is done for `shrinkScalarLogicOp`). It would help reduce the nesting of the ifs and have simpler conditions.
https://github.com/llvm/llvm-project/pull/134155
More information about the llvm-commits
mailing list