[all-commits] [llvm/llvm-project] 19727e: [AMDGPU] Enable divergence predicates for ctlz/cttz

alex-t via All-commits all-commits at lists.llvm.org
Mon Dec 20 09:51:45 PST 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 19727e31fb2c0e0b27bd8583d8bda1a42e6d41f8
      https://github.com/llvm/llvm-project/commit/19727e31fb2c0e0b27bd8583d8bda1a42e6d41f8
  Author: alex-t <alexander.timofeev at amd.com>
  Date:   2021-12-20 (Mon, 20 Dec 2021)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SOPInstructions.td
    A llvm/test/CodeGen/AMDGPU/divergence-driven-ctlz-cttz.ll

  Log Message:
  -----------
  [AMDGPU] Enable divergence predicates for ctlz/cttz

ctlz/cttz get lowered to the set of target opcodes
This change enables the ISel to select SALU or VALU form according to the SDNode divergence.
CTLZ - S_FLBIT_I32_B32 if uniform and V_FFBH_U32_e64 if divergent
CTTZ - S_FF1_I32_B32   if uniform and V_FFBL_B32_e64 if divergent
Also @llvm.amdgcn.sffbh.i32 gets lowered to S_FLBIT_I32 if uniform and V_FFBH_I32_e64 if divergent
NOTE: 64bit versions S_FF1_I32_B64 and S_FLBIT_I32_B64 are not currently supported by the DAG ISel.
ctlz/cttz with i64 input are split into two 32bit instructions. Nevertheless, they already have the patterns
and were equipped with the divergence predicates to make sure they will be selected correctly when enabled.

Reviewed By: rampitec

Differential Revision: https://reviews.llvm.org/D116044




More information about the All-commits mailing list