[all-commits] [llvm/llvm-project] 7c3cda: [AArch64][SVE] Prefer SIMD&FP variant of clast[ab]

Cullen Rhodes via All-commits all-commits at lists.llvm.org
Wed Jul 13 01:54:05 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 7c3cda551ac702de4eb8899180aa715896020d43
      https://github.com/llvm/llvm-project/commit/7c3cda551ac702de4eb8899180aa715896020d43
  Author: Cullen Rhodes <cullen.rhodes at arm.com>
  Date:   2022-07-13 (Wed, 13 Jul 2022)

  Changed paths:
    M clang/test/CodeGen/aarch64-sve-intrinsics/acle_sve_clasta.c
    M clang/test/CodeGen/aarch64-sve-intrinsics/acle_sve_clastb.c
    M llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp
    A llvm/test/Transforms/InstCombine/AArch64/sve-intrinsic-opts-clast.ll

  Log Message:
  -----------
  [AArch64][SVE] Prefer SIMD&FP variant of clast[ab]

The scalar variant with GPR source/dest has considerably higher latency
than the SIMD&FP scalar variant across a variety of micro-architectures:

  Core           Scalar    SIMD&FP
  --------------------------------
  Neoverse V1     9 cyc      3 cyc
  Neoverse N2     8 cyc      3 cyc
  Cortex A510     8 cyc      4 cyc
  A64FX          29 cyc      6 cyc




More information about the All-commits mailing list