[all-commits] [llvm/llvm-project] cf3a09: [X86] Enable fast variable per-lane shuffle tuning...

Simon Pilgrim via All-commits all-commits at lists.llvm.org
Thu Apr 7 08:20:38 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: cf3a09369a29eb24b66153f0154237a04555ff12
      https://github.com/llvm/llvm-project/commit/cf3a09369a29eb24b66153f0154237a04555ff12
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2022-04-07 (Thu, 07 Apr 2022)

  Changed paths:
    M llvm/lib/Target/X86/X86.td
    M llvm/test/CodeGen/X86/vector-shuffle-fast-per-lane.ll

  Log Message:
  -----------
  [X86] Enable fast variable per-lane shuffle tuning on all Ryzen targets (PR44795)

rGa3b8695bf592 enabled this for znver3 only. However, the AMD SoG, Agner and uops.info all agree that even znver1 has a fast per-lane shuffle op (VPSHUFB), whereas cross-lane shuffles (PERMPS etc.) seem to be slow.
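
For readers unfamiliar with the distinction, here is a minimal scalar sketch of what the two kinds of 256-bit shuffle compute. It is not LLVM code and not part of this commit; the function names `shuffle_per_lane` and `permute_cross_lane` are made up for illustration and model only the architectural semantics of VPSHUFB and VPERMPS, not their latency or throughput.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical scalar model of 256-bit VPSHUFB: each 16-byte lane is
// shuffled independently, so a result byte can only come from its own lane.
std::array<std::uint8_t, 32> shuffle_per_lane(
    const std::array<std::uint8_t, 32>& src,
    const std::array<std::uint8_t, 32>& ctrl) {
  std::array<std::uint8_t, 32> out{};
  for (std::size_t i = 0; i < 32; ++i) {
    const std::size_t lane_base = (i / 16) * 16;  // 0 for low lane, 16 for high lane
    if (ctrl[i] & 0x80) {
      out[i] = 0;  // a set top bit in the control byte zeroes the result byte
    } else {
      out[i] = src[lane_base + (ctrl[i] & 0x0F)];  // low 4 bits index within the lane
    }
  }
  return out;
}

// Hypothetical scalar model of 256-bit VPERMPS: any of the 8 floats can be
// selected for any result element, so data may cross the 128-bit lane boundary.
std::array<float, 8> permute_cross_lane(
    const std::array<float, 8>& src,
    const std::array<std::uint32_t, 8>& idx) {
  std::array<float, 8> out{};
  for (std::size_t i = 0; i < 8; ++i) {
    out[i] = src[idx[i] & 7];  // low 3 bits index across the whole register
  }
  return out;
}
```

The tuning flag concerns which of these two forms the backend should prefer for variable shuffle masks when either would do.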

Fixes #44140

Differential Revision: https://reviews.llvm.org/D123306



