[all-commits] [llvm/llvm-project] cf3a09: [X86] Enable fast variable per-lane shuffle tuning...
Simon Pilgrim via All-commits
all-commits at lists.llvm.org
Thu Apr 7 08:20:38 PDT 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: cf3a09369a29eb24b66153f0154237a04555ff12
https://github.com/llvm/llvm-project/commit/cf3a09369a29eb24b66153f0154237a04555ff12
Author: Simon Pilgrim <llvm-dev at redking.me.uk>
Date: 2022-04-07 (Thu, 07 Apr 2022)
Changed paths:
M llvm/lib/Target/X86/X86.td
M llvm/test/CodeGen/X86/vector-shuffle-fast-per-lane.ll
Log Message:
-----------
[X86] Enable fast variable per-lane shuffle tuning on all Ryzen targets (PR44795)
rGa3b8695bf592 enabled this for znver3, but AMD SoG, Agner and uops.info all agree that even znver1 has a fast per-lane shuffle op (VPSHUFB), but cross-lane shuffles seem to be slow (PERMPS etc.)
Fixes #44140
Differential Revision: https://reviews.llvm.org/D123306
More information about the All-commits
mailing list