[PATCH] D149071: [X86] LowerRotate: prefer unpack-based algorithm
Noah Goldstein via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Apr 27 14:33:16 PDT 2023
goldstein.w.n added inline comments.
================
Comment at: llvm/test/CodeGen/X86/min-legal-vector-width.ll:1942
+; CHECK-NEXT: vpackuswb %xmm2, %xmm0, %xmm0
+; CHECK-NEXT: retq
%b8 = sub <16 x i8> <i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8, i8 8>, %b
----------------
Nekotekina wrote:
> Code analyser:
> SKX version (left): https://bit.ly/3NamuO5
> VBMI version (left): https://bit.ly/444ZUfB
>
> Common (right): https://bit.ly/3n7q4xQ
> Unpack version looks better despite more instructions.
This seems potentially not worth it for ICX unless it's in a tight inner loop.
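
For readers unfamiliar with the idea, here is a rough scalar model of what an unpack-based byte rotate does per lane. It assumes the lowering duplicates each byte into a 16-bit lane, shifts that lane, and keeps the high byte before packing back down (consistent with the vpackuswb in the CHECK lines above). The function names are hypothetical and this is not the code from the patch.

```cpp
#include <cstdint>

// Plain rotate-left on one byte, used only as a reference.
constexpr uint8_t rotl8_reference(uint8_t x, unsigned amt) {
    amt &= 7;  // rotate amount is taken modulo the 8-bit width
    return static_cast<uint8_t>((x << amt) | (x >> ((8 - amt) & 7)));
}

// Scalar model of one lane of the unpack-based approach (assumed shape).
constexpr uint8_t rotl8_via_unpack(uint8_t x, unsigned amt) {
    amt &= 7;
    // "Unpack" step: the byte is duplicated into both halves of a 16-bit lane.
    uint16_t wide = static_cast<uint16_t>((x << 8) | x);
    // Word shift: bits leaving the low byte enter the high byte.
    uint32_t shifted = static_cast<uint32_t>(wide) << amt;
    // Shift right by 8 and narrow: the high byte is the rotated result.
    return static_cast<uint8_t>(shifted >> 8);
}

static_assert(rotl8_via_unpack(0x81, 1) == 0x03, "0b10000001 rotated left by 1");
```

The model only shows why the result is correct; it says nothing about throughput or latency, which is what the analyser links above compare.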
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D149071/new/
https://reviews.llvm.org/D149071