[PATCH] D40865: X86 AVX2: Prefer one VPERMV over ShuffleAsRepeatedMaskAndLanePermute

Dibyendu Das via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Dec 7 01:29:57 PST 2017


ddibyend added a comment.

vpermps/vpermpd/vpermd have a high cost on Ryzen. This patch creates cases where a vperm* is introduced where none existed before, which may cause slowdowns on Ryzen.


https://reviews.llvm.org/D40865




