[PATCH] D14901: [X86][SSE] Improve i16 splatting shuffles

Simon Pilgrim via llvm-commits llvm-commits at lists.llvm.org
Wed Dec 16 12:31:16 PST 2015


RKSimon added a comment.

In http://reviews.llvm.org/D14901#312358, @escha wrote:

> Just a side note, but Agner claims pshufb is 1 cycle latency on Wolfdale, Nehalem, and Ivy Bridge as well.


But not on any recent Atom or AMD targets, and that latency figure still doesn't account for the cost of loading the shuffle mask. On the whole I think the 3-op threshold is about right.
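
To make the trade-off concrete, here is a rough scalar model (not the code from the patch) of the two ways to splat an i16 element: one PSHUFB, which needs a 16-byte mask constant loaded from memory, versus a PSHUFLW + PSHUFD pair whose controls are immediates. All function names below are hypothetical and made up for illustration. The two-op path shown only handles elements 0..3; the high half would use PSHUFHW instead.

```cpp
#include <array>
#include <cstdint>

using Bytes = std::array<std::uint8_t, 16>;
using Words = std::array<std::uint16_t, 8>;

// Little-endian view of eight 16-bit lanes as sixteen bytes, and back.
Bytes to_bytes(const Words &w) {
  Bytes b{};
  for (int i = 0; i < 8; ++i) {
    b[2 * i] = static_cast<std::uint8_t>(w[i] & 0xFF);
    b[2 * i + 1] = static_cast<std::uint8_t>(w[i] >> 8);
  }
  return b;
}

Words to_words(const Bytes &b) {
  Words w{};
  for (int i = 0; i < 8; ++i)
    w[i] = static_cast<std::uint16_t>(b[2 * i] | (b[2 * i + 1] << 8));
  return w;
}

// Scalar model of PSHUFB: each mask byte picks a source byte by its low
// four bits; a set high bit zeroes the destination byte instead.
Bytes pshufb_model(const Bytes &src, const Bytes &mask) {
  Bytes out{};
  for (int i = 0; i < 16; ++i)
    out[i] = (mask[i] & 0x80) ? 0 : src[mask[i] & 0x0F];
  return out;
}

// The 16-byte constant PSHUFB needs to splat 16-bit element idx (0..7).
// On real hardware this mask has to come from memory or a register.
Bytes splat_i16_mask(unsigned idx) {
  Bytes m{};
  for (int i = 0; i < 16; i += 2) {
    m[i] = static_cast<std::uint8_t>(2 * idx);
    m[i + 1] = static_cast<std::uint8_t>(2 * idx + 1);
  }
  return m;
}

// Scalar model of PSHUFLW: permute the low four words by imm8,
// pass the high four words through unchanged.
Words pshuflw_model(const Words &src, std::uint8_t imm) {
  Words out = src;
  for (int i = 0; i < 4; ++i)
    out[i] = src[(imm >> (2 * i)) & 3];
  return out;
}

// Scalar model of PSHUFD: permute the four 32-bit lanes by imm8.
Words pshufd_model(const Words &src, std::uint8_t imm) {
  Words out{};
  for (int i = 0; i < 4; ++i) {
    unsigned s = (imm >> (2 * i)) & 3;
    out[2 * i] = src[2 * s];
    out[2 * i + 1] = src[2 * s + 1];
  }
  return out;
}

// One op plus a mask constant.
Words splat_i16_pshufb(const Words &src, unsigned idx) {
  return to_words(pshufb_model(to_bytes(src), splat_i16_mask(idx)));
}

// Two ops, no constant: fill the low four words with element idx (0..3),
// then broadcast the low 32-bit lane.
Words splat_i16_two_op(const Words &src, unsigned idx) {
  std::uint8_t imm = static_cast<std::uint8_t>(idx * 0x55);
  return pshufd_model(pshuflw_model(src, imm), 0x00);
}
```

Both routines give the same result for elements 0..3. The difference is that the PSHUFB version carries the extra mask load, which a raw latency number for the instruction itself does not show.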


Repository:
  rL LLVM

http://reviews.llvm.org/D14901




