[PATCH] D14050: [X86][SSE] Shuffle blends with zero

Elena Demikhovsky via llvm-commits llvm-commits at lists.llvm.org
Wed Oct 28 06:24:38 PDT 2015


delena added a comment.

In the more general case it will work if one of V1 or V2 is a vector of constants with '0' in the right place. You already check for this when you compute computeZeroableShuffleElements().

if (Zeroable[i]) {

  You just need to know which input to choose. You don't need to rebuild V1 or V2.
  And you can **always** define a mask for the "zeroable" element, so there is no fallthrough in this case.

If computeZeroableShuffleElements() returned not only the mask but also the input number (V1 or V2) for each zeroable element, you could just use that information.
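
To make the suggestion concrete, here is a rough standalone sketch of what such a helper might look like. It is not the code in the patch and does not use LLVM's types: `Lanes`, `ZeroSource` and `computeZeroableSources` are hypothetical names invented for illustration, inputs are modeled as plain vectors of optional constants, and the mask convention (-1 for undef, [0, Size) for V1, [Size, 2*Size) for V2) is assumed to match the usual shuffle mask layout. It needs C++17 for std::optional.

```cpp
#include <optional>
#include <vector>

// Hypothetical model of a shuffle input: one entry per lane, holding the
// lane's value when it is a known constant and std::nullopt otherwise.
using Lanes = std::vector<std::optional<int>>;

// Hypothetical per-element result: which input, if any, supplies the zero.
enum class ZeroSource { NotZeroable, Undef, V1, V2 };

// Sketch only. Assumes V1, V2 and Mask all have the same number of elements,
// and that every non-negative mask entry lies in [0, 2 * Size).
std::vector<ZeroSource> computeZeroableSources(const std::vector<int> &Mask,
                                               const Lanes &V1,
                                               const Lanes &V2) {
  const int Size = static_cast<int>(Mask.size());
  std::vector<ZeroSource> Sources(Mask.size(), ZeroSource::NotZeroable);

  for (int i = 0; i < Size; ++i) {
    const int M = Mask[i];

    // An undef mask element can take any value, so either input would do.
    if (M < 0) {
      Sources[i] = ZeroSource::Undef;
      continue;
    }

    // Find the input and lane that this mask element refers to.
    const bool FromV2 = M >= Size;
    const Lanes &Input = FromV2 ? V2 : V1;
    const std::optional<int> &Lane = Input[M % Size];

    // Record the input number only when the referenced lane is a known zero.
    if (Lane.has_value() && *Lane == 0)
      Sources[i] = FromV2 ? ZeroSource::V2 : ZeroSource::V1;
  }
  return Sources;
}
```

A caller building the blend mask could then look at Sources[i] to decide which input to select for element i, instead of rebuilding V1 or V2.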


Repository:
  rL LLVM

http://reviews.llvm.org/D14050




