[PATCH] D41440: [X86][SSE] Split large PAVGB/PAVGW vectors to legal widths

Craig Topper via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Dec 20 11:01:39 PST 2017


craig.topper added a comment.

What happens with types that are multiple of 128 bits,



================
Comment at: lib/Target/X86/X86ISelLowering.cpp:33862
   if (Subtarget.hasBWI()) {
-    if (VT.getSizeInBits() > 512)
+    if (VT.getSizeInBits() > 512 && (VT.getSizeInBits() % 512) != 0)
       return SDValue();
----------------
Isn't the modulo guaranteed true by the isPowerOf2_32(NumElems) check above? And if it wasn't it would mean we would do something different for a 384-bit vector with only SSE2(since its divisible by 128) than we would for AVX2(since its not divisible by 256)


Repository:
  rL LLVM

https://reviews.llvm.org/D41440





More information about the llvm-commits mailing list