[PATCH] D61263: [X86][SSE] Fold scalar horizontal add/sub for non-0/1 element extractions

Sanjay Patel via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Apr 30 16:32:10 PDT 2019


spatel added a comment.

I might've overlooked it - do we have test coverage for a 256-bit source vector where we extract from the upper elements?



================
Comment at: lib/Target/X86/X86ISelLowering.cpp:19034
+  // equivalent, so extract the 256/512-bit source op to 128-bit if we can.
   // This is free: ymm/zmm -> xmm.
+  if (NumEltsPerLane <= LExtIndex)
----------------
Need to update this comment to something like:
This is a shuffle or free if the left index is 0.


Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D61263/new/

https://reviews.llvm.org/D61263





More information about the llvm-commits mailing list