[PATCH] D80085: [LV] Fix FoldTail under user VF and UF

Ayal Zaks via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun May 17 09:02:45 PDT 2020


Ayal created this revision.
Ayal added reviewers: fhahn, gilr, anhtuyen.
Herald added subscribers: llvm-commits, rkruppe, hiraditya.
Herald added a project: LLVM.

LV uses an internally computed MaxVF to decide whether a constant trip count is
a multiple of any subsequently chosen VF, and hence to conclude that no scalar
remainder iterations (tail) will be left for FoldTail to handle. If an external
VF is provided via -force-vector-width, it must be considered instead of the
internal MaxVF. If an external UF is provided via -force-vector-interleave, it
too must be considered, in addition to MaxVF or the user VF.
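
The divisibility check described above can be sketched roughly as follows.
This is a standalone illustration, not the code in the patch: the function
name leavesScalarTail and its parameters are hypothetical, a value of 0 is
assumed to mean "not forced by the user", and the trip counts used below are
made up for the example.

```cpp
#include <cstdint>

// Hypothetical helper (not LLVM's actual API): returns true if a loop with a
// constant trip count would leave scalar remainder iterations (a tail).
// userVF / userUF model -force-vector-width / -force-vector-interleave;
// 0 is assumed here to mean that the option was not given.
bool leavesScalarTail(uint64_t tripCount, unsigned maxVF, unsigned userVF,
                      unsigned userUF) {
  // A user-provided VF replaces the internally computed MaxVF.
  unsigned vf = userVF ? userVF : maxVF;
  // A user-provided UF multiplies the number of iterations per vector step.
  unsigned uf = userUF ? userUF : 1;
  uint64_t step = static_cast<uint64_t>(vf) * uf;
  return tripCount % step != 0;
}
```

With a trip count of 24 and an internal MaxVF of 8, no tail remains. Forcing
VF=16, or forcing VF=4 together with UF=4, gives a step of 16, which does not
divide 24, so a tail is left for FoldTail to handle.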

Fixes PR45679.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D80085

Files:
  llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
  llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
  llvm/test/Transforms/LoopVectorize/pr45679-fold-tail-by-masking.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D80085.264498.patch
Type: text/x-patch
Size: 12760 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200517/578717ac/attachment-0001.bin>

