[PATCH] D80085: [LV] Fix FoldTail under user VF and UF
Ayal Zaks via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun May 17 09:02:45 PDT 2020
Ayal created this revision.
Ayal added reviewers: fhahn, gilr, anhtuyen.
Herald added subscribers: llvm-commits, rkruppe, hiraditya.
Herald added a project: LLVM.
LV considers an internally computed MaxVF to decide if a constant trip-count is
a multiple of any subsequently chosen VF, and conclude that no scalar remainder
iterations (tail) will be left for FoldTail to handle. If an external VF is provided via
-force-vector-width, it must be considered instead of the internal MaxVF.
If an external UF is provided via -force-vector-interleave, it too must be
considered in addition to MaxVF or user VF.
Fixes PR45679.
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D80085
Files:
llvm/lib/Transforms/Vectorize/LoopVectorizationPlanner.h
llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
llvm/test/Transforms/LoopVectorize/pr45679-fold-tail-by-masking.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D80085.264498.patch
Type: text/x-patch
Size: 12760 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200517/578717ac/attachment-0001.bin>
More information about the llvm-commits
mailing list