[PATCH] D121899: [LoopVectorize] Optimise away the icmp when tail-folding for some low trip counts
David Sherwood via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon May 23 01:35:31 PDT 2022
david-arm updated this revision to Diff 431302.
david-arm added a comment.
- I've removed the need for a separate TripCount VPValue in the VPlan class because we're always going to need the original scalar trip count, and the value is the same for each Part anyway. Now we just store a copy in the VPTransformState so that the execute() functions can access it.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D121899/new/
https://reviews.llvm.org/D121899
Files:
llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
llvm/lib/Transforms/Vectorize/VPlan.cpp
llvm/lib/Transforms/Vectorize/VPlan.h
llvm/test/Transforms/LoopVectorize/AArch64/sve-low-trip-count.ll
llvm/test/Transforms/LoopVectorize/X86/constant-fold.ll
llvm/test/Transforms/LoopVectorize/X86/outer_loop_test1_no_explicit_vect_width.ll
llvm/test/Transforms/LoopVectorize/X86/pr34438.ll
llvm/test/Transforms/LoopVectorize/X86/pr42674.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D121899.431302.patch
Type: text/x-patch
Size: 11176 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20220523/9243441f/attachment.bin>
More information about the llvm-commits
mailing list