[PATCH] D121899: [LoopVectorize] Optimise away the icmp when tail-folding for some low trip counts

David Sherwood via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon May 23 01:35:31 PDT 2022


david-arm updated this revision to Diff 431302.
david-arm added a comment.

- I've removed the need for a separate TripCount VPValue in the VPlan class because we're always going to need the original scalar trip count, and the value is the same for each Part anyway. Now we just store a copy in the VPTransformState so that the execute() functions can access it.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D121899/new/

https://reviews.llvm.org/D121899

Files:
  llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
  llvm/lib/Transforms/Vectorize/VPlan.cpp
  llvm/lib/Transforms/Vectorize/VPlan.h
  llvm/test/Transforms/LoopVectorize/AArch64/sve-low-trip-count.ll
  llvm/test/Transforms/LoopVectorize/X86/constant-fold.ll
  llvm/test/Transforms/LoopVectorize/X86/outer_loop_test1_no_explicit_vect_width.ll
  llvm/test/Transforms/LoopVectorize/X86/pr34438.ll
  llvm/test/Transforms/LoopVectorize/X86/pr42674.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D121899.431302.patch
Type: text/x-patch
Size: 11176 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20220523/9243441f/attachment.bin>


More information about the llvm-commits mailing list