[PATCH] D112552: [LoopVectorize] When tail-folding, don't always predicate uniform loads

David Sherwood via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Oct 27 04:54:58 PDT 2021


david-arm added a comment.

In D112552#3088838 <https://reviews.llvm.org/D112552#3088838>, @fhahn wrote:

> Does the test need to be aarch64 specific or could it be target independent?

Hi @fhahn, I think that's a really good idea. The main problem I had when trying this is that tail-folding requires masked load/store support, so when you make this target-independent we end up scalarising the loads/stores and never expose the path I changed in this patch. If there is a way to force masked loads to be legal then I could try that?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D112552/new/

https://reviews.llvm.org/D112552



More information about the llvm-commits mailing list