[PATCH] D112552: [LoopVectorize] When tail-folding, don't always predicate uniform loads
David Sherwood via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Oct 27 04:54:58 PDT 2021
david-arm added a comment.
In D112552#3088838 <https://reviews.llvm.org/D112552#3088838>, @fhahn wrote:
> Does the test need to be aarch64 specific or could it be target independent?
Hi @fhahn, I think that's a really good idea. The main problem I had when trying this is that tail-folding requires masked load/store support. With a target-independent test there is no such support, so the loads/stores get scalarised and the path I changed in this patch is never exercised. If there is a way to force masked loads to be legal then I could try that?
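For context, here is a rough sketch of what tail-folding means for a loop containing a uniform load, written as plain C++ rather than IR. It emulates the lane mask by hand and is not the vectoriser's actual output. The names `VF`, `addUniformScalar` and `addUniformTailFolded` are hypothetical. The reasoning that the uniform load can stay unmasked is an assumption about the general idea, not something taken from the patch itself.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical vectorisation factor, chosen only for illustration.
constexpr std::size_t VF = 4;

// Reference scalar loop: dst[i] = src[i] + k, where k is read from a
// loop-invariant address (a "uniform" load).
std::vector<int> addUniformScalar(const std::vector<int> &src, const int &k) {
  std::vector<int> dst(src.size());
  for (std::size_t i = 0; i < src.size(); ++i)
    dst[i] = src[i] + k;
  return dst;
}

// Hand-written emulation of the same loop after tail-folding: there is no
// scalar remainder loop, and the last iteration runs with some lanes off.
std::vector<int> addUniformTailFolded(const std::vector<int> &src,
                                      const int &k) {
  const std::size_t n = src.size();
  std::vector<int> dst(n);
  for (std::size_t base = 0; base < n; base += VF) {
    // Lane mask: a lane is active only if its index is within bounds.
    std::array<bool, VF> mask{};
    for (std::size_t lane = 0; lane < VF; ++lane)
      mask[lane] = base + lane < n;

    // Inside the loop body base < n holds, so lane 0 is always active.
    assert(mask[0] && "first lane is active whenever the body executes");

    // Uniform load: a single unmasked scalar read. The scalar loop would
    // have read the same address on this iteration, so no mask is needed.
    const int kv = k;

    // The consecutive load and store do need the mask; this stands in for
    // a masked vector load/store.
    for (std::size_t lane = 0; lane < VF; ++lane)
      if (mask[lane])
        dst[base + lane] = src[base + lane] + kv;
  }
  return dst;
}
```

Both functions should produce the same result for any input length, including lengths that are not a multiple of `VF`. Only the accesses to `src` and `dst` depend on the mask.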
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D112552/new/
https://reviews.llvm.org/D112552