[PATCH] D79100: [LV][TTI] Emit new IR intrinsic llvm.get.active.mask for tail-folded loops
Eli Friedman via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed May 20 14:18:43 PDT 2020
efriedma added a comment.
I think all Simon is asking for is that the first argument of the intrinsic should be a scalar, equal to the first lane of the vector induction variable. How much work would that be?
================
Comment at: llvm/lib/Transforms/Vectorize/LoopVectorize.cpp:6812
// Introduce the early-exit compare IV <= BTC to form header block mask.
// This is used instead of IV < TC because TC may wrap, unlike BTC.
// Start by constructing the desired canonical IV.
----------------
It would be nice to explicitly note here that we're assuming the vector factor is a power of two (so the induction variable can't wrap).
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D79100/new/
https://reviews.llvm.org/D79100
More information about the llvm-commits
mailing list