[all-commits] [llvm/llvm-project] 240aa9: [LoopVectorize] Simplify scalar cost calculation i...
david-arm via All-commits
all-commits at lists.llvm.org
Fri Mar 26 04:27:32 PDT 2021
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 240aa96cf25d880dde7a0db5d96918cfaa4b8891
https://github.com/llvm/llvm-project/commit/240aa96cf25d880dde7a0db5d96918cfaa4b8891
Author: David Sherwood <david.sherwood at arm.com>
Date: 2021-03-26 (Fri, 26 Mar 2021)
Changed paths:
M llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
M llvm/test/Transforms/LoopVectorize/AArch64/no_vector_instructions.ll
Log Message:
-----------
[LoopVectorize] Simplify scalar cost calculation in getInstructionCost
This patch simplifies the calculation of certain costs in
getInstructionCost when isScalarAfterVectorization() returns a true value.
There are a few places where we multiply a cost by a number N, i.e.
unsigned N = isScalarAfterVectorization(I, VF) ? VF.getKnownMinValue() : 1;
return N * TTI.getArithmeticInstrCost(...
After some investigation it seems that there are only these cases that occur
in practice:
1. VF is a scalar, in which case N = 1.
2. VF is a vector. We can only get here if: a) the instruction is a
GEP/bitcast with scalar uses, or b) this is an update to an induction variable
that remains scalar.
I have changed the code so that N is assumed to always be 1. For GEPs
the cost is always 0, since this is calculated later on as part of the
load/store cost. For all other cases I have added an assert that none of the
users needs scalarising, which didn't fire in any unit tests.
Only one test required fixing and I believe the original cost for the scalar
add instruction to have been wrong, since only one copy remains after
vectorisation.
Differential Revision: https://reviews.llvm.org/D98512
More information about the All-commits
mailing list