[PATCH] D78216: [TTI] Add DemandedElts to getScalarizationOverhead
Simon Pilgrim via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Apr 17 13:32:28 PDT 2020
RKSimon marked an inline comment as done.
RKSimon added inline comments.
================
Comment at: llvm/lib/Target/X86/X86TargetTransformInfo.cpp:2691
+ Cost += (PowerOf2Ceil(NumSubVecs) - 1) * LT.first;
+ Cost += DemandedElts.countPopulation();
+ }
----------------
craig.topper wrote:
> What about the 0 cost for fp insertion into element 0 of each subvector?
Nice catch - I'll update the patch tomorrow.
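For illustration only, here is a minimal standalone sketch of the adjustment being discussed: the per-element term counts demanded insertions, but an fp insertion into element 0 of each subvector is treated as free. This is not the updated patch. The helper name countChargedInserts, the plain 64-bit mask standing in for DemandedElts, and the EltsPerSubVec parameter are all assumptions made for the example.

```cpp
#include <cstdint>

// Hypothetical helper, not an LLVM API. DemandedMask has bit I set when
// element I of the vector is demanded (assumes NumElts <= 64 and
// EltsPerSubVec > 0). Returns the number of insertions that are charged a
// cost, skipping lane 0 of every subvector when the element type is
// floating point.
unsigned countChargedInserts(uint64_t DemandedMask, unsigned NumElts,
                             unsigned EltsPerSubVec, bool IsFloatingPoint) {
  unsigned Charged = 0;
  for (unsigned I = 0; I < NumElts; ++I) {
    // Elements that are not demanded never contribute.
    if (!(DemandedMask & (uint64_t(1) << I)))
      continue;
    // Assumed rule from the review comment: an fp insert into element 0 of
    // a subvector costs nothing.
    if (IsFloatingPoint && (I % EltsPerSubVec) == 0)
      continue;
    ++Charged;
  }
  return Charged;
}
```

With 8 demanded elements split into two 4-element subvectors, this charges 6 insertions for a floating-point type and 8 for an integer type, whereas a plain population count would charge 8 in both cases.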
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D78216/new/
https://reviews.llvm.org/D78216