[PATCH] D111031: [WIP][X86][Costmodel] Load/store i64/f64 Stride=3 VF=16 interleaving costs
Roman Lebedev via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Oct 3 14:38:29 PDT 2021
lebedev.ri added a comment.
In D111031#3039005 <https://reviews.llvm.org/D111031#3039005>, @RKSimon wrote:
> We are quite limited - even adding all xmm0 to the llvm-mca asm capture (~{xmm0} etc.) has very little effect - short of hand coding something, I think what you have here is at least a realistic magnitude vaue.
Right.
> I'm happy to accept this if you want.
Hmm, what if we omit the epilogue stores in the load case: https://godbolt.org/z/43ao6qYvK -- does load interleaving cost of `18` seem more/less realistic?
It would be good to also ignore the prologue loads, since we'll cost them twice
And likewise for interleaved store case: https://godbolt.org/z/1EjY3fqPG -- does interleaving store cost of `18` seem more/less realistic?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D111031/new/
https://reviews.llvm.org/D111031
More information about the llvm-commits
mailing list