[PATCH] D40008: [X86][TTI] update costs of interleaved load\store of i64\double

Roman Lebedev via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Apr 26 00:48:25 PDT 2021


lebedev.ri added a comment.
Herald added a subscriber: pengfei.
Herald added a project: LLVM.

@RKSimon @magabari I'd like to add some more tuples, but i have a question: how are the costs actually derived?
For example, the assembly for interleaved load of i16 w/ stride 2: https://godbolt.org/z/hjb3d5x6E
What's it cost? I'm guessing it's not just `10`, aka the instruction count excluding the loads/stores?
Is it 4 from `Block RThroughput: 3.3` from MCA: https://godbolt.org/z/3eKcrdP4f ?
Which CPU should be used for these numbers?


Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D40008/new/

https://reviews.llvm.org/D40008



More information about the llvm-commits mailing list