[all-commits] [llvm/llvm-project] 2005ae: [X86][SLM] WriteVecIMul instructions only take 1uo...

Simon Pilgrim via All-commits all-commits at lists.llvm.org
Sat Sep 4 07:10:11 PDT 2021


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 2005ae15a66dd5d8a9845f3652192a70bd36d921
      https://github.com/llvm/llvm-project/commit/2005ae15a66dd5d8a9845f3652192a70bd36d921
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2021-09-04 (Sat, 04 Sep 2021)

  Changed paths:
    M llvm/lib/Target/X86/X86ScheduleSLM.td
    M llvm/test/CodeGen/X86/slow-pmulld.ll
    M llvm/test/tools/llvm-mca/X86/SLM/resources-sse2.s
    M llvm/test/tools/llvm-mca/X86/SLM/resources-sse41.s
    M llvm/test/tools/llvm-mca/X86/SLM/resources-ssse3.s

  Log Message:
  -----------
  [X86][SLM] WriteVecIMul instructions only take 1uop (REAPPLIED)

The xmm variant have half the throughput (and +1cy latency) of the mmx variants, but are still 1uop.

I still need to do more thorough testing of SLM on test-suite before fixing the obvious bad numbers for WritePMULLD.

But this helps the D103695 helper script get to more accurate numbers for vXi32 multiplies of extended operands (i.e. we can use PMADDWD, PMULLW/PMULHW etc). Matches what Intel AoM / Agner / llvm-exegesis reports.


  Commit: cb8d96e72f4c41b86738a5347a5c15e98037f358
      https://github.com/llvm/llvm-project/commit/cb8d96e72f4c41b86738a5347a5c15e98037f358
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2021-09-04 (Sat, 04 Sep 2021)

  Changed paths:
    M llvm/lib/ProfileData/SampleProfReader.cpp

  Log Message:
  -----------
  Fix Wdocumentation unknown parameter warning. NFCI.


Compare: https://github.com/llvm/llvm-project/compare/ac51d6920871...cb8d96e72f4c


More information about the All-commits mailing list