[all-commits] [llvm/llvm-project] 0c9c92: [X86][XOP] Tidyup VPHADD/VPHSUB unary horizontal o...

Simon Pilgrim via All-commits all-commits at lists.llvm.org
Thu Mar 3 04:08:10 PST 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 0c9c92ffc0acde32376b7ba8dcbe7fec90de3ff3
      https://github.com/llvm/llvm-project/commit/0c9c92ffc0acde32376b7ba8dcbe7fec90de3ff3
  Author: Simon Pilgrim <llvm-dev at redking.me.uk>
  Date:   2022-03-03 (Thu, 03 Mar 2022)

  Changed paths:
    M llvm/lib/Target/X86/X86InstrXOP.td
    M llvm/test/tools/llvm-mca/X86/Generic/resources-xop.s

  Log Message:
  -----------
  [X86][XOP] Tidyup VPHADD/VPHSUB unary horizontal ops default schedule class

Based off Agner and AMD SoG tables, the XOP VPHADD/VPHSUB unary horizontal ops are as fast as basic arithmetic ops, not the slower SSSE3 binary horizontal add/sub ops. This also matches what the bdver2 model already lists.

Noticed while investigating reduction add optimizations.




More information about the All-commits mailing list