[PATCH] D110588: [X86] decomposeMulByConstant - decompose legal vXi32 mutliplies on SlowPMULLD targets

Craig Topper via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Oct 1 09:26:33 PDT 2021


craig.topper added a comment.

In D110588#3036506 <https://reviews.llvm.org/D110588#3036506>, @RKSimon wrote:

> ping - any thoughts? should we do not do this at all? limit this to SlowPMULLD or always do it for vXi32 and vXi64?

I think it makes sense for SlowPMULLD. I think this makes sense for vXi32 on Haswell and later Intel Core CPUs. That's where it went from one uop to two serialize 5 cycle uops.  I think it makes sense for vXi64 on all Intel CPUs with PMULLQ since that instruction is 3 serialized uops.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D110588/new/

https://reviews.llvm.org/D110588



More information about the llvm-commits mailing list