[PATCH] D57789: [CGP] form usub with overflow from sub+icmp

Guozhi Wei via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Mar 14 11:47:52 PDT 2019


Carrot added a comment.

This patch causes 5% regression of one of our eigen benchmarks on Haswell.

The problem is when it combines the CMP in a hot block with SUB in a cold block into a single SUB in hot block, on a two address architecture like x86, if the operand of CMP has other uses, it needs to make an extra COPY before the original CMP, so there is one more instruction in hot block.

Another patch r355823 papered over the problem in our code, but it didn't fix the root cause.

The regression is only observed on Haswell, it doesn't impact Skylake.


Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D57789/new/

https://reviews.llvm.org/D57789





More information about the llvm-commits mailing list