[PATCH] D57789: [CGP] form usub with overflow from sub+icmp
Guozhi Wei via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Mar 14 11:47:52 PDT 2019
Carrot added a comment.
This patch causes a 5% regression in one of our Eigen benchmarks on Haswell.
The problem arises when the transform combines a CMP in a hot block with a SUB in a cold block into a single SUB in the hot block. On a two-address architecture like x86, SUB overwrites its destination operand while CMP does not, so if the operand of the CMP has other uses, an extra COPY is needed before the original CMP. The hot block therefore ends up with one more instruction.
Another patch, r355823, papered over the problem in our code, but it did not fix the root cause.
The regression is observed only on Haswell; it does not impact Skylake.
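For illustration, here is a rough source-level sketch of the shape described above. It is not taken from the benchmark or from the patch: the function names `before` and `after` are hypothetical, and `__builtin_sub_overflow` (a GCC/Clang builtin) only stands in for the usub-with-overflow form that CGP creates. The actual instruction selection and register allocation depend on the compiler version and target.

```cpp
#include <cstdint>

// Hypothetical "before" shape: the hot path only compares.
// The subtract sits on the (assumed) cold path.
std::uint64_t before(std::uint64_t a, std::uint64_t b) {
    if (a >= b) {
        return a;      // hot: 'a' is used again after the compare
    }
    return a - b;      // cold: wrapped unsigned difference
}

// Hypothetical "after" shape: one subtract-with-borrow-flag feeds both the
// branch and the cold-path result, so the subtract now runs on the hot path.
// Because 'a' is still needed afterwards, a two-address SUB would have to
// work on a copy of 'a' rather than clobber it.
std::uint64_t after(std::uint64_t a, std::uint64_t b) {
    std::uint64_t diff;
    bool borrow = __builtin_sub_overflow(a, b, &diff);  // true when a < b
    if (!borrow) {
        return a;      // hot: original 'a' must still be live here
    }
    return diff;       // cold: same wrapped difference as before
}
```

Both functions compute the same values; the difference is only where the subtract executes and whether `a` must be preserved across it.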
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D57789/new/
https://reviews.llvm.org/D57789