[PATCH] D28314: Change sqrt partial inlining to depend on sqrt argument rather than result.

Simon Pilgrim via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jan 5 03:46:56 PST 2017


RKSimon added a subscriber: spatel.
RKSimon added a comment.

As mentioned on PR31455, on X86 btver2, this changes goes from being slower to gcc (88cy vs 84cy) (which hoists the sqrtsd) to actually being slightly faster (82cy). This is for a tight loop of ::sqrt() calls across an array of 65535 pre-randomized doubles (~10% of which use the sqrt call and the rest use sqrtsd). This will be mainly due to reduced speculative usage of the FSQRT unit.


https://reviews.llvm.org/D28314





More information about the llvm-commits mailing list