[PATCH] [AArch64] Lower sdiv x, pow2 using add + select + shift.
james at jamesmolloy.co.uk
Sun Jul 13 11:58:56 PDT 2014
Indeed - my testing showed that the branched version was no faster than the
csel version on an OoO core anyway, so I was not advocating the branched
On 10 July 2014 11:26, Silviu Baranga <silviu.baranga at gmail.com> wrote:
> It seems to me that the branch mispredict cost for the case where the
> values of X are random would outweigh the benefits of this transformation
> for your alternative code sequence, even on OoO cores.
> I don't think it would entirely ok to make that assumption here (X >= 0
> This point obviously doesn't matter for the csel solution.
> llvm-commits mailing list
> llvm-commits at cs.uiuc.edu
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the llvm-commits