nikic wrote: > > Codegen looks a bit worse > > mul typically takes ~3 cycles. Do you mean we should revert the transform to reduce code size? Hm, good point. Then it's fine, at least for architectures with cmov. https://github.com/llvm/llvm-project/pull/84792