<div dir="ltr">Clang's -target option is supposed to take a CPU type and an operating system, so "-target i386" gives it no operating system. That prevents frame pointer elimination, which is why ebp is being set up. If you pass "-target i386-linux" you get slightly better code.<br><div><br></div><div>The division/remainder operations are turned into library calls as part of instruction selection. That code is somewhat independent of how other calls are handled, and it probably doesn't support tail calls. Is it really realistic that a user would have a non-inlined function that contains just a division? Why should we optimize for that case?</div><div><div><br clear="all"><div><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature">~Craig</div></div><br></div></div></div><br><div class="gmail_quote"><div dir="ltr">On Sat, Dec 1, 2018 at 9:37 AM Stefan Kanthak via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Compile the following functions with "-O3 -target i386"<br>
(see <<a href="https://godbolt.org/z/VmKlXL" rel="noreferrer" target="_blank">https://godbolt.org/z/VmKlXL</a>>):<br>
<br>
long long div(long long foo, long long bar)<br>
{<br>
return foo / bar;<br>
}<br>
<br>
On the left the generated code; on the right the expected,<br>
properly optimised code:<br>
<br>
div: # @div<br>
push ebp |<br>
mov ebp, esp |<br>
push dword ptr [ebp + 20] |<br>
push dword ptr [ebp + 16] |<br>
push dword ptr [ebp + 12] |<br>
push dword ptr [ebp + 8] |<br>
call __divdi3 | jmp __divdi3<br>
add esp, 16 |<br>
pop ebp |<br>
ret |<br>
<br>
<br>
long long mod(long long foo, long long bar)<br>
{<br>
return foo % bar;<br>
}<br>
<br>
mod: # @mod<br>
push ebp |<br>
mov ebp, esp |<br>
push dword ptr [ebp + 20] |<br>
push dword ptr [ebp + 16] |<br>
push dword ptr [ebp + 12] |<br>
push dword ptr [ebp + 8] |<br>
call __moddi3 | jmp __moddi3<br>
add esp, 16 |<br>
pop ebp |<br>
ret |<br>
<br>
<br>
long long mul(long long foo, long long bar)<br>
{<br>
return foo * bar;<br>
}<br>
<br>
mul: # @mul<br>
push ebp<br>
mov ebp, esp<br>
push esi<br>
mov ecx, dword ptr [ebp + 16]<br>
mov esi, dword ptr [ebp + 8]<br>
mov eax, ecx<br>
imul ecx, dword ptr [ebp + 12]<br>
mul esi<br>
imul esi, dword ptr [ebp + 20]<br>
add edx, ecx<br>
add edx, esi<br>
pop esi<br>
pop ebp<br>
ret<br>
</blockquote></div>
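<div>For anyone who wants to reproduce the comparison, here is a minimal sketch in C. The file name, the function names div64, mod64 and scale, and the exact command line in the comment are assumptions for illustration; only the "-target i386-linux" triple comes from the reply above. The scale function is a hypothetical example of division used inside a larger function, where the library call is not the last thing the function does.</div>

```c
/* Hypothetical file name: div64.c.
   The quoted message named its functions div and mod; they are renamed
   here only to avoid clashing with div() from <stdlib.h>. */

/* Assumed invocation (standard Clang flags plus the triple suggested above):
     clang -O3 -target i386-linux -S -masm=intel div64.c */

/* 64-bit division on a 32-bit x86 target: lowered to a call to __divdi3. */
long long div64(long long foo, long long bar)
{
    return foo / bar;
}

/* 64-bit remainder on a 32-bit x86 target: lowered to a call to __moddi3. */
long long mod64(long long foo, long long bar)
{
    return foo % bar;
}

/* Illustrative only: computes value * num / den without forming the full
   product first. The divisions sit in the middle of the computation, so
   their library calls are not in tail position. */
long long scale(long long value, long long num, long long den)
{
    return (value / den) * num + (value % den) * num / den;
}
```

<div>Comparing the assembly for "-target i386" and "-target i386-linux" should show whether the frame pointer setup is still emitted; the exact output depends on the Clang version.</div>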