<html>

    <head>

      <base href="https://bugs.llvm.org/">

    </head>

    <body><table border="1" cellspacing="0" cellpadding="8">

        <tr>

          <th>Bug ID</th>

          <td><a class="bz_bug_link 

          bz_status_NEW "

   title="NEW - [x86] default codegen should be more branchy"

   href="https://bugs.llvm.org/show_bug.cgi?id=33013">33013</a>

          </td>

        </tr>

        <tr>

          <th>Summary</th>

          <td>[x86] default codegen should be more branchy

          </td>

        </tr>

        <tr>

          <th>Product</th>

          <td>libraries

          </td>

        </tr>

        <tr>

          <th>Version</th>

          <td>trunk

          </td>

        </tr>

        <tr>

          <th>Hardware</th>

          <td>PC

          </td>

        </tr>

        <tr>

          <th>OS</th>

          <td>All

          </td>

        </tr>

        <tr>

          <th>Status</th>

          <td>NEW

          </td>

        </tr>

        <tr>

          <th>Severity</th>

          <td>enhancement

          </td>

        </tr>

        <tr>

          <th>Priority</th>

          <td>P

          </td>

        </tr>

        <tr>

          <th>Component</th>

          <td>Backend: X86

          </td>

        </tr>

        <tr>

          <th>Assignee</th>

          <td>unassignedbugs@nondot.org

          </td>

        </tr>

        <tr>

          <th>Reporter</th>

          <td>spatel+llvm@rotateright.com

          </td>

        </tr>

        <tr>

          <th>CC</th>

          <td>llvm-bugs@lists.llvm.org

          </td>

        </tr></table>

      <p>

        <div>

        <pre>The default x86 target is something like an Intel big core (ie, it's good at

absorbing the cost of correctly predicted branches, good at predicting those

branches, and good at speculating execution past those branches).

Therefore, we shouldn't favor cmov codegen for IR select as much as we

currently do. Example:

int foo(float x) {

  if (x < 42.0f)

    return x;

  return 12;

}

define i32 @foo(float %x) {

  %cmp = fcmp olt float %x, 4.200000e+01

  %conv = fptosi float %x to i32

  %ret = select i1 %cmp, i32 %conv, i32 12

  ret i32 %ret

}

$ clang -O2 cmovfp.c -S -o -

    movss    LCPI0_0(%rip), %xmm1    ## xmm1 = mem[0],zero,zero,zero

    ucomiss    %xmm0, %xmm1

    cvttss2si    %xmm0, %ecx

    movl    $12, %eax

    cmoval    %ecx, %eax

    retq

Note that gcc and icc will use compare and branch on this example.</pre>

        </div>

      </p>

      <hr>

      <span>You are receiving this mail because:</span>

      <ul>

          <li>You are on the CC list for the bug.</li>

      </ul>

    </body>

</html>