[LLVMbugs] [Bug 11674] New: Codegen for vector float->double cast fails on x86 above SSE3

Wed Dec 28 12:23:09 PST 2011

http://llvm.org/bugs/show_bug.cgi?id=11674

             Bug #: 11674
           Summary: Codegen for vector float->double cast fails on x86
                    above SSE3
           Product: libraries
           Version: trunk
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P
         Component: Backend: X86
        AssignedTo: unassignedbugs at nondot.org
        ReportedBy: jrk at csail.mit.edu
                CC: llvmbugs at cs.uiuc.edu
    Classification: Unclassified

Created attachment 7818
  --> http://llvm.org/bugs/attachment.cgi?id=7818
ll code which tickles error

I've isolated a bug in SSE codegen to the attached example.

    define void @f(<2 x float>* %in, <2 x double>* %out) {
    entry:
      %0 = load <2 x float>* %in, align 8
      %1 = fpext <2 x float> %0 to <2 x double>
      store <2 x double> %1, <2 x double>* %out, align 1
      ret void
    }

The code should load a <2 x float> vector from %in, fpext cast it to a
<2 x double>, and do an unaligned store (movupd) of the result to %out. This
works as expected on earlier SSE targets, generating this with llc -mcpu=core2:

    movss    (%rdi), %xmm1
    movss    4(%rdi), %xmm0
    cvtss2sd    %xmm0, %xmm0
    cvtss2sd    %xmm1, %xmm1
    unpcklpd    %xmm0, %xmm1    ## xmm1 = xmm1[0],xmm0[0]
    movupd    %xmm1, (%rsi)
    ret

Load both, cast float to double (cvtss2sd), pack vectors, and store.

But with llc -mcpu=penryn or greater, it yields nonsense:

    movq    (%rdi), %xmm0
    pshufd    $16, %xmm0, %xmm0       ## xmm0 = xmm0[0,0,1,0]
    movdqu    %xmm0, (%rsi)
    ret

-- 
Configure bugmail: http://llvm.org/bugs/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.