[llvm-bugs] [Bug 34370] [X86][AVX512] suboptimal shuffle sequence instead of one vpermps instruction

via llvm-bugs llvm-bugs at lists.llvm.org
Mon Jan 22 04:33:07 PST 2018


https://bugs.llvm.org/show_bug.cgi?id=34370

Simon Pilgrim <llvm-dev at redking.me.uk> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |FIXED
             Status|NEW                         |RESOLVED

--- Comment #1 from Simon Pilgrim <llvm-dev at redking.me.uk> ---
Current codegen combines the two shuffles and the select into a single binary
shuffle:

    vmovaps (%rdi), %ymm2
    vmovaps {{.*#+}} ymm1 = [7,6,3,11,7,6,14,15]
    vpermi2ps %ymm0, %ymm2, %ymm1
    vmovaps %ymm1, %ymm0
    retq

Added a test case at rL323106
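
For readers unfamiliar with the instruction, here is a minimal scalar sketch of
what the 256-bit vpermi2ps above computes. It is only an illustrative model, not
LLVM code: the function name vpermi2ps_256 and the lane labels m0..m7 (the value
loaded from (%rdi) into ymm2) and a0..a7 (the incoming ymm0) are made up for this
example. It assumes the AT&T operand order shown above, where ymm1 holds the
indices and receives the result, ymm2 is the first table, and ymm0 is the second
table. Bit 3 of each index picks the table and bits 2:0 pick the lane.

```python
def vpermi2ps_256(index, table1, table2):
    """Scalar model of a 256-bit two-table float permute (illustrative only).

    index  -- 8 selector values (the register that is also overwritten)
    table1 -- 8 lanes selected when bit 3 of the selector is clear
    table2 -- 8 lanes selected when bit 3 of the selector is set
    """
    assert len(index) == len(table1) == len(table2) == 8
    result = []
    for sel in index:
        lane = sel & 0x7                      # bits 2:0 choose the lane
        source = table2 if sel & 0x8 else table1  # bit 3 chooses the table
        result.append(source[lane])
    return result


# Hypothetical lane labels: m* came from memory (ymm2), a* from the argument (ymm0).
mem = [f"m{i}" for i in range(8)]
arg = [f"a{i}" for i in range(8)]
index = [7, 6, 3, 11, 7, 6, 14, 15]

print(vpermi2ps_256(index, mem, arg))
```

Under these assumptions, lanes 3, 6 and 7 of the result come from the register
argument and the rest come from the loaded vector, which is how one instruction
can cover both the shuffles and the select.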
