<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
        {font-family:Helvetica;
        panose-1:2 11 6 4 2 2 2 2 2 4;}
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Consolas;
        panose-1:2 11 6 9 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman",serif;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
pre
        {mso-style-priority:99;
        mso-style-link:"HTML Preformatted Char";
        margin:0in;
        margin-bottom:.0001pt;
        font-size:10.0pt;
        font-family:"Courier New";}
span.apple-converted-space
        {mso-style-name:apple-converted-space;}
span.apple-tab-span
        {mso-style-name:apple-tab-span;}
span.HTMLPreformattedChar
        {mso-style-name:"HTML Preformatted Char";
        mso-style-priority:99;
        mso-style-link:"HTML Preformatted";
        font-family:Consolas;}
span.EmailStyle21
        {mso-style-type:personal-reply;
        font-family:"Calibri",sans-serif;
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page WordSection1
        {size:8.5in 11.0in;
        margin:1.0in 1.25in 1.0in 1.25in;}
div.WordSection1
        {page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="blue" vlink="purple">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Hi Artur.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Unfortunately, this is taking longer than I expected, as some prerequisite commits need to land first.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">We came to the conclusion (not only because of your issue) that we first need to modify the X86 code alignment to remove effects caused by layout changes.
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">It is currently under review; see
<a href="https://reviews.llvm.org/D39840">https://reviews.llvm.org/D39840</a><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">At the same time, we are fixing basic scheduling issues that are already present in Haswell. See:
<a href="https://reviews.llvm.org/D40021">https://reviews.llvm.org/D40021</a><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Having said that, I hope to be able to return to this next week.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Artur, would it be possible to take the latest patch from
<a href="https://reviews.llvm.org/D40021">https://reviews.llvm.org/D40021</a> and re-run it on your configuration to see if the regression is still the same?<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Are you able to run it only on an SKL machine?<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Thanks & best regards,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Gadi.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><a name="_MailEndCompose"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"><o:p> </o:p></span></a></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><a name="_____replyseparator"></a><b><span style="font-size:11.0pt;font-family:"Calibri",sans-serif">From:</span></b><span style="font-size:11.0pt;font-family:"Calibri",sans-serif"> Artur Pilipenko [mailto:apilipenko@azul.com]
<br>
<b>Sent:</b> Wednesday, November 15, 2017 12:43<br>
<b>To:</b> Haber, Gadi <gadi.haber@intel.com><br>
<b>Cc:</b> llvm-commits@lists.llvm.org<br>
<b>Subject:</b> Re: [llvm] r316492 - [X86][Broadwell] Added the instruction scheduling information for the Broadwell CPU.<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Hi Gadi, <o:p></o:p></p>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">Do you have any updates on the regression?<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">Artur<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<p class="MsoNormal">On 31 Oct 2017, at 13:17, Haber, Gadi <<a href="mailto:gadi.haber@intel.com">gadi.haber@intel.com</a>> wrote:<o:p></o:p></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Thank you Artur.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"> </span><o:p></o:p></p>
</div>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<div>
<p class="MsoNormal"><b><span style="font-size:11.0pt;font-family:"Calibri",sans-serif">From:</span></b><span class="apple-converted-space"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif"> </span></span><span style="font-size:11.0pt;font-family:"Calibri",sans-serif">Artur
 Pilipenko [<a href="mailto:apilipenko@azul.com"><span style="color:purple">mailto:apilipenko@azul.com</span></a>]<span class="apple-converted-space"> </span><br>
<b>Sent:</b><span class="apple-converted-space"> </span>Monday, October 30, 2017 21:48<br>
<b>To:</b><span class="apple-converted-space"> </span>Haber, Gadi <<a href="mailto:gadi.haber@intel.com"><span style="color:purple">gadi.haber@intel.com</span></a>><br>
<b>Cc:</b><span class="apple-converted-space"> </span><a href="mailto:llvm-commits@lists.llvm.org"><span style="color:purple">llvm-commits@lists.llvm.org</span></a><br>
<b>Subject:</b><span class="apple-converted-space"> </span>Re: [llvm] r316492 - [X86][Broadwell] Added the instruction scheduling information for the Broadwell CPU.</span><o:p></o:p></p>
</div>
</div>
</div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Here we go:<span class="apple-converted-space"> </span><o:p></o:p></p>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal">Before the change:<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30038231<span class="apple-converted-space"> </span> vmovsd<span class="apple-converted-space"> </span>(%r12,%rbp,8), %xmm0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0xc4c17b1004ec<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30038237<span class="apple-converted-space"> </span> vmulsd<span class="apple-converted-space"> </span>(%rax,%rcx,8), %xmm0, %xmm0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0xc5fb5904c8<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x3003823c<span class="apple-converted-space"> </span> movslq<span class="apple-converted-space"> </span>%r11d, %rdi<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x4963fb<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x3003823f<span class="apple-converted-space"> </span> vaddsd<span class="apple-converted-space"> </span>%xmm1, %xmm0, %xmm0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0xc5fb58c1<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30038243<span class="apple-converted-space"> </span> leaq<span class="apple-converted-space"> </span>1(%rcx), %rbp<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x488d6901<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30038247<span class="apple-converted-space"> </span> cmpq<span class="apple-converted-space"> </span>%rdi, %rbp<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x4839fd<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x3003824a<span class="apple-converted-space"> </span> jge<span class="apple-converted-space"> </span>148 ; ABS: 0x300382e4<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x0f8d94000000<o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal">After the change:<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30028231<span class="apple-converted-space"> </span> movslq<span class="apple-converted-space"> </span>%r11d, %rdi<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x4963fb<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30028234<span class="apple-converted-space"> </span> vmovsd<span class="apple-converted-space"> </span>(%r12,%rbp,8), %xmm0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0xc4c17b1004ec<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x3002823a<span class="apple-converted-space"> </span> vmulsd<span class="apple-converted-space"> </span>(%rax,%rcx,8), %xmm0, %xmm0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0xc5fb5904c8<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x3002823f<span class="apple-converted-space"> </span> vaddsd<span class="apple-converted-space"> </span>%xmm1, %xmm0, %xmm0<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0xc5fb58c1<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30028243<span class="apple-converted-space"> </span> leaq<span class="apple-converted-space"> </span>1(%rcx), %rbp<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x488d6901<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x30028247<span class="apple-converted-space"> </span> cmpq<span class="apple-converted-space"> </span>%rdi, %rbp<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x4839fd<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">0x3002824a<span class="apple-converted-space"> </span> jge<span class="apple-converted-space"> </span>148 ; ABS: 0x300282e4<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">0x0f8d94000000<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
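<div>
<p class="MsoNormal">The alignment question raised earlier in this thread can be checked mechanically from the two listings above. The following is a minimal Python sketch, not part of the original exchange: it derives each instruction's length from its encoding and reports which instructions straddle a 16-, 32- or 64-byte boundary. The helper names crossing_instructions and block_offset are hypothetical, and which boundary sizes actually matter on a given microarchitecture is an assumption to verify separately.<o:p></o:p></p>
</div>
<pre>
```python
# Addresses and encodings copied from the two listings in this thread.
BEFORE = [
    (0x30038231, "c4c17b1004ec"),  # vmovsd
    (0x30038237, "c5fb5904c8"),    # vmulsd
    (0x3003823c, "4963fb"),        # movslq
    (0x3003823f, "c5fb58c1"),      # vaddsd
    (0x30038243, "488d6901"),      # leaq
    (0x30038247, "4839fd"),        # cmpq
    (0x3003824a, "0f8d94000000"),  # jge
]
AFTER = [
    (0x30028231, "4963fb"),        # movslq
    (0x30028234, "c4c17b1004ec"),  # vmovsd
    (0x3002823a, "c5fb5904c8"),    # vmulsd
    (0x3002823f, "c5fb58c1"),      # vaddsd
    (0x30028243, "488d6901"),      # leaq
    (0x30028247, "4839fd"),        # cmpq
    (0x3002824a, "0f8d94000000"),  # jge
]


def crossing_instructions(listing, boundary):
    """Return addresses of instructions whose bytes straddle a boundary."""
    crossing = []
    for addr, encoding in listing:
        length = len(encoding) // 2  # two hex digits per byte
        last_byte = addr + length - 1
        if addr // boundary != last_byte // boundary:
            crossing.append(addr)
    return crossing


def block_offset(listing, boundary):
    """Offset of the first instruction within its boundary-sized window."""
    return listing[0][0] % boundary


if __name__ == "__main__":
    for name, listing in (("before", BEFORE), ("after", AFTER)):
        for boundary in (16, 32, 64):
            crossing = [hex(a) for a in crossing_instructions(listing, boundary)]
            print(name, boundary, block_offset(listing, boundary), crossing)
```
</pre>
<div>
<p class="MsoNormal">For these two listings, the block starts at the same offset within a 64-byte window in both cases, and vaddsd is the only instruction that straddles a boundary in either layout, so this particular check does not distinguish them.<o:p></o:p></p>
</div>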
<div>
<div>
<p class="MsoNormal">Artur<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<div>
<p class="MsoNormal">On 28 Oct 2017, at 17:23, Haber, Gadi <<a href="mailto:gadi.haber@intel.com"><span style="color:purple">gadi.haber@intel.com</span></a>> wrote:<o:p></o:p></p>
</div>
</div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
<div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Hi Artur,</span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"> </span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Is it possible to add the addresses of the instructions so I can make sure this is not a code alignment issue?</span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"> </span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Thanks so much!</span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D">Gadi.</span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D"> </span><o:p></o:p></p>
</div>
</div>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<div>
<div>
<p class="MsoNormal"><b><span style="font-size:11.0pt;font-family:"Calibri",sans-serif">From:</span></b><span class="apple-converted-space"><span style="font-size:11.0pt;font-family:"Calibri",sans-serif"> </span></span><span style="font-size:11.0pt;font-family:"Calibri",sans-serif">Artur
 Pilipenko [<a href="mailto:apilipenko@azul.com"><span style="color:purple">mailto:apilipenko@azul.com</span></a>]<span class="apple-converted-space"> </span><br>
<b>Sent:</b><span class="apple-converted-space"> </span>Thursday, October 26, 2017 19:36<br>
<b>To:</b><span class="apple-converted-space"> </span>Haber, Gadi <<a href="mailto:gadi.haber@intel.com"><span style="color:purple">gadi.haber@intel.com</span></a>><br>
<b>Cc:</b><span class="apple-converted-space"> </span><a href="mailto:llvm-commits@lists.llvm.org"><span style="color:purple">llvm-commits@lists.llvm.org</span></a>; Artur Pilipenko <<a href="mailto:apilipenko@azul.com"><span style="color:purple">apilipenko@azul.com</span></a>><br>
<b>Subject:</b><span class="apple-converted-space"> </span>Re: [llvm] r316492 - [X86][Broadwell] Added the instruction scheduling information for the Broadwell CPU.</span><o:p></o:p></p>
</div>
</div>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">Sorry, this was supposed to be a response to another commit of yours. The problematic change is <a href="https://reviews.llvm.org/rL315175"><span style="color:purple">https://reviews.llvm.org/rL315175</span></a><o:p></o:p></p>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal">Artur<o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<div>
<div>
<p class="MsoNormal">On 26 Oct 2017, at 19:33, Artur Pilipenko via llvm-commits <<a href="mailto:llvm-commits@lists.llvm.org"><span style="color:purple">llvm-commits@lists.llvm.org</span></a>> wrote:<o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<div>
<div>
<p class="MsoNormal">FYI, this change causes a regression in our internal performance testing on Skylake machines.<o:p></o:p></p>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal">This patch changes where a mov instruction (movslq) is scheduled in the hot loop, which results in a performance degradation of about 11%.<o:p></o:p></p>
</div>
</div>
<div>
<pre style="font-size:inherit;white-space:pre-wrap;font-variant-ligatures: normal;orphans: 2;widows: 2" id="comment_text_3">Before the change:<o:p></o:p></pre>
<pre style="font-size:inherit;white-space:pre-wrap;font-variant-ligatures: normal;orphans: 2;widows: 2" id="comment_text_3"># BB#18:                                #   in Loop: Header=BB0_13 Depth=2<o:p></o:p></pre>
<pre>  vmovsd  (%r12,%rbp,8), %xmm0    # xmm0 = mem[0],zero<o:p></o:p></pre>
<pre>  vmulsd  (%rax,%rcx,8), %xmm0, %xmm0<o:p></o:p></pre>
<pre>  movslq  %r11d, %rdi<o:p></o:p></pre>
<pre>  vaddsd  %xmm1, %xmm0, %xmm0<o:p></o:p></pre>
<pre>  leaq    1(%rcx), %rbp<o:p></o:p></pre>
<pre>  cmpq    %rdi, %rbp<o:p></o:p></pre>
<pre>  jge     .LBB0_30<o:p></o:p></pre>
<pre> <o:p></o:p></pre>
<pre>After the change:<o:p></o:p></pre>
<pre style="font-size:inherit;white-space:pre-wrap;font-variant-ligatures: normal;orphans: 2;widows: 2" id="comment_text_3"># BB#18:                                #   in Loop: Header=BB0_13 Depth=2<o:p></o:p></pre>
<pre>  movslq  %r11d, %rdi<o:p></o:p></pre>
<pre>  vmovsd  (%r12,%rbp,8), %xmm0    # xmm0 = mem[0],zero<o:p></o:p></pre>
<pre>  vmulsd  (%rax,%rcx,8), %xmm0, %xmm0<o:p></o:p></pre>
<pre>  vaddsd  %xmm1, %xmm0, %xmm0<o:p></o:p></pre>
<pre>  leaq    1(%rcx), %rbp<o:p></o:p></pre>
<pre>  cmpq    %rdi, %rbp<o:p></o:p></pre>
<pre>  jge     .LBB0_30<o:p></o:p></pre>
<div>
<div>
<div>
<p class="MsoNormal">Let me know if you need more information, e.g. the .ll file.<o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal">Artur<o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<div>
<div>
<p class="MsoNormal">On 24 Oct 2017, at 23:19, Gadi Haber via llvm-commits <<a href="mailto:llvm-commits@lists.llvm.org"><span style="color:purple">llvm-commits@lists.llvm.org</span></a>> wrote:<o:p></o:p></p>
</div>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<div>
<div>
<p class="MsoNormal">Author: gadi.haber<br>
Date: Tue Oct 24 13:19:47 2017<br>
New Revision: 316492<br>
<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project?rev=316492&view=rev"><span style="color:purple">http://llvm.org/viewvc/llvm-project?rev=316492&view=rev</span></a><br>
Log:<br>
[X86][Broadwell] Added the instruction scheduling information for the Broadwell CPU.<br>
<br>
Adding the scheduling information for the Broadwell (BDW) CPU target.<br>
<br>
This patch adds the instruction scheduling information for the Broadwell (BDW) architecture target by adding the file X86SchedBroadwell.td located under the X86 Target.<br>
We used the scheduling information retrieved from the Broadwell architects in order to create the file.<br>
The scheduling information includes latency, number of micro-Ops and used ports by each BDW instruction.<br>
<br>
The patch continues the scheduling replacement and insertion effort started with the SandyBridge (SNB) target in r310792, the Haswell (HSW) target in r311879, the SkylakeClient (SKL) target in rL313613 + rL315978 and the SkylakeServer (SKX) in rL315175.<br>
<br>
Performance fluctuations may be expected due to code alignment effects.<br>
<br>
Reviewers: zvi, RKSimon, craig.topper<br>
Differential Revision:<span class="apple-converted-space"> </span><a href="https://reviews.llvm.org/D39054"><span style="color:purple">https://reviews.llvm.org/D39054</span></a><br>
<br>
Change-Id: If6f799e5ff60e1091c8d43b05ea78c53581bae01<br>
<br>
Added:<br>
   llvm/trunk/lib/Target/X86/X86SchedBroadwell.td   (with props)<br>
Modified:<br>
   llvm/trunk/lib/Target/X86/X86.td<br>
   llvm/trunk/lib/Target/X86/X86Schedule.td<br>
   llvm/trunk/test/CodeGen/X86/aes-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/avx-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/avx2-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/bmi-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/bmi2-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/f16c-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/fma-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/lea32-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/lea64-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/lzcnt-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/mmx-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/movbe-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/popcnt-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/sse-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/sse2-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/sse3-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/sse41-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/sse42-schedule.ll<br>
   llvm/trunk/test/CodeGen/X86/ssse3-schedule.ll<br>
<br>
Modified: llvm/trunk/lib/Target/X86/X86.td<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86.td?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86.td?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/lib/Target/X86/X86.td (original)<br>
+++ llvm/trunk/lib/Target/X86/X86.td Tue Oct 24 13:19:47 2017<br>
@@ -576,7 +576,7 @@ def BDWFeatures : ProcessorFeatures<HSWF<br>
  FeatureADX,<br>
  FeatureRDSEED<br>
]>;<br>
-class BroadwellProc<string Name> : ProcModel<Name, HaswellModel,<br>
+class BroadwellProc<string Name> : ProcModel<Name, BroadwellModel,<br>
                                             BDWFeatures.Value, [<br>
  ProcIntelBDW<br>
]>;<br>
<br>
Added: llvm/trunk/lib/Target/X86/X86SchedBroadwell.td<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86SchedBroadwell.td?rev=316492&view=auto"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86SchedBroadwell.td?rev=316492&view=auto</span></a><br>
==============================================================================<br>
--- llvm/trunk/lib/Target/X86/X86SchedBroadwell.td (added)<br>
+++ llvm/trunk/lib/Target/X86/X86SchedBroadwell.td Tue Oct 24 13:19:47 2017<br>
@@ -0,0 +1,4076 @@<br>
+//=- X86SchedBroadwell.td - X86 Broadwell Scheduling ---------*- tablegen -*-=//<br>
+//<br>
+//                     The LLVM Compiler Infrastructure<br>
+//<br>
+// This file is distributed under the University of Illinois Open Source<br>
+// License. See LICENSE.TXT for details.<br>
+//<br>
+//===----------------------------------------------------------------------===//<br>
+//<br>
+// This file defines the machine model for Broadwell to support instruction<br>
+// scheduling and other instruction cost heuristics.<br>
+//<br>
+//===----------------------------------------------------------------------===//<br>
+def BroadwellModel : SchedMachineModel {<br>
+  // All x86 instructions are modeled as a single micro-op, and HW can decode 4<br>
+  // instructions per cycle.<br>
+  let IssueWidth = 4;<br>
+  let MicroOpBufferSize = 192; // Based on the reorder buffer.<br>
+  let LoadLatency = 5;<br>
+  let MispredictPenalty = 16;<br>
+<br>
+  // Based on the LSD (loop-stream detector) queue size and benchmarking data.<br>
+  let LoopMicroOpBufferSize = 50;<br>
+<br>
+  // This flag is set to allow the scheduler to assign a default model to<span class="apple-converted-space"> </span><br>
+  // unrecognized opcodes.<br>
+  let CompleteModel = 0;<br>
+}<br>
+<br>
+let SchedModel = BroadwellModel in {<br>
+<br>
+// Broadwell can issue micro-ops to 8 different ports in one cycle.<br>
+<br>
+// Ports 0, 1, 5, and 6 handle all computation.<br>
+// Port 4 gets the data half of stores. Store data can be available later than<br>
+// the store address, but since we don't model the latency of stores, we can<br>
+// ignore that.<br>
+// Ports 2 and 3 are identical. They handle loads and the address half of<br>
+// stores. Port 7 can handle address calculations.<br>
+def BWPort0 : ProcResource<1>;<br>
+def BWPort1 : ProcResource<1>;<br>
+def BWPort2 : ProcResource<1>;<br>
+def BWPort3 : ProcResource<1>;<br>
+def BWPort4 : ProcResource<1>;<br>
+def BWPort5 : ProcResource<1>;<br>
+def BWPort6 : ProcResource<1>;<br>
+def BWPort7 : ProcResource<1>;<br>
+<br>
+// Many micro-ops are capable of issuing on multiple ports.<br>
+def BWPort01  : ProcResGroup<[BWPort0, BWPort1]>;<br>
+def BWPort23  : ProcResGroup<[BWPort2, BWPort3]>;<br>
+def BWPort237 : ProcResGroup<[BWPort2, BWPort3, BWPort7]>;<br>
+def BWPort04  : ProcResGroup<[BWPort0, BWPort4]>;<br>
+def BWPort05  : ProcResGroup<[BWPort0, BWPort5]>;<br>
+def BWPort06  : ProcResGroup<[BWPort0, BWPort6]>;<br>
+def BWPort15  : ProcResGroup<[BWPort1, BWPort5]>;<br>
+def BWPort16  : ProcResGroup<[BWPort1, BWPort6]>;<br>
+def BWPort56  : ProcResGroup<[BWPort5, BWPort6]>;<br>
+def BWPort015 : ProcResGroup<[BWPort0, BWPort1, BWPort5]>;<br>
+def BWPort056 : ProcResGroup<[BWPort0, BWPort5, BWPort6]>;<br>
+def BWPort0156: ProcResGroup<[BWPort0, BWPort1, BWPort5, BWPort6]>;<br>
+<br>
+// 60 Entry Unified Scheduler<br>
+def BWPortAny : ProcResGroup<[BWPort0, BWPort1, BWPort2, BWPort3, BWPort4,<br>
+                              BWPort5, BWPort6, BWPort7]> {<br>
+  let BufferSize=60;<br>
+}<br>
+<br>
+// Loads are 5 cycles, so ReadAfterLd registers needn't be available until 5<br>
+// cycles after the memory operand.<br>
+def : ReadAdvance<ReadAfterLd, 5>;<br>
+<br>
+// Many SchedWrites are defined in pairs with and without a folded load.<br>
+// Instructions with folded loads are usually micro-fused, so they only appear<br>
+// as two micro-ops when queued in the reservation station.<br>
+// This multiclass defines the resource usage for variants with and without<br>
+// folded loads.<br>
+multiclass BWWriteResPair<X86FoldableSchedWrite SchedRW,<br>
+                          ProcResourceKind ExePort,<br>
+                          int Lat> {<br>
+  // Register variant is using a single cycle on ExePort.<br>
+  def : WriteRes<SchedRW, [ExePort]> { let Latency = Lat; }<br>
+<br>
+  // Memory variant also uses a cycle on port 2/3 and adds 5 cycles to the<br>
+  // latency.<br>
+  def : WriteRes<SchedRW.Folded, [BWPort23, ExePort]> {<br>
+     let Latency = !add(Lat, 5);<br>
+  }<br>
+}<br>
+<br>
+// A folded store needs a cycle on port 4 for the store data, but it does not<br>
+// need an extra port 2/3 cycle to recompute the address.<br>
+def : WriteRes<WriteRMW, [BWPort4]>;<br>
+<br>
+// Arithmetic.<br>
+defm : BWWriteResPair<WriteALU,   BWPort0156, 1>; // Simple integer ALU op.<br>
+defm : BWWriteResPair<WriteIMul,  BWPort1,   3>; // Integer multiplication.<br>
+def : WriteRes<WriteIMulH, []> { let Latency = 3; } // Integer multiplication, high part.<br>
+def BWDivider : ProcResource<1>; // Integer division issued on port 0.     <br>
+def : WriteRes<WriteIDiv, [BWPort0, BWDivider]> { // Integer division.<br>
+  let Latency = 25;<br>
+  let ResourceCycles = [1, 10];<br>
+}<br>
+def : WriteRes<WriteIDivLd, [BWPort23, BWPort0, BWDivider]> {<br>
+  let Latency = 29;<br>
+  let ResourceCycles = [1, 1, 10];<br>
+}<br>
+<br>
+def : WriteRes<WriteLEA, [BWPort15]>; // LEA instructions can't fold loads.<br>
+<br>
+// Integer shifts and rotates.<br>
+defm : BWWriteResPair<WriteShift, BWPort06,  1>;<br>
+<br>
+// Loads, stores, and moves, not folded with other operations.<br>
+def : WriteRes<WriteLoad,  [BWPort23]> { let Latency = 5; }<br>
+def : WriteRes<WriteStore, [BWPort237, BWPort4]>;<br>
+def : WriteRes<WriteMove,  [BWPort0156]>;<br>
+<br>
+// Idioms that clear a register, like xorps %xmm0, %xmm0.<br>
+// These can often bypass execution ports completely.<br>
+def : WriteRes<WriteZero,  []>;<br>
+<br>
+// Branches don't produce values, so they have no latency, but they still<br>
+// consume resources. Indirect branches can fold loads.<br>
+defm : BWWriteResPair<WriteJump,  BWPort06,   1>;<br>
+<br>
+// Floating point. This covers both scalar and vector operations.<br>
+defm : BWWriteResPair<WriteFAdd,   BWPort1, 3>; // Floating point add/sub/compare.<br>
+defm : BWWriteResPair<WriteFMul,   BWPort0, 5>; // Floating point multiplication.<br>
+defm : BWWriteResPair<WriteFDiv,   BWPort0, 12>; // 10-14 cycles. // Floating point division.<br>
+defm : BWWriteResPair<WriteFSqrt,  BWPort0, 15>; // Floating point square root.<br>
+defm : BWWriteResPair<WriteFRcp,   BWPort0, 5>; // Floating point reciprocal estimate.<br>
+defm : BWWriteResPair<WriteFRsqrt, BWPort0, 5>; // Floating point reciprocal square root estimate.<br>
+// defm WriteFMA    : X86SchedWritePair; // Fused Multiply Add.<br>
+defm : BWWriteResPair<WriteFShuffle,  BWPort5,  1>; // Floating point vector shuffles.<br>
+defm : BWWriteResPair<WriteFBlend,  BWPort015,  1>; // Floating point vector blends.<br>
+def : WriteRes<WriteFVarBlend, [BWPort5]> { // Fp vector variable blends.<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">      <br>
+  let Latency = 2;<br>
+  let ResourceCycles = [2];<br>
+}<span class="apple-converted-space"> </span><br>
+def : WriteRes<WriteFVarBlendLd, [BWPort5, BWPort23]> {<br>
+  let Latency = 6;<br>
+  let ResourceCycles = [2, 1];<br>
+}<br>
+<br>
+// FMA Scheduling helper class.<br>
+// class FMASC { X86FoldableSchedWrite Sched = WriteFAdd; }<br>
+<br>
+// Vector integer operations.<br>
+defm : BWWriteResPair<WriteVecALU,   BWPort15,  1>; // Vector integer ALU op, no logicals.<br>
+defm : BWWriteResPair<WriteVecShift, BWPort0,  1>; // Vector integer shifts.<br>
+defm : BWWriteResPair<WriteVecIMul,  BWPort0,   5>; // Vector integer multiply.<br>
+defm : BWWriteResPair<WriteShuffle,  BWPort5,  1>; // Vector shuffles.<br>
+defm : BWWriteResPair<WriteBlend,  BWPort15,  1>; // Vector blends.<br>
+<br>
+def : WriteRes<WriteVarBlend, [BWPort5]> { // Vector variable blends.<br>
+  let Latency = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def : WriteRes<WriteVarBlendLd, [BWPort5, BWPort23]> {<br>
+  let Latency = 6;<br>
+  let ResourceCycles = [2, 1];<br>
+}<br>
+<br>
+def : WriteRes<WriteMPSAD, [BWPort0, BWPort5]> { // Vector MPSAD.     <br>
+  let Latency = 6;<br>
+  let ResourceCycles = [1, 2];<br>
+}<br>
+def : WriteRes<WriteMPSADLd, [BWPort23, BWPort0, BWPort5]> {<br>
+  let Latency = 6;<br>
+  let ResourceCycles = [1, 1, 2];<br>
+}<br>
+<br>
+// Vector bitwise operations.<br>
+// These are often used on both floating point and integer vectors.<br>
+defm : BWWriteResPair<WriteVecLogic, BWPort015, 1>; // Vector and/or/xor.<br>
+<br>
+// Conversion between integer and float.<br>
+defm : BWWriteResPair<WriteCvtF2I, BWPort1, 3>; // Float -> Integer.<br>
+defm : BWWriteResPair<WriteCvtI2F, BWPort1, 4>; // Integer -> Float.<br>
+defm : BWWriteResPair<WriteCvtF2F, BWPort1, 3>; // Float -> Float size conversion.<br>
+<br>
+// Strings instructions.<br>
+// Packed Compare Implicit Length Strings, Return Mask<br>
+// String instructions.<br>
+def : WriteRes<WritePCmpIStrM, [BWPort0]> {<br>
+  let Latency = 10;<br>
+  let ResourceCycles = [3];<br>
+}<br>
+def : WriteRes<WritePCmpIStrMLd, [BWPort0, BWPort23]> {<br>
+  let Latency = 10;<br>
+  let ResourceCycles = [3, 1];<br>
+}<span class="apple-converted-space"> </span><br>
+// Packed Compare Explicit Length Strings, Return Mask<br>
+def : WriteRes<WritePCmpEStrM, [BWPort0, BWPort16, BWPort5]> {<br>
+  let Latency = 10;<br>
+  let ResourceCycles = [3, 2, 4];<br>
+}<br>
+def : WriteRes<WritePCmpEStrMLd, [BWPort05, BWPort16, BWPort23]> {<br>
+  let Latency = 10;<br>
+  let ResourceCycles = [6, 2, 1];<br>
+}<br>
+// Packed Compare Implicit Length Strings, Return Index<br>
+def : WriteRes<WritePCmpIStrI, [BWPort0]> {<br>
+  let Latency = 11;<br>
+  let ResourceCycles = [3];<br>
+}<br>
+def : WriteRes<WritePCmpIStrILd, [BWPort0, BWPort23]> {<br>
+  let Latency = 11;<br>
+  let ResourceCycles = [3, 1];<br>
+}<br>
+// Packed Compare Explicit Length Strings, Return Index<br>
+def : WriteRes<WritePCmpEStrI, [BWPort05, BWPort16]> {<br>
+  let Latency = 11;<br>
+  let ResourceCycles = [6, 2];<br>
+}<br>
+def : WriteRes<WritePCmpEStrILd, [BWPort0, BWPort16, BWPort5, BWPort23]> {<br>
+  let Latency = 11;<br>
+  let ResourceCycles = [3, 2, 2, 1];<br>
+}<br>
+<br>
+// AES instructions.<br>
+def : WriteRes<WriteAESDecEnc, [BWPort5]> { // Decryption, encryption.<br>
+  let Latency = 7;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def : WriteRes<WriteAESDecEncLd, [BWPort5, BWPort23]> {<br>
+  let Latency = 7;<br>
+  let ResourceCycles = [1, 1];<br>
+}<br>
+def : WriteRes<WriteAESIMC, [BWPort5]> { // InvMixColumn.<br>
+  let Latency = 14;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def : WriteRes<WriteAESIMCLd, [BWPort5, BWPort23]> {<br>
+  let Latency = 14;<br>
+  let ResourceCycles = [2, 1];<br>
+}<br>
+def : WriteRes<WriteAESKeyGen, [BWPort0, BWPort5]> { // Key Generation.<br>
+  let Latency = 10;<br>
+  let ResourceCycles = [2, 8];<br>
+}<br>
+def : WriteRes<WriteAESKeyGenLd, [BWPort0, BWPort5, BWPort23]> {<br>
+  let Latency = 10;<br>
+  let ResourceCycles = [2, 7, 1];<br>
+}<br>
+<br>
+// Carry-less multiplication instructions.<br>
+def : WriteRes<WriteCLMul, [BWPort0, BWPort5]> {<br>
+  let Latency = 7;<br>
+  let ResourceCycles = [2, 1];<br>
+}<br>
+def : WriteRes<WriteCLMulLd, [BWPort0, BWPort5, BWPort23]> {<br>
+  let Latency = 7;<br>
+  let ResourceCycles = [2, 1, 1];<br>
+}<br>
+<br>
+// Catch-all for expensive system instructions.<br>
+def : WriteRes<WriteSystem,     [BWPort0156]> { let Latency = 100; } // def WriteSystem : SchedWrite;<br>
+<br>
+// AVX2.<br>
+defm : BWWriteResPair<WriteFShuffle256,  BWPort5,  3>; // Fp 256-bit width vector shuffles.<br>
+defm : BWWriteResPair<WriteShuffle256,  BWPort5,  3>;  // 256-bit width vector shuffles.<br>
+def : WriteRes<WriteVarVecShift, [BWPort0, BWPort5]> { // Variable vector shifts.<br>
+  let Latency = 2;<br>
+  let ResourceCycles = [2, 1];<br>
+}<br>
+def : WriteRes<WriteVarVecShiftLd, [BWPort0, BWPort5, BWPort23]> {<br>
+  let Latency = 6;<br>
+  let ResourceCycles = [2, 1, 1];<br>
+}<br>
+<br>
+// Old microcoded instructions that nobody uses.<br>
+def : WriteRes<WriteMicrocoded, [BWPort0156]> { let Latency = 100; } // def WriteMicrocoded : SchedWrite;<br>
+<br>
+// Fence instructions.<br>
+def : WriteRes<WriteFence,  [BWPort23, BWPort4]>;<br>
+<br>
+// Nop, not very useful except that it provides a model for nops!<br>
+def : WriteRes<WriteNop, []>;<br>
+<br>
+////////////////////////////////////////////////////////////////////////////////<br>
+// Horizontal add/sub instructions.<br>
+////////////////////////////////////////////////////////////////////////////////<br>
+// HADD, HSUB PS/PD<br>
+// x,x / v,v,v.<br>
+def : WriteRes<WriteFHAdd, [BWPort1]> {<br>
+  let Latency = 3;<br>
+}<br>
+<br>
+// x,m / v,v,m.<br>
+def : WriteRes<WriteFHAddLd, [BWPort1, BWPort23]> {<br>
+  let Latency = 7;<br>
+  let ResourceCycles = [1, 1];<br>
+}<br>
+<br>
+// PHADD|PHSUB (S) W/D.<br>
+// v <- v,v.<br>
+def : WriteRes<WritePHAdd, [BWPort15]>;<br>
+<br>
+// v <- v,m.<br>
+def : WriteRes<WritePHAddLd, [BWPort15, BWPort23]> {<br>
+  let Latency = 5;<br>
+  let ResourceCycles = [1, 1];<br>
+}<br>
+<br>
+// Remaining instrs.<br>
+<br>
+def BWWriteResGroup1 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_MOVD64from64rr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_MOVD64grr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PMOVMSKBrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSLLDri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSLLDrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSLLQri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSLLQrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSLLWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSLLWrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRADri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRADrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRAWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRAWrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRLDri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRLDrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRLQri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRLQrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRLWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MMX_PSRLWrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MOVPDI2DIrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "MOVPQIto64rr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSLLDri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSLLQri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSLLWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSRADri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSRAWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSRLDri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSRLQri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "PSRLWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VMOVPDI2DIrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VMOVPQIto64rr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLDYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLDri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLQYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLQri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLVQYrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLVQrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLWYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSLLWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRADYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRADri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRAWYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRAWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLDYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLDri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLQYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLQri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLVQYrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLVQrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLWYri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VPSRLWri")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VTESTPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VTESTPDrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VTESTPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup1], (instregex "VTESTPSrr")>;<br>
+<br>
+def BWWriteResGroup2 : SchedWriteRes<[BWPort1]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup2], (instregex "COMP_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup2], (instregex "COM_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup2], (instregex "MMX_MASKMOVQ64")>;<br>
+def: InstRW<[BWWriteResGroup2], (instregex "UCOM_FPr")>;<br>
+def: InstRW<[BWWriteResGroup2], (instregex "UCOM_Fr")>;<br>
+def: InstRW<[BWWriteResGroup2], (instregex "VMASKMOVDQU")>;<br>
+<br>
+def BWWriteResGroup3 : SchedWriteRes<[BWPort5]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup3], (instregex "ANDNPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "ANDNPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "ANDPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "ANDPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "INSERTPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_MOVD64rr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_MOVD64to64rr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_MOVQ2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PALIGNR64irr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PSHUFBrr64")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PSHUFWri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PUNPCKHBWirr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PUNPCKHDQirr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PUNPCKHWDirr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PUNPCKLBWirr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PUNPCKLDQirr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MMX_PUNPCKLWDirr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOV64toPQIrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVAPDrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVAPSrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVDDUPrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVDI2PDIrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVHLPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVLHPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVSDrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVSHDUPrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVSLDUPrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVSSrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVUPDrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "MOVUPSrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "ORPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "ORPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PACKSSDWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PACKSSWBrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PACKUSDWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PACKUSWBrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PALIGNRrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PBLENDWrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVSXBDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVSXBQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVSXBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVSXDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVSXWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVSXWQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVZXBDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVZXBQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVZXBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVZXDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVZXWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PMOVZXWQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PSHUFBrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PSHUFDri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PSHUFHWri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PSHUFLWri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PSLLDQri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PSRLDQri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKHBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKHDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKHQDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKHWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKLBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKLDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKLQDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "PUNPCKLWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "SHUFPDrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "SHUFPSrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "UNPCKHPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "UNPCKHPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "UNPCKLPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "UNPCKLPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDNPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDNPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDNPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDNPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VANDPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VBROADCASTSSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VINSERTPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOV64toPQIrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVAPDYrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVAPDrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVAPSYrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVAPSrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVDDUPYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVDDUPrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVDI2PDIrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVHLPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVLHPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVSDrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVSHDUPYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVSHDUPrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVSLDUPYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVSLDUPrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVSSrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVUPDYrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVUPDrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVUPSYrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VMOVUPSrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VORPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VORPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VORPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VORPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKSSDWYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKSSDWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKSSWBYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKSSWBrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKUSDWYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKUSDWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKUSWBYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPACKUSWBrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPALIGNRYrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPALIGNRrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPBLENDWYrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPBLENDWrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPBROADCASTDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPBROADCASTQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPDYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPDri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPSYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPSri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPERMILPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVSXBDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVSXBQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVSXBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVSXDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVSXWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVSXWQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVZXBDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVZXBQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVZXBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVZXDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVZXWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPMOVZXWQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFBYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFBrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFDYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFDri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFHWYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFHWri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFLWYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSHUFLWri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSLLDQYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSLLDQri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSRLDQYri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPSRLDQri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHQDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHQDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHWDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKHWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLBWrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLQDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLQDQrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLWDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VPUNPCKLWDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VSHUFPDYrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VSHUFPDrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VSHUFPSYrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VSHUFPSrri")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKHPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKHPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKHPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKHPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKLPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKLPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKLPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VUNPCKLPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VXORPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VXORPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VXORPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "VXORPSrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "XORPDrr")>;<br>
+def: InstRW<[BWWriteResGroup3], (instregex "XORPSrr")>;<br>
+<br>
+def BWWriteResGroup4 : SchedWriteRes<[BWPort6]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup4], (instregex "JMP(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup5 : SchedWriteRes<[BWPort01]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup5], (instregex "FINCSTP")>;<br>
+def: InstRW<[BWWriteResGroup5], (instregex "FNOP")>;<br>
+<br>
+def BWWriteResGroup6 : SchedWriteRes<[BWPort06]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADC(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADC(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADC8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADCX32rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADCX64rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADOX32rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "ADOX64rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BT(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BT(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BTC(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BTC(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BTR(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BTR(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BTS(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "BTS(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CDQ")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVAE(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVB(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVE(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVG(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVGE(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVL(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVLE(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVNE(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVNO(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVNP(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVNS(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVO(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVP(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CMOVS(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "CQO")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JAE_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JAE_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JA_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JA_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JBE_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JBE_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JB_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JB_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JE_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JE_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JGE_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JGE_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JG_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JG_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JLE_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JLE_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JL_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JL_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JMP_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JMP_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNE_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNE_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNO_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNO_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNP_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNP_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNS_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JNS_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JO_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JO_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JP_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JP_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JS_1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "JS_4")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "RORX32ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "RORX64ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SAR(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SAR(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SAR8r1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SAR8ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SARX32rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SARX64rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SBB(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SBB(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SBB8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETAEr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETBr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETEr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETGEr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETGr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETLEr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETLr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETNEr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETNOr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETNPr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETNSr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETOr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETPr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SETSr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHL(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHL(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHL8r1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHL8ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHLX32rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHLX64rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHR(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHR(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHR8r1")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHR8ri")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHRX32rr")>;<br>
+def: InstRW<[BWWriteResGroup6], (instregex "SHRX64rr")>;<br>
+<br>
+def BWWriteResGroup7 : SchedWriteRes<[BWPort15]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup7], (instregex "ANDN32rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "ANDN64rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BLSI32rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BLSI64rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BLSMSK32rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BLSMSK64rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BLSR32rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BLSR64rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BZHI32rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "BZHI64rr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "LEA(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PABSBrr64")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PABSDrr64")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PABSWrr64")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDDirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDQirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDSBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDSWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDUSBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDUSWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PADDWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PAVGBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PAVGWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PCMPEQBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PCMPEQDirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PCMPEQWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PCMPGTBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PCMPGTDirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PCMPGTWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PMAXSWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PMAXUBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PMINSWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PMINUBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSIGNBrr64")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSIGNDrr64")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSIGNWrr64")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBDirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBQirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBSBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBSWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBUSBirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBUSWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "MMX_PSUBWirr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PABSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PABSDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PABSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDQrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDUSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDUSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PADDWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PAVGBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PAVGWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPEQBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPEQDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPEQQrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPEQWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPGTBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPGTDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PCMPGTWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMAXSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMAXSDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMAXSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMAXUBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMAXUDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMAXUWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMINSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMINSDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMINSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMINUBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMINUDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PMINUWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSIGNBrr128")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSIGNDrr128")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSIGNWrr128")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBQrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBUSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBUSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "PSUBWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPABSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPABSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPABSDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPABSDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPABSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPABSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDQrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDUSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDUSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDUSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDUSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPADDWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPAVGBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPAVGBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPAVGWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPAVGWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQQYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQQrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPEQWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPGTBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPGTBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPGTDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPGTDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPGTWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPCMPGTWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXSDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXSDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXUBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXUBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXUDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXUDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXUWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMAXUWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINSDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINSDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINUBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINUBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINUDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINUDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINUWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPMINUWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSIGNBYrr256")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSIGNBrr128")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSIGNDYrr256")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSIGNDrr128")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSIGNWYrr256")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSIGNWrr128")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBDYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBDrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBQYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBQrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBUSBYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBUSBrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBUSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBUSWrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup7], (instregex "VPSUBWrr")>;<br>
+<br>
+def BWWriteResGroup8 : SchedWriteRes<[BWPort015]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup8], (instregex "BLENDPDrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "BLENDPSrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MMX_MOVD64from64rr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MMX_MOVQ64rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MMX_PANDNirr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MMX_PANDirr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MMX_PORirr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MMX_PXORirr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MOVDQArr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MOVDQUrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "MOVPQI2QIrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "PANDNrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "PANDrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "PORrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "PXORrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VBLENDPDYrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VBLENDPDrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VBLENDPSYrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VBLENDPSrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VMOVDQAYrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VMOVDQArr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VMOVDQUYrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VMOVDQUrr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VMOVPQI2QIrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VMOVZPQILo2PQIrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPANDNYrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPANDNrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPANDYrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPANDrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPBLENDDYrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPBLENDDrri")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPORYrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPORrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPXORYrr")>;<br>
+def: InstRW<[BWWriteResGroup8], (instregex "VPXORrr")>;<br>
+<br>
+def BWWriteResGroup9 : SchedWriteRes<[BWPort0156]> {</p>
<p class="MsoNormal">+  let Latency = 1;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}</p>
</div>
<div>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup9], (instregex "ADD(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "ADD(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "ADD8i8")>;</p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup9], (instregex "ADD8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "ADD8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "AND(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "AND(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "AND8i8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "AND8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "AND8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CBW")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CLC")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CMC")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CMP(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CMP(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CMP8i8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CMP8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CMP8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "CWDE")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "DEC(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "DEC8r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "INC(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "INC8r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "LAHF")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOV(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOV8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOV8ri_alt")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOV8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOVSX(16|32|64)rr16")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOVSX(16|32|64)rr32")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOVSX(16|32|64)rr8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOVZX(16|32|64)rr16")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "MOVZX(16|32|64)rr8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "NEG(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "NEG8r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "NOOP")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "NOT(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "NOT8r")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "OR(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "OR(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "OR8i8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "OR8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "OR8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SAHF")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SGDT64m")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SIDT64m")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SLDT64m")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SMSW16m")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "STC")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "STRm")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SUB(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SUB(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SUB8i8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SUB8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SUB8rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "SYSCALL")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "TEST(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "TEST8i8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "TEST8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "TEST8rr")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "XCHG(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "XOR(16|32|64)ri8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "XOR(16|32|64)rr(_REV)?")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "XOR8i8")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "XOR8ri")>;<br>
+def: InstRW<[BWWriteResGroup9], (instregex "XOR8rr(_REV)?")>;<br>
+<br>
+def BWWriteResGroup10 : SchedWriteRes<[BWPort4,BWPort237]> {<br>
+  let Latency = 1;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup10], (instregex "FBSTPm")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MMX_MOVD64from64rm")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MMX_MOVD64mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MMX_MOVNTQmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MMX_MOVQ64mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOV(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOV8mi")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOV8mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVAPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVAPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVDQAmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVDQUmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVHPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVHPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVLPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVLPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVNTDQmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVNTI_64mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVNTImr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVNTPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVNTPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVPDI2DImr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVPQI2QImr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVPQIto64mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVSSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVUPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "MOVUPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "ST_FP32m")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "ST_FP64m")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "ST_FP80m")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VEXTRACTF128mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VEXTRACTI128mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVAPDYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVAPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVAPSYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVAPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVDQAYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVDQAmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVDQUYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVDQUmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVHPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVHPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVLPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVLPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVNTDQYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVNTDQmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVNTPDYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVNTPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVNTPSYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVNTPSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVPDI2DImr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVPQI2QImr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVPQIto64mr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVSDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVSSmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVUPDYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVUPDmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVUPSYmr")>;<br>
+def: InstRW<[BWWriteResGroup10], (instregex "VMOVUPSmr")>;<br>
+<br>
+def BWWriteResGroup11 : SchedWriteRes<[BWPort5]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup11], (instregex "BLENDVPDrr0")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "BLENDVPSrr0")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "MMX_PINSRWirri")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "PBLENDVBrr0")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "PINSRBrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "PINSRDrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "PINSRQrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "PINSRWrri")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VBLENDVPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VBLENDVPDrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VBLENDVPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VBLENDVPSrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VPBLENDVBYrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VPBLENDVBrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VPINSRBrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VPINSRDrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VPINSRQrr")>;<br>
+def: InstRW<[BWWriteResGroup11], (instregex "VPINSRWrri")>;<br>
+<br>
+def BWWriteResGroup12 : SchedWriteRes<[BWPort01]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup12], (instregex "FDECSTP")>;<br>
+<br>
+def BWWriteResGroup13 : SchedWriteRes<[BWPort06]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROL(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROL(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROL8r1")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROL8ri")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROR(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROR(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROR8r1")>;<br>
+def: InstRW<[BWWriteResGroup13], (instregex "ROR8ri")>;<br>
+<br>
+def BWWriteResGroup14 : SchedWriteRes<[BWPort0156]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup14], (instregex "LFENCE")>;<br>
+def: InstRW<[BWWriteResGroup14], (instregex "MFENCE")>;<br>
+def: InstRW<[BWWriteResGroup14], (instregex "WAIT")>;<br>
+def: InstRW<[BWWriteResGroup14], (instregex "XGETBV")>;<br>
+<br>
+def BWWriteResGroup15 : SchedWriteRes<[BWPort0,BWPort5]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup15], (instregex "CVTPS2PDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "CVTSS2SDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "EXTRACTPSrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "MMX_PEXTRWirri")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PEXTRBrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PEXTRDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PEXTRQrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PEXTRWri")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PEXTRWrr_REV")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSLLDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSLLQrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSLLWrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSRADrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSRAWrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSRLDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSRLQrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PSRLWrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "PTESTrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VCVTPH2PSYrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VCVTPH2PSrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VCVTPS2PDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VCVTSS2SDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VEXTRACTPSrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPEXTRBrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPEXTRDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPEXTRQrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPEXTRWri")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPEXTRWrr_REV")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSLLDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSLLQrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSLLWrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSRADrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSRAWrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSRLDrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSRLQrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPSRLWrr")>;<br>
+def: InstRW<[BWWriteResGroup15], (instregex "VPTESTrr")>;<br>
+<br>
+def BWWriteResGroup16 : SchedWriteRes<[BWPort6,BWPort0156]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup16], (instregex "CLFLUSH")>;<br>
+<br>
+def BWWriteResGroup17 : SchedWriteRes<[BWPort01,BWPort015]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup17], (instregex "MMX_MOVDQ2Qrr")>;<br>
+<br>
+def BWWriteResGroup18 : SchedWriteRes<[BWPort237,BWPort0156]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup18], (instregex "SFENCE")>;<br>
+<br>
+def BWWriteResGroup19 : SchedWriteRes<[BWPort06,BWPort15]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup19], (instregex "BEXTR32rr")>;<br>
+def: InstRW<[BWWriteResGroup19], (instregex "BEXTR64rr")>;<br>
+def: InstRW<[BWWriteResGroup19], (instregex "BSWAP(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup20 : SchedWriteRes<[BWPort06,BWPort0156]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup20], (instregex "ADC8i8")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "ADC8ri")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "CMOVA(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "CMOVBE(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "CWD")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "JRCXZ")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "SBB8i8")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "SBB8ri")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "SETAr")>;<br>
+def: InstRW<[BWWriteResGroup20], (instregex "SETBEr")>;<br>
+<br>
+def BWWriteResGroup21 : SchedWriteRes<[BWPort4,BWPort5,BWPort237]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup21], (instregex "EXTRACTPSmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "PEXTRBmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "PEXTRDmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "PEXTRQmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "PEXTRWmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "STMXCSR")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "VEXTRACTPSmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "VPEXTRBmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "VPEXTRDmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "VPEXTRQmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "VPEXTRWmr")>;<br>
+def: InstRW<[BWWriteResGroup21], (instregex "VSTMXCSR")>;<br>
+<br>
+def BWWriteResGroup22 : SchedWriteRes<[BWPort4,BWPort6,BWPort237]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup22], (instregex "FNSTCW16m")>;<br>
+<br>
+def BWWriteResGroup23 : SchedWriteRes<[BWPort4,BWPort237,BWPort06]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETAEm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETBm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETEm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETGEm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETGm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETLEm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETLm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETNEm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETNOm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETNPm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETNSm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETOm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETPm")>;<br>
+def: InstRW<[BWWriteResGroup23], (instregex "SETSm")>;<br>
+<br>
+def BWWriteResGroup24 : SchedWriteRes<[BWPort4,BWPort237,BWPort15]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup24], (instregex "MOVBE(16|32|64)mr")>;<br>
+<br>
+def BWWriteResGroup25 : SchedWriteRes<[BWPort4,BWPort237,BWPort0156]> {<br>
+  let Latency = 2;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup25], (instregex "PUSH(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup25], (instregex "PUSH(16|32|64)rmr")>;<br>
+def: InstRW<[BWWriteResGroup25], (instregex "PUSH64i8")>;<br>
+def: InstRW<[BWWriteResGroup25], (instregex "STOSB")>;<br>
+def: InstRW<[BWWriteResGroup25], (instregex "STOSL")>;<br>
+def: InstRW<[BWWriteResGroup25], (instregex "STOSQ")>;<br>
+def: InstRW<[BWWriteResGroup25], (instregex "STOSW")>;<br>
+<br>
+def BWWriteResGroup26 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup26], (instregex "MOVMSKPDrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "MOVMSKPSrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "PMOVMSKBrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "VMOVMSKPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "VMOVMSKPDrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "VMOVMSKPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "VMOVMSKPSrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "VPMOVMSKBYrr")>;<br>
+def: InstRW<[BWWriteResGroup26], (instregex "VPMOVMSKBrr")>;<br>
+<br>
+def BWWriteResGroup27 : SchedWriteRes<[BWPort1]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADDPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADDPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADDSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADDSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADDSUBPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADDSUBPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADD_FPrST0")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADD_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "ADD_FrST0")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "BSF(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "BSR(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "CMPPDrri")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "CMPPSrri")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "CMPSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "COMISDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "COMISSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "CVTDQ2PSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "CVTPS2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "CVTTPS2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "IMUL(32|64)rr(i8)?")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "IMUL8r")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "LZCNT(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MAXPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MAXPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MAXSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MAXSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MINPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MINPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MINSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MINSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MMX_CVTPI2PSirr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "MUL8r")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "PDEP32rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "PDEP64rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "PEXT32rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "PEXT64rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "POPCNT(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SHLD(16|32|64)rri8")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SHRD(16|32|64)rri8")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBR_FPrST0")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBR_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBR_FrST0")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUBSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUB_FPrST0")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUB_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "SUB_FrST0")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "TZCNT(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "UCOMISDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "UCOMISSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDSUBPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDSUBPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDSUBPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VADDSUBPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCMPPDYrri")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCMPPDrri")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCMPPSYrri")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCMPPSrri")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCMPSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCMPSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCOMISDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCOMISSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCVTDQ2PSYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCVTDQ2PSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCVTPS2DQYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCVTPS2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCVTTPS2DQYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VCVTTPS2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMAXPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMAXPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMAXPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMAXPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMAXSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMAXSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMINPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMINPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMINPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMINPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMINSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VMINSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VSUBPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VSUBPDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VSUBPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VSUBPSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VSUBSDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VSUBSSrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VUCOMISDrr")>;<br>
+def: InstRW<[BWWriteResGroup27], (instregex "VUCOMISSrr")>;<br>
+<br>
+def BWWriteResGroup27_16 : SchedWriteRes<[BWPort1, BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup27_16], (instregex "IMUL16rr(i8?)")>;<br>
+<br>
+def BWWriteResGroup28 : SchedWriteRes<[BWPort5]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VBROADCASTSDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VBROADCASTSSYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VEXTRACTF128rr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VEXTRACTI128rr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VINSERTF128rr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VINSERTI128rr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPBROADCASTBYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPBROADCASTBrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPBROADCASTDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPBROADCASTQYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPBROADCASTWYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPBROADCASTWrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPERM2F128rr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPERM2I128rr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPERMDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPERMPDYri")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPERMPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPERMQYri")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVSXBDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVSXBQYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVSXBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVSXDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVSXWDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVSXWQYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVZXBDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVZXBQYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVZXBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVZXDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVZXWDYrr")>;<br>
+def: InstRW<[BWWriteResGroup28], (instregex "VPMOVZXWQYrr")>;<br>
+<br>
+def BWWriteResGroup29 : SchedWriteRes<[BWPort01]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup29], (instregex "MULPDrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "MULPSrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "MULSDrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "MULSSrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "VMULPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "VMULPDrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "VMULPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "VMULPSrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "VMULSDrr")>;<br>
+def: InstRW<[BWWriteResGroup29], (instregex "VMULSSrr")>;<br>
+<br>
+def BWWriteResGroup30 : SchedWriteRes<[BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup30], (instregex "XADD(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup30], (instregex "XADD8rr")>;<br>
+def: InstRW<[BWWriteResGroup30], (instregex "XCHG8rr")>;<br>
+<br>
+def BWWriteResGroup31 : SchedWriteRes<[BWPort0,BWPort5]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup31], (instregex "VPSLLVDYrr")>;<br>
+def: InstRW<[BWWriteResGroup31], (instregex "VPSLLVDrr")>;<br>
+def: InstRW<[BWWriteResGroup31], (instregex "VPSRAVDYrr")>;<br>
+def: InstRW<[BWWriteResGroup31], (instregex "VPSRAVDrr")>;<br>
+def: InstRW<[BWWriteResGroup31], (instregex "VPSRLVDYrr")>;<br>
+def: InstRW<[BWWriteResGroup31], (instregex "VPSRLVDrr")>;<br>
+<br>
+def BWWriteResGroup32 : SchedWriteRes<[BWPort5,BWPort15]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup32], (instregex "MMX_PHADDSWrr64")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "MMX_PHADDWrr64")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "MMX_PHADDrr64")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "MMX_PHSUBDrr64")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "MMX_PHSUBSWrr64")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "MMX_PHSUBWrr64")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "PHADDDrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "PHADDSWrr128")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "PHADDWrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "PHSUBDrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "PHSUBSWrr128")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "PHSUBWrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHADDDYrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHADDDrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHADDSWrr128")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHADDSWrr256")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHADDWYrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHADDWrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHSUBDYrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHSUBDrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHSUBSWrr128")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHSUBSWrr256")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHSUBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup32], (instregex "VPHSUBWrr")>;<br>
+<br>
+def BWWriteResGroup33 : SchedWriteRes<[BWPort5,BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup33], (instregex "MMX_PACKSSDWirr")>;<br>
+def: InstRW<[BWWriteResGroup33], (instregex "MMX_PACKSSWBirr")>;<br>
+def: InstRW<[BWWriteResGroup33], (instregex "MMX_PACKUSWBirr")>;<br>
+<br>
+def BWWriteResGroup34 : SchedWriteRes<[BWPort6,BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup34], (instregex "CLD")>;<br>
+<br>
+def BWWriteResGroup35 : SchedWriteRes<[BWPort06,BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCL(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCL(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCL8r1")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCL8ri")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCR(16|32|64)r1")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCR(16|32|64)ri")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCR8r1")>;<br>
+def: InstRW<[BWWriteResGroup35], (instregex "RCR8ri")>;<br>
+<br>
+def BWWriteResGroup36 : SchedWriteRes<[BWPort06,BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup36], (instregex "ROL(16|32|64)rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "ROL8rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "ROR(16|32|64)rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "ROR8rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "SAR(16|32|64)rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "SAR8rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "SHL(16|32|64)rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "SHL8rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "SHR(16|32|64)rCL")>;<br>
+def: InstRW<[BWWriteResGroup36], (instregex "SHR8rCL")>;<br>
+<br>
+def BWWriteResGroup37 : SchedWriteRes<[BWPort4,BWPort6,BWPort237,BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup37], (instregex "CALL(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup38 : SchedWriteRes<[BWPort4,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 3;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup38], (instregex "CALL64pcrel32")>;<br>
+def: InstRW<[BWWriteResGroup38], (instregex "SETAm")>;<br>
+def: InstRW<[BWWriteResGroup38], (instregex "SETBEm")>;<br>
+<br>
+def BWWriteResGroup39 : SchedWriteRes<[BWPort0,BWPort1]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTSD2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTSD2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTSS2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTSS2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTTSD2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTTSD2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTTSS2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "CVTTSS2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTSD2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTSD2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTSS2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTSS2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTTSD2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTTSD2SIrr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTTSS2SI64rr")>;<br>
+def: InstRW<[BWWriteResGroup39], (instregex "VCVTTSS2SIrr")>;<br>
+<br>
+def BWWriteResGroup40 : SchedWriteRes<[BWPort0,BWPort5]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VCVTPS2PDYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSLLDYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSLLQYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSLLWYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSRADYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSRAWYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSRLDYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSRLQYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPSRLWYrr")>;<br>
+def: InstRW<[BWWriteResGroup40], (instregex "VPTESTYrr")>;<br>
+<br>
+def BWWriteResGroup41 : SchedWriteRes<[BWPort0,BWPort0156]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup41], (instregex "FNSTSW16r")>;<br>
+<br>
+def BWWriteResGroup42 : SchedWriteRes<[BWPort1,BWPort5]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTDQ2PDrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTPD2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTPD2PSrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTSD2SSrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTSI2SD64rr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTSI2SDrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTSI2SSrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "CVTTPD2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "IMUL(32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MMX_CVTPD2PIirr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MMX_CVTPI2PDirr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MMX_CVTPS2PIirr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MMX_CVTTPD2PIirr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MMX_CVTTPS2PIirr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MUL(32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "MULX64rr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTDQ2PDrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTPD2DQrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTPD2PSrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTPS2PHrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTSD2SSrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTSI2SD64rr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTSI2SDrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTSI2SSrr")>;<br>
+def: InstRW<[BWWriteResGroup42], (instregex "VCVTTPD2DQrr")>;<br>
+<br>
+def BWWriteResGroup42_16 : SchedWriteRes<[BWPort1,BWPort06,BWPort0156]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 4;<br>
+}<br>
+def: InstRW<[BWWriteResGroup42_16], (instregex "IMUL16r")>;<br>
+def: InstRW<[BWWriteResGroup42_16], (instregex "MUL16r")>;<br>
+<br>
+def BWWriteResGroup43 : SchedWriteRes<[BWPort0,BWPort4,BWPort237]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup43], (instregex "FNSTSWm")>;<br>
+<br>
+def BWWriteResGroup44 : SchedWriteRes<[BWPort1,BWPort4,BWPort237]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup44], (instregex "ISTT_FP16m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "ISTT_FP32m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "ISTT_FP64m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "IST_F16m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "IST_F32m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "IST_FP16m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "IST_FP32m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "IST_FP64m")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "VCVTPS2PHYmr")>;<br>
+def: InstRW<[BWWriteResGroup44], (instregex "VCVTPS2PHmr")>;<br>
+<br>
+def BWWriteResGroup45 : SchedWriteRes<[BWPort0156]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [4];<br>
+}<br>
+def: InstRW<[BWWriteResGroup45], (instregex "FNCLEX")>;<br>
+<br>
+def BWWriteResGroup46 : SchedWriteRes<[BWPort015,BWPort0156]> {<br>
+  let Latency = 4;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup46], (instregex "VZEROUPPER")>;<br>
+<br>
+def BWWriteResGroup47 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMADDUBSWrr64")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMADDWDirr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMULHRSWrr64")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMULHUWirr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMULHWirr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMULLWirr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PMULUDQirr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MMX_PSADBWirr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MUL_FPrST0")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MUL_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "MUL_FrST0")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PCLMULQDQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PCMPGTQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PHMINPOSUWrr128")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMADDUBSWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMADDWDrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMULDQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMULHRSWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMULHUWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMULHWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMULLWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PMULUDQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "PSADBWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "RCPPSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "RCPSSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "RSQRTPSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "RSQRTSSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPCLMULQDQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPCMPGTQYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPCMPGTQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPHMINPOSUWrr128")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMADDUBSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMADDUBSWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMADDWDYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMADDWDrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULDQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULHRSWYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULHRSWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULHUWYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULHUWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULHWYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULHWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULLWYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULLWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULUDQYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPMULUDQrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPSADBWYrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VPSADBWrr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VRCPPSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VRCPSSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VRSQRTPSr")>;<br>
+def: InstRW<[BWWriteResGroup47], (instregex "VRSQRTSSr")>;<br>
+<br>
+def BWWriteResGroup48 : SchedWriteRes<[BWPort01]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD132PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD132PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD132PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD132PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD132SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD132SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD213PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD213PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD213PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD213PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD213SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD213SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD231PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD231PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD231PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD231PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD231SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADD231SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB132PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB132PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB132PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB132PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB213PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB213PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB213PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB213PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB231PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB231PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB231PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMADDSUB231PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB132PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB132PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB132PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB132PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB132SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB132SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB213PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB213PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB213PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB213PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB213SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB213SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB231PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB231PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB231PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB231PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB231SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUB231SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD132PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD132PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD132PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD132PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD213PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD213PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD213PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD213PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD231PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD231PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD231PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFMSUBADD231PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD132PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD132PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD132PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD132PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD132SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD132SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD213PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD213PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD213PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD213PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD213SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD213SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD231PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD231PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD231PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD231PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD231SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMADD231SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB132PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB132PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB132PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB132PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB132SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB132SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB213PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB213PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB213PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB213PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB213SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB213SSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB231PDYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB231PDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB231PSYr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB231PSr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB231SDr")>;<br>
+def: InstRW<[BWWriteResGroup48], (instregex "VFNMSUB231SSr")>;<br>
+<br>
+def BWWriteResGroup49 : SchedWriteRes<[BWPort23]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup49], (instregex "LDDQUrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MMX_MOVD64from64rm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MMX_MOVD64rm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MMX_MOVD64to64rm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MMX_MOVQ64rm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOV(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOV64toPQIrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOV8rm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVAPDrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVAPSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVDDUPrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVDI2PDIrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVDQArm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVDQUrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVNTDQArm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVSHDUPrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVSLDUPrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVSSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVSX(16|32|64)rm16")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVSX(16|32|64)rm32")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVSX(16|32|64)rm8")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVUPDrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVUPSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVZX(16|32|64)rm16")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "MOVZX(16|32|64)rm8")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "PREFETCHNTA")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "PREFETCHT0")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "PREFETCHT1")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "PREFETCHT2")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VBROADCASTSSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VLDDQUrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOV64toPQIrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVAPDrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVAPSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVDDUPrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVDI2PDIrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVDQArm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVDQUrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVNTDQArm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVQI2PQIrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVSDrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVSHDUPrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVSLDUPrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVSSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVUPDrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VMOVUPSrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VPBROADCASTDrm")>;<br>
+def: InstRW<[BWWriteResGroup49], (instregex "VPBROADCASTQrm")>;<br>
+<br>
+def BWWriteResGroup50 : SchedWriteRes<[BWPort1,BWPort5]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup50], (instregex "CVTSI2SS64rr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "HADDPDrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "HADDPSrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "HSUBPDrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "HSUBPSrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VCVTSI2SS64rr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHADDPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHADDPDrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHADDPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHADDPSrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHSUBPDYrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHSUBPDrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHSUBPSYrr")>;<br>
+def: InstRW<[BWWriteResGroup50], (instregex "VHSUBPSrr")>;<br>
+<br>
+def BWWriteResGroup51 : SchedWriteRes<[BWPort1,BWPort6,BWPort06]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup51], (instregex "STR(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup52 : SchedWriteRes<[BWPort1,BWPort06,BWPort0156]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup52], (instregex "MULX32rr")>;<br>
+</p>
<p class="MsoNormal">+def BWWriteResGroup53 : SchedWriteRes<[BWPort0,BWPort4,BWPort237,BWPort15]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 4;</p>
</div>
<div>
<p class="MsoNormal">+  let ResourceCycles = [1,1,1,1];<br>
+}</p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup53], (instregex "VMASKMOVPDYmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VMASKMOVPDmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VMASKMOVPSYmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VMASKMOVPSmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VPMASKMOVDYmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VPMASKMOVDmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VPMASKMOVQYmr")>;<br>
+def: InstRW<[BWWriteResGroup53], (instregex "VPMASKMOVQmr")>;<br>
+<br>
+def BWWriteResGroup54 : SchedWriteRes<[BWPort6,BWPort0156]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,4];<br>
+}<br>
+def: InstRW<[BWWriteResGroup54], (instregex "PAUSE")>;<br>
+<br>
+def BWWriteResGroup55 : SchedWriteRes<[BWPort06,BWPort0156]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,4];<br>
+}<br>
+def: InstRW<[BWWriteResGroup55], (instregex "XSETBV")>;<br>
+<br>
+def BWWriteResGroup56 : SchedWriteRes<[BWPort06,BWPort0156]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [2,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup56], (instregex "CMPXCHG(16|32|64)rr")>;<br>
+def: InstRW<[BWWriteResGroup56], (instregex "CMPXCHG8rr")>;<br>
+<br>
+def BWWriteResGroup57 : SchedWriteRes<[BWPort4,BWPort237,BWPort0156]> {<br>
+  let Latency = 5;<br>
+  let NumMicroOps = 6;<br>
+  let ResourceCycles = [1,1,4];<br>
+}<br>
+def: InstRW<[BWWriteResGroup57], (instregex "PUSHF16")>;<br>
+def: InstRW<[BWWriteResGroup57], (instregex "PUSHF64")>;<br>
+<br>
+def BWWriteResGroup58 : SchedWriteRes<[BWPort23]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup58], (instregex "LD_F32m")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "LD_F64m")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "LD_F80m")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VBROADCASTF128")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VBROADCASTI128")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VBROADCASTSDYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VBROADCASTSSYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VLDDQUYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVAPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVAPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVDDUPYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVDQAYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVDQUYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVNTDQAYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVSHDUPYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVSLDUPYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVUPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VMOVUPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VPBROADCASTDYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VPBROADCASTQYrm")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "ROUNDPDr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "ROUNDPSr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "ROUNDSDr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "ROUNDSSr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VROUNDPDr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VROUNDPSr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VROUNDSDr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VROUNDSSr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VROUNDYPDr")>;<br>
+def: InstRW<[BWWriteResGroup58], (instregex "VROUNDYPSr")>;<br>
+<br>
+def BWWriteResGroup59 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup59], (instregex "CVTPS2PDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "CVTSS2SDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSLLDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSLLQrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSLLWrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSRADrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSRAWrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSRLDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSRLQrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "MMX_PSRLWrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VCVTPH2PSYrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VCVTPH2PSrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VCVTPS2PDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VCVTSS2SDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VPSLLVQrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VPSRLVQrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VTESTPDrm")>;<br>
+def: InstRW<[BWWriteResGroup59], (instregex "VTESTPSrm")>;<br>
+<br>
+def BWWriteResGroup60 : SchedWriteRes<[BWPort1,BWPort5]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup60], (instregex "VCVTDQ2PDYrr")>;<br>
+def: InstRW<[BWWriteResGroup60], (instregex "VCVTPD2DQYrr")>;<br>
+def: InstRW<[BWWriteResGroup60], (instregex "VCVTPD2PSYrr")>;<br>
+def: InstRW<[BWWriteResGroup60], (instregex "VCVTPS2PHYrr")>;<br>
+def: InstRW<[BWWriteResGroup60], (instregex "VCVTTPD2DQYrr")>;<br>
+<br>
+def BWWriteResGroup61 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup61], (instregex "ANDNPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "ANDNPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "ANDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "ANDPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "INSERTPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PALIGNR64irm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PINSRWirmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PSHUFBrm64")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PSHUFWmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PUNPCKHBWirm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PUNPCKHDQirm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PUNPCKHWDirm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PUNPCKLBWirm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PUNPCKLDQirm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MMX_PUNPCKLWDirm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MOVHPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MOVHPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MOVLPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "MOVLPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "ORPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "ORPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PACKSSDWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PACKSSWBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PACKUSDWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PACKUSWBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PALIGNRrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PBLENDWrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PINSRBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PINSRDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PINSRQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PINSRWrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVSXBDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVSXBQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVSXBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVSXDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVSXWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVSXWQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVZXBDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVZXBQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVZXBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVZXDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVZXWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PMOVZXWQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PSHUFBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PSHUFDmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PSHUFHWmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PSHUFLWmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKHBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKHDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKHQDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKHWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKLBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKLDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKLQDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "PUNPCKLWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "SHUFPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "SHUFPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "UNPCKHPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "UNPCKHPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "UNPCKLPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "UNPCKLPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VANDNPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VANDNPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VANDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VANDPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VINSERTPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VMOVHPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VMOVHPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VMOVLPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VMOVLPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VORPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VORPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPACKSSDWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPACKSSWBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPACKUSDWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPACKUSWBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPALIGNRrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPBLENDWrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPERMILPDmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPERMILPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPERMILPSmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPERMILPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPINSRBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPINSRDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPINSRQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPINSRWrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVSXBDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVSXBQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVSXBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVSXDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVSXWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVSXWQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVZXBDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVZXBQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVZXBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVZXDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVZXWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPMOVZXWQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPSHUFBrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPSHUFDmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPSHUFHWmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPSHUFLWmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKHBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKHDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKHQDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKHWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKLBWrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKLDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKLQDQrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VPUNPCKLWDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VSHUFPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VSHUFPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VUNPCKHPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VUNPCKHPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VUNPCKLPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VUNPCKLPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VXORPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "VXORPSrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "XORPDrm")>;<br>
+def: InstRW<[BWWriteResGroup61], (instregex "XORPSrm")>;<br>
+<br>
+def BWWriteResGroup62 : SchedWriteRes<[BWPort6,BWPort23]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup62], (instregex "FARJMP64")>;<br>
+def: InstRW<[BWWriteResGroup62], (instregex "JMP(16|32|64)m")>;<br>
+<br>
+def BWWriteResGroup63 : SchedWriteRes<[BWPort23,BWPort06]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup63], (instregex "ADC(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "ADC8rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "ADCX32rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "ADCX64rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "ADOX32rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "ADOX64rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "BT(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVAE(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVB(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVE(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVG(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVGE(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVL(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVLE(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVNE(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVNO(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVNP(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVNS(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVO(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVP(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "CMOVS(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "RORX32mi")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "RORX64mi")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SARX32rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SARX64rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SBB(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SBB8rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SHLX32rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SHLX64rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SHRX32rm")>;<br>
+def: InstRW<[BWWriteResGroup63], (instregex "SHRX64rm")>;<br>
+<br>
+def BWWriteResGroup64 : SchedWriteRes<[BWPort23,BWPort15]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup64], (instregex "ANDN32rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "ANDN64rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BLSI32rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BLSI64rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BLSMSK32rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BLSMSK64rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BLSR32rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BLSR64rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BZHI32rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "BZHI64rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PABSBrm64")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PABSDrm64")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PABSWrm64")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDDirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDQirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDSBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDSWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDUSBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDUSWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PADDWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PAVGBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PAVGWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PCMPEQBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PCMPEQDirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PCMPEQWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PCMPGTBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PCMPGTDirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PCMPGTWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PMAXSWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PMAXUBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PMINSWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PMINUBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSIGNBrm64")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSIGNDrm64")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSIGNWrm64")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBDirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBQirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBSBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBSWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBUSBirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBUSWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MMX_PSUBWirm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "MOVBE(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PABSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PABSDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PABSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDQrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDUSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDUSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PADDWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PAVGBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PAVGWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPEQBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPEQDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPEQQrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPEQWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPGTBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPGTDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PCMPGTWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMAXSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMAXSDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMAXSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMAXUBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMAXUDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMAXUWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMINSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMINSDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMINSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMINUBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMINUDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PMINUWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSIGNBrm128")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSIGNDrm128")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSIGNWrm128")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBQrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBUSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBUSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "PSUBWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPABSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPABSDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPABSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDQrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDUSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDUSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPADDWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPAVGBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPAVGWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPEQBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPEQDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPEQQrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPEQWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPGTBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPGTDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPCMPGTWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMAXSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMAXSDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMAXSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMAXUBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMAXUDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMAXUWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMINSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMINSDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMINSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMINUBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMINUDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPMINUWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSIGNBrm128")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSIGNDrm128")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSIGNWrm128")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBDrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBQrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBUSBrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBUSWrm")>;<br>
+def: InstRW<[BWWriteResGroup64], (instregex "VPSUBWrm")>;<br>
+<br>
+def BWWriteResGroup65 : SchedWriteRes<[BWPort23,BWPort015]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup65], (instregex "BLENDPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "BLENDPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "MMX_PANDNirm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "MMX_PANDirm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "MMX_PORirm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "MMX_PXORirm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "PANDNrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "PANDrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "PORrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "PXORrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VBLENDPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VBLENDPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VINSERTF128rm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VINSERTI128rm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VPANDNrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VPANDrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VPBLENDDrmi")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VPORrm")>;<br>
+def: InstRW<[BWWriteResGroup65], (instregex "VPXORrm")>;<br>
+<br>
+def BWWriteResGroup66 : SchedWriteRes<[BWPort23,BWPort0156]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup66], (instregex "ADD(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "ADD8rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "AND(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "AND8rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "CMP(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "CMP(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "CMP(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "CMP8mi")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "CMP8mr")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "CMP8rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "OR(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "OR8rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "POP(16|32|64)r")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "POP(16|32|64)rmr")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "SUB(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "SUB8rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "TEST(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "TEST8mi")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "TEST8mr")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "XOR(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup66], (instregex "XOR8rm")>;<br>
+<br>
+def BWWriteResGroup67 : SchedWriteRes<[BWPort1,BWPort06,BWPort0156]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup67], (instregex "SHLD(16|32|64)rrCL")>;<br>
+def: InstRW<[BWWriteResGroup67], (instregex "SHRD(16|32|64)rrCL")>;<br>
+<br>
+def BWWriteResGroup68 : SchedWriteRes<[BWPort1,BWPort6,BWPort06,BWPort0156]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup68], (instregex "SLDT(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup69 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort06]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup69], (instregex "BTC(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "BTR(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "BTS(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SAR(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SAR(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SAR8m1")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SAR8mi")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHL(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHL(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHL8m1")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHL8mi")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHR(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHR(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHR8m1")>;<br>
+def: InstRW<[BWWriteResGroup69], (instregex "SHR8mi")>;<br>
+<br>
+def BWWriteResGroup70 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup70], (instregex "ADD(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "ADD(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "ADD8mi")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "ADD8mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "AND(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "AND(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "AND8mi")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "AND8mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "DEC(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "DEC8m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "INC(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "INC8m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "NEG(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "NEG8m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "NOT(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "NOT8m")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "OR(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "OR(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "OR8mi")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "OR8mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "POP(16|32|64)rmm")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "PUSH(16|32|64)rmm")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "SUB(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "SUB(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "SUB8mi")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "SUB8mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "XOR(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "XOR(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "XOR8mi")>;<br>
+def: InstRW<[BWWriteResGroup70], (instregex "XOR8mr")>;<br>
+<br>
+def BWWriteResGroup71 : SchedWriteRes<[BWPort6,BWPort0156]> {<br>
+  let Latency = 6;<br>
+  let NumMicroOps = 6;<br>
+  let ResourceCycles = [1,5];<br>
+}<br>
+def: InstRW<[BWWriteResGroup71], (instregex "STD")>;<br>
+<br>
+def BWWriteResGroup72 : SchedWriteRes<[BWPort5]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup72], (instregex "AESDECLASTrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "AESDECrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "AESENCLASTrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "AESENCrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "VAESDECLASTrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "VAESDECrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "VAESENCLASTrr")>;<br>
+def: InstRW<[BWWriteResGroup72], (instregex "VAESENCrr")>;<br>
+<br>
+def BWWriteResGroup73 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSLLDYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSLLQYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSLLVQYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSLLWYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSRADYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSRAWYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSRLDYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSRLQYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSRLVQYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VPSRLWYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VTESTPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup73], (instregex "VTESTPSYrm")>;<br>
+<br>
+def BWWriteResGroup74 : SchedWriteRes<[BWPort1,BWPort23]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup74], (instregex "FCOM32m")>;<br>
+def: InstRW<[BWWriteResGroup74], (instregex "FCOM64m")>;<br>
+def: InstRW<[BWWriteResGroup74], (instregex "FCOMP32m")>;<br>
+def: InstRW<[BWWriteResGroup74], (instregex "FCOMP64m")>;<br>
+<br>
+def BWWriteResGroup75 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VANDNPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VANDNPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VANDPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VANDPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VORPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VORPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPACKSSDWYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPACKSSWBYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPACKUSDWYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPACKUSWBYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPALIGNRYrmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPBLENDWYrmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPERMILPDYmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPERMILPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPERMILPSYmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPERMILPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPSHUFBYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPSHUFDYmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPSHUFHWYmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPSHUFLWYmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKHBWYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKHDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKHQDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKHWDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKLBWYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKLDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKLQDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VPUNPCKLWDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VSHUFPDYrmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VSHUFPSYrmi")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VUNPCKHPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VUNPCKHPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VUNPCKLPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VUNPCKLPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VXORPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup75], (instregex "VXORPSYrm")>;<br>
+<br>
+def BWWriteResGroup76 : SchedWriteRes<[BWPort23,BWPort15]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPABSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPABSDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPABSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDUSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDUSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPADDWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPAVGBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPAVGWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPEQBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPEQDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPEQQYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPEQWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPGTBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPGTDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPCMPGTWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMAXSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMAXSDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMAXSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMAXUBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMAXUDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMAXUWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMINSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMINSDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMINSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMINUBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMINUDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPMINUWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSIGNBYrm256")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSIGNDYrm256")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSIGNWYrm256")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBDYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBQYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBUSBYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBUSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup76], (instregex "VPSUBWYrm")>;<br>
+<br>
+def BWWriteResGroup77 : SchedWriteRes<[BWPort23,BWPort015]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VBLENDPDYrmi")>;<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VBLENDPSYrmi")>;<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VPANDNYrm")>;<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VPANDYrm")>;<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VPBLENDDYrmi")>;<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VPORYrm")>;<br>
+def: InstRW<[BWWriteResGroup77], (instregex "VPXORYrm")>;<br>
+<br>
+def BWWriteResGroup78 : SchedWriteRes<[BWPort0,BWPort5]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup78], (instregex "MPSADBWrri")>;<br>
+def: InstRW<[BWWriteResGroup78], (instregex "VMPSADBWYrri")>;<br>
+def: InstRW<[BWWriteResGroup78], (instregex "VMPSADBWrri")>;<br>
+<br>
+def BWWriteResGroup79 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup79], (instregex "BLENDVPDrm0")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "BLENDVPSrm0")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "MMX_PACKSSDWirm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "MMX_PACKSSWBirm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "MMX_PACKUSWBirm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "PBLENDVBrm0")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VBLENDVPDrm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VBLENDVPSrm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VMASKMOVPDrm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VMASKMOVPSrm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VPBLENDVBrm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VPMASKMOVDrm")>;<br>
+def: InstRW<[BWWriteResGroup79], (instregex "VPMASKMOVQrm")>;<br>
+<br>
+def BWWriteResGroup80 : SchedWriteRes<[BWPort23,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup80], (instregex "LEAVE64")>;<br>
+def: InstRW<[BWWriteResGroup80], (instregex "SCASB")>;<br>
+def: InstRW<[BWWriteResGroup80], (instregex "SCASL")>;<br>
+def: InstRW<[BWWriteResGroup80], (instregex "SCASQ")>;<br>
+def: InstRW<[BWWriteResGroup80], (instregex "SCASW")>;<br>
+<br>
+def BWWriteResGroup81 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSLLDrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSLLQrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSLLWrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSRADrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSRAWrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSRLDrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSRLQrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PSRLWrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "PTESTrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSLLDrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSLLQrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSLLWrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSRADrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSRAWrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSRLDrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSRLQrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPSRLWrm")>;<br>
+def: InstRW<[BWWriteResGroup81], (instregex "VPTESTrm")>;<br>
+<br>
+def BWWriteResGroup82 : SchedWriteRes<[BWPort0,BWPort01,BWPort23]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup82], (instregex "FLDCW16m")>;<br>
+<br>
+def BWWriteResGroup83 : SchedWriteRes<[BWPort0,BWPort23,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup83], (instregex "LDMXCSR")>;<br>
+def: InstRW<[BWWriteResGroup83], (instregex "VLDMXCSR")>;<br>
+<br>
+def BWWriteResGroup84 : SchedWriteRes<[BWPort6,BWPort23,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup84], (instregex "LRETQ")>;<br>
+def: InstRW<[BWWriteResGroup84], (instregex "RETQ")>;<br>
+<br>
+def BWWriteResGroup85 : SchedWriteRes<[BWPort23,BWPort06,BWPort15]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup85], (instregex "BEXTR32rm")>;<br>
+def: InstRW<[BWWriteResGroup85], (instregex "BEXTR64rm")>;<br>
+<br>
+def BWWriteResGroup86 : SchedWriteRes<[BWPort23,BWPort06,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup86], (instregex "CMOVA(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup86], (instregex "CMOVBE(16|32|64)rm")>;<br>
+<br>
+def BWWriteResGroup87 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort06]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,1,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROL(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROL(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROL8m1")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROL8mi")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROR(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROR(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROR8m1")>;<br>
+def: InstRW<[BWWriteResGroup87], (instregex "ROR8mi")>;<br>
+<br>
+def BWWriteResGroup88 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,1,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup88], (instregex "XADD(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup88], (instregex "XADD8rm")>;<br>
+<br>
+def BWWriteResGroup89 : SchedWriteRes<[BWPort4,BWPort6,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup89], (instregex "CALL(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup89], (instregex "FARCALL64")>;<br>
+<br>
+def BWWriteResGroup90 : SchedWriteRes<[BWPort6,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 7;<br>
+  let NumMicroOps = 7;<br>
+  let ResourceCycles = [2,2,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup90], (instregex "LOOP")>;<br>
+<br>
+def BWWriteResGroup91 : SchedWriteRes<[BWPort1,BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup91], (instregex "ADDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "ADDPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "ADDSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "ADDSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "ADDSUBPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "ADDSUBPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "BSF(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "BSR(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "CMPPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "CMPPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "CMPSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "COMISDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "COMISSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "CVTDQ2PSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "CVTPS2DQrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "CVTTPS2DQrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "IMUL64m")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "IMUL(32|64)rm(i8?)")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "IMUL8m")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "LZCNT(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MAXPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MAXPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MAXSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MAXSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MINPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MINPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MINSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MINSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MMX_CVTPI2PSirm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MMX_CVTPS2PIirm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MMX_CVTTPS2PIirm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MUL64m")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "MUL8m")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "PDEP32rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "PDEP64rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "PEXT32rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "PEXT64rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "POPCNT(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "SUBPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "SUBPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "SUBSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "SUBSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "TZCNT(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "UCOMISDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "UCOMISSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VADDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VADDPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VADDSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VADDSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VADDSUBPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VADDSUBPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCMPPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCMPPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCMPSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCMPSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCOMISDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCOMISSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCVTDQ2PSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCVTPS2DQrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VCVTTPS2DQrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMAXPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMAXPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMAXSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMAXSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMINPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMINPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMINSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VMINSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VSUBPDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VSUBPSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VSUBSDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VSUBSSrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VUCOMISDrm")>;<br>
+def: InstRW<[BWWriteResGroup91], (instregex "VUCOMISSrm")>;<br>
+<br>
+def BWWriteResGroup91_16 : SchedWriteRes<[BWPort1, BWPort0156, BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup91_16], (instregex "IMUL16rm(i8?)")>;<br>
+<br>
+def BWWriteResGroup91_16_2 : SchedWriteRes<[BWPort1, BWPort0156, BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 5;<br>
+}<br>
+def: InstRW<[BWWriteResGroup91_16_2], (instregex "IMUL16m")>;<br>
+def: InstRW<[BWWriteResGroup91_16_2], (instregex "MUL16m")>;<br>
+<br>
+def BWWriteResGroup91_32 : SchedWriteRes<[BWPort1, BWPort0156, BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup91_32], (instregex "IMUL32m")>;<br>
+def: InstRW<[BWWriteResGroup91_32], (instregex "MUL32m")>;<br>
+<br>
+def BWWriteResGroup92 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVSXBDYrm")>;<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVSXBQYrm")>;<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVSXBWYrm")>;<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVSXDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVSXWDYrm")>;<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVSXWQYrm")>;<br>
+def: InstRW<[BWWriteResGroup92], (instregex "VPMOVZXWDYrm")>;<br>
+<br>
+def BWWriteResGroup93 : SchedWriteRes<[BWPort01,BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup93], (instregex "MULPDrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "MULPSrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "MULSDrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "MULSSrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "VMULPDrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "VMULPSrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "VMULSDrm")>;<br>
+def: InstRW<[BWWriteResGroup93], (instregex "VMULSSrm")>;<br>
+<br>
+def BWWriteResGroup94 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VBLENDVPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VBLENDVPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VMASKMOVPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VMASKMOVPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VPBLENDVBYrm")>;<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VPMASKMOVDYrm")>;<br>
+def: InstRW<[BWWriteResGroup94], (instregex "VPMASKMOVQYrm")>;<br>
+<br>
+def BWWriteResGroup95 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup95], (instregex "VPSLLVDrm")>;<br>
+def: InstRW<[BWWriteResGroup95], (instregex "VPSRAVDrm")>;<br>
+def: InstRW<[BWWriteResGroup95], (instregex "VPSRLVDrm")>;<br>
+<br>
+def BWWriteResGroup96 : SchedWriteRes<[BWPort5,BWPort23,BWPort15]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup96], (instregex "MMX_PHADDSWrm64")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "MMX_PHADDWrm64")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "MMX_PHADDrm64")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "MMX_PHSUBDrm64")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "MMX_PHSUBSWrm64")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "MMX_PHSUBWrm64")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "PHADDDrm")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "PHADDSWrm128")>;<o:p></o:p></p>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup96], (instregex "PHADDWrm")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "PHSUBDrm")>;<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup96], (instregex "PHSUBSWrm128")>;<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup96], (instregex "PHSUBWrm")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "VPHADDDrm")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "VPHADDSWrm128")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "VPHADDWrm")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "VPHSUBDrm")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "VPHSUBSWrm128")>;<br>
+def: InstRW<[BWWriteResGroup96], (instregex "VPHSUBWrm")>;<br>
+<br>
+def BWWriteResGroup97 : SchedWriteRes<[BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,1,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCL(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCL(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCL8m1")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCL8mi")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCR(16|32|64)m1")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCR(16|32|64)mi")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCR8m1")>;<br>
+def: InstRW<[BWWriteResGroup97], (instregex "RCR8mi")>;<br>
+<br>
+def BWWriteResGroup98 : SchedWriteRes<[BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,1,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup98], (instregex "ROR(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup98], (instregex "ROR8mCL")>;<br>
+<br>
+def BWWriteResGroup99 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 6;<br>
+  let ResourceCycles = [1,1,1,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup99], (instregex "ADC(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "ADC8mi")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "ADD8mi")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "AND8mi")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "OR8mi")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "SUB8mi")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "XCHG(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "XCHG8rm")>;<br>
+def: InstRW<[BWWriteResGroup99], (instregex "XOR8mi")>;<br>
+<br>
+def BWWriteResGroup100 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 8;<br>
+  let NumMicroOps = 6;<br>
+  let ResourceCycles = [1,1,1,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup100], (instregex "ADC(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "ADC8mr")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "CMPXCHG(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "CMPXCHG8rm")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "ROL(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "ROL8mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SAR(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SAR8mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SBB(16|32|64)mi8")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SBB(16|32|64)mr")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SBB8mi")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SBB8mr")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SHL(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SHL8mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SHR(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup100], (instregex "SHR8mCL")>;<br>
+<br>
+def BWWriteResGroup101 : SchedWriteRes<[BWPort1,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup101], (instregex "ADD_F32m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "ADD_F64m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "ILD_F16m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "ILD_F32m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "ILD_F64m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "SUBR_F32m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "SUBR_F64m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "SUB_F32m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "SUB_F64m")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VADDPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VADDPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VADDSUBPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VADDSUBPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VCMPPDYrmi")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VCMPPSYrmi")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VCVTDQ2PSYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VCVTPS2DQYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VCVTTPS2DQYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VMAXPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VMAXPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VMINPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VMINPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VSUBPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup101], (instregex "VSUBPSYrm")>;<br>
+<br>
+def BWWriteResGroup102 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPERM2F128rm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPERM2I128rm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPERMDYrm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPERMPDYmi")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPERMPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPERMQYmi")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPMOVZXBDYrm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPMOVZXBQYrm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPMOVZXBWYrm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPMOVZXDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup102], (instregex "VPMOVZXWQYrm")>;<br>
+<br>
+def BWWriteResGroup103 : SchedWriteRes<[BWPort01,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup103], (instregex "VMULPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup103], (instregex "VMULPSYrm")>;<br>
+<br>
+def BWWriteResGroup104 : SchedWriteRes<[BWPort0,BWPort1,BWPort5]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup104], (instregex "DPPDrri")>;<br>
+def: InstRW<[BWWriteResGroup104], (instregex "VDPPDrri")>;<br>
+<br>
+def BWWriteResGroup105 : SchedWriteRes<[BWPort0,BWPort1,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTSD2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTSD2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTSS2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTSS2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTTSD2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTTSD2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "CVTTSS2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTSD2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTSD2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTSS2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTSS2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTTSD2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTTSD2SIrm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTTSS2SI64rm")>;<br>
+def: InstRW<[BWWriteResGroup105], (instregex "VCVTTSS2SIrm")>;<br>
+<br>
+def BWWriteResGroup106 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup106], (instregex "VCVTPS2PDYrm")>;<br>
+<br>
+def BWWriteResGroup107 : SchedWriteRes<[BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup107], (instregex "CVTDQ2PDrm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "CVTPD2DQrm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "CVTPD2PSrm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "CVTSD2SSrm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "CVTTPD2DQrm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "MMX_CVTPD2PIirm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "MMX_CVTPI2PDirm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "MMX_CVTTPD2PIirm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "MULX64rm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "VCVTDQ2PDrm")>;<br>
+def: InstRW<[BWWriteResGroup107], (instregex "VCVTSD2SSrm")>;<br>
+<br>
+def BWWriteResGroup108 : SchedWriteRes<[BWPort5,BWPort23,BWPort015]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup108], (instregex "VPBROADCASTBYrm")>;<br>
+def: InstRW<[BWWriteResGroup108], (instregex "VPBROADCASTBrm")>;<br>
+def: InstRW<[BWWriteResGroup108], (instregex "VPBROADCASTWYrm")>;<br>
+def: InstRW<[BWWriteResGroup108], (instregex "VPBROADCASTWrm")>;<br>
+<br>
+def BWWriteResGroup109 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup109], (instregex "VPSLLVDYrm")>;<br>
+def: InstRW<[BWWriteResGroup109], (instregex "VPSRAVDYrm")>;<br>
+def: InstRW<[BWWriteResGroup109], (instregex "VPSRLVDYrm")>;<br>
+<br>
+def BWWriteResGroup110 : SchedWriteRes<[BWPort5,BWPort23,BWPort15]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup110], (instregex "VPHADDDYrm")>;<br>
+def: InstRW<[BWWriteResGroup110], (instregex "VPHADDSWrm256")>;<br>
+def: InstRW<[BWWriteResGroup110], (instregex "VPHADDWYrm")>;<br>
+def: InstRW<[BWWriteResGroup110], (instregex "VPHSUBDYrm")>;<br>
+def: InstRW<[BWWriteResGroup110], (instregex "VPHSUBSWrm256")>;<br>
+def: InstRW<[BWWriteResGroup110], (instregex "VPHSUBWYrm")>;<br>
+<br>
+def BWWriteResGroup111 : SchedWriteRes<[BWPort1,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup111], (instregex "SHLD(16|32|64)mri8")>;<br>
+def: InstRW<[BWWriteResGroup111], (instregex "SHRD(16|32|64)mri8")>;<br>
+<br>
+def BWWriteResGroup112 : SchedWriteRes<[BWPort23,BWPort06,BWPort0156]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,1,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup112], (instregex "RDRAND(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup113 : SchedWriteRes<[BWPort1,BWPort6,BWPort23,BWPort0156]> {<br>
+  let Latency = 9;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [1,2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup113], (instregex "LAR(16|32|64)rm")>;<br>
+def: InstRW<[BWWriteResGroup113], (instregex "LSL(16|32|64)rm")>;<br>
+<br>
+def BWWriteResGroup114 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup114], (instregex "PMULLDrr")>;<br>
+def: InstRW<[BWWriteResGroup114], (instregex "VPMULLDYrr")>;<br>
+def: InstRW<[BWWriteResGroup114], (instregex "VPMULLDrr")>;<br>
+<br>
+def BWWriteResGroup115 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMADDUBSWrm64")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMADDWDirm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMULHRSWrm64")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMULHUWirm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMULHWirm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMULLWirm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PMULUDQirm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "MMX_PSADBWirm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PCLMULQDQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PCMPGTQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PHMINPOSUWrm128")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMADDUBSWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMADDWDrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMULDQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMULHRSWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMULHUWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMULHWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMULLWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PMULUDQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "PSADBWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "RCPPSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "RCPSSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "RSQRTPSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "RSQRTSSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPCLMULQDQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPCMPGTQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPHMINPOSUWrm128")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMADDUBSWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMADDWDrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMULDQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMULHRSWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMULHUWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMULHWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMULLWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPMULUDQrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VPSADBWrm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VRCPPSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VRCPSSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VRSQRTPSm")>;<br>
+def: InstRW<[BWWriteResGroup115], (instregex "VRSQRTSSm")>;<br>
+<br>
+def BWWriteResGroup116 : SchedWriteRes<[BWPort01,BWPort23]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD132PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD132PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD132SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD132SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD213PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD213PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD213SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD213SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD231PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD231PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD231SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADD231SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADDSUB132PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADDSUB132PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADDSUB213PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADDSUB213PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADDSUB231PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMADDSUB231PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB132PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB132PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB132SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB132SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB213PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB213PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB213SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB213SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB231PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB231PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB231SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUB231SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUBADD132PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUBADD132PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUBADD213PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUBADD213PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUBADD231PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFMSUBADD231PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD132PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD132PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD132SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD132SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD213PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD213PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD213SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD213SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD231PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD231PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD231SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMADD231SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB132PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB132PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB132SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB132SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB213PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB213PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB213SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB213SSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB231PDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB231PSm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB231SDm")>;<br>
+def: InstRW<[BWWriteResGroup116], (instregex "VFNMSUB231SSm")>;<br>
+<br>
+def BWWriteResGroup117 : SchedWriteRes<[BWPort1,BWPort23]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup117], (instregex "FICOM16m")>;<br>
+def: InstRW<[BWWriteResGroup117], (instregex "FICOM32m")>;<br>
+def: InstRW<[BWWriteResGroup117], (instregex "FICOMP16m")>;<br>
+def: InstRW<[BWWriteResGroup117], (instregex "FICOMP32m")>;<br>
+<br>
+def BWWriteResGroup118 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup118], (instregex "VPTESTYrm")>;<br>
+<br>
+def BWWriteResGroup119 : SchedWriteRes<[BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup119], (instregex "HADDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "HADDPSrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "HSUBPDrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "HSUBPSrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "VHADDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "VHADDPSrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "VHSUBPDrm")>;<br>
+def: InstRW<[BWWriteResGroup119], (instregex "VHSUBPSrm")>;<br>
+<br>
+def BWWriteResGroup120 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup120], (instregex "CVTTSS2SI64rm")>;<br>
+<br>
+def BWWriteResGroup121 : SchedWriteRes<[BWPort1,BWPort23,BWPort06,BWPort0156]> {<br>
+  let Latency = 10;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup121], (instregex "MULX32rm")>;<br>
+<br>
+def BWWriteResGroup122 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup122], (instregex "DIVPSrr")>;<br>
+def: InstRW<[BWWriteResGroup122], (instregex "DIVSSrr")>;<br>
+def: InstRW<[BWWriteResGroup122], (instregex "VDIVPSrr")>;<br>
+def: InstRW<[BWWriteResGroup122], (instregex "VDIVSSrr")>;<br>
+<br>
+def BWWriteResGroup123 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup123], (instregex "MUL_F32m")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "MUL_F64m")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPCMPGTQYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMADDUBSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMADDWDYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMULDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMULHRSWYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMULHUWYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMULHWYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMULLWYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPMULUDQYrm")>;<br>
+def: InstRW<[BWWriteResGroup123], (instregex "VPSADBWYrm")>;<br>
+<br>
+def BWWriteResGroup124 : SchedWriteRes<[BWPort01,BWPort23]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADD132PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADD132PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADD213PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADD213PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADD231PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADD231PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADDSUB132PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADDSUB132PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADDSUB213PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADDSUB213PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADDSUB231PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMADDSUB231PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUB132PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUB132PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUB213PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUB213PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUB231PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUB231PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUBADD132PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUBADD132PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUBADD213PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUBADD213PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUBADD231PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFMSUBADD231PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMADD132PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMADD132PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMADD213PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMADD213PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMADD231PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMADD231PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMSUB132PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMSUB132PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMSUB213PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMSUB213PSYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMSUB231PDYm")>;<br>
+def: InstRW<[BWWriteResGroup124], (instregex "VFNMSUB231PSYm")>;<br>
+<br>
+def BWWriteResGroup125 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup125], (instregex "PCMPISTRIrr")>;<br>
+def: InstRW<[BWWriteResGroup125], (instregex "PCMPISTRM128rr")>;<br>
+def: InstRW<[BWWriteResGroup125], (instregex "VPCMPISTRIrr")>;<br>
+def: InstRW<[BWWriteResGroup125], (instregex "VPCMPISTRM128rr")>;<br>
+<br>
+def BWWriteResGroup126 : SchedWriteRes<[BWPort0,BWPort015]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup126], (instregex "VRCPPSYr")>;<br>
+def: InstRW<[BWWriteResGroup126], (instregex "VRSQRTPSYr")>;<br>
+<br>
+def BWWriteResGroup127 : SchedWriteRes<[BWPort1,BWPort23]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup127], (instregex "ROUNDPDm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "ROUNDPSm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "ROUNDSDm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "ROUNDSSm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "VROUNDPDm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "VROUNDPSm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "VROUNDSDm")>;<br>
+def: InstRW<[BWWriteResGroup127], (instregex "VROUNDSSm")>;<br>
+<br>
+def BWWriteResGroup128 : SchedWriteRes<[BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup128], (instregex "VCVTDQ2PDYrm")>;<br>
+<br>
+def BWWriteResGroup129 : SchedWriteRes<[BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup129], (instregex "VHADDPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup129], (instregex "VHADDPSYrm")>;<br>
+def: InstRW<[BWWriteResGroup129], (instregex "VHSUBPDYrm")>;<br>
+def: InstRW<[BWWriteResGroup129], (instregex "VHSUBPSYrm")>;<br>
+<br>
+def BWWriteResGroup130 : SchedWriteRes<[BWPort1,BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 6;<br>
+  let ResourceCycles = [1,1,1,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup130], (instregex "SHLD(16|32|64)mrCL")>;<br>
+def: InstRW<[BWWriteResGroup130], (instregex "SHRD(16|32|64)mrCL")>;<br>
+<br>
+def BWWriteResGroup131 : SchedWriteRes<[BWPort1,BWPort06,BWPort0156]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 7;<br>
+  let ResourceCycles = [2,2,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup131], (instregex "RCL(16|32|64)rCL")>;<br>
+def: InstRW<[BWWriteResGroup131], (instregex "RCR(16|32|64)rCL")>;<br>
+<br>
+def BWWriteResGroup132 : SchedWriteRes<[BWPort1,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [1,4,1,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup132], (instregex "RCL8rCL")>;<br>
+<br>
+def BWWriteResGroup133 : SchedWriteRes<[BWPort06,BWPort0156]> {<br>
+  let Latency = 11;<br>
+  let NumMicroOps = 11;<br>
+  let ResourceCycles = [2,9];<br>
+}<br>
+def: InstRW<[BWWriteResGroup133], (instregex "LOOPE")>;<br>
+def: InstRW<[BWWriteResGroup133], (instregex "LOOPNE")>;<br>
+<br>
+def BWWriteResGroup134 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 12;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup134], (instregex "AESDECLASTrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "AESDECrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "AESENCLASTrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "AESENCrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "VAESDECLASTrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "VAESDECrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "VAESENCLASTrm")>;<br>
+def: InstRW<[BWWriteResGroup134], (instregex "VAESENCrm")>;<br>
+<br>
+def BWWriteResGroup135 : SchedWriteRes<[BWPort1,BWPort23]> {<br>
+  let Latency = 12;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup135], (instregex "ADD_FI16m")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "ADD_FI32m")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "SUBR_FI16m")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "SUBR_FI32m")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "SUB_FI16m")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "SUB_FI32m")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "VROUNDYPDm")>;<br>
+def: InstRW<[BWWriteResGroup135], (instregex "VROUNDYPSm")>;<br>
+<br>
+def BWWriteResGroup136 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 12;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup136], (instregex "MPSADBWrmi")>;<br>
+def: InstRW<[BWWriteResGroup136], (instregex "VMPSADBWrmi")>;<br>
+<br>
+def BWWriteResGroup137 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 13;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup137], (instregex "SQRTPSr")>;<br>
+def: InstRW<[BWWriteResGroup137], (instregex "SQRTSSr")>;<br>
+<br>
+def BWWriteResGroup138 : SchedWriteRes<[BWPort0,BWPort5,BWPort23]> {<br>
+  let Latency = 13;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup138], (instregex "VMPSADBWYrmi")>;<br>
+<br>
+def BWWriteResGroup139 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup139], (instregex "DIVPDrr")>;<br>
+def: InstRW<[BWWriteResGroup139], (instregex "DIVSDrr")>;<br>
+def: InstRW<[BWWriteResGroup139], (instregex "VDIVPDrr")>;<br>
+def: InstRW<[BWWriteResGroup139], (instregex "VDIVSDrr")>;<br>
+def: InstRW<[BWWriteResGroup139], (instregex "VSQRTPSr")>;<br>
+def: InstRW<[BWWriteResGroup139], (instregex "VSQRTSSr")>;<br>
+<br>
+def BWWriteResGroup140 : SchedWriteRes<[BWPort5]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup140], (instregex "AESIMCrr")>;<br>
+def: InstRW<[BWWriteResGroup140], (instregex "VAESIMCrr")>;<br>
+<br>
+def BWWriteResGroup141 : SchedWriteRes<[BWPort0,BWPort1,BWPort23]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup141], (instregex "MUL_FI16m")>;<br>
+def: InstRW<[BWWriteResGroup141], (instregex "MUL_FI32m")>;<br>
+<br>
+def BWWriteResGroup142 : SchedWriteRes<[BWPort0,BWPort1,BWPort5]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup142], (instregex "DPPSrri")>;<br>
+def: InstRW<[BWWriteResGroup142], (instregex "VDPPSYrri")>;<br>
+def: InstRW<[BWWriteResGroup142], (instregex "VDPPSrri")>;<br>
+<br>
+def BWWriteResGroup143 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [1,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup143], (instregex "DPPDrmi")>;<br>
+def: InstRW<[BWWriteResGroup143], (instregex "VDPPDrmi")>;<br>
+<br>
+def BWWriteResGroup144 : SchedWriteRes<[BWPort1,BWPort6,BWPort23,BWPort0156]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 8;<br>
+  let ResourceCycles = [2,2,1,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup144], (instregex "LAR(16|32|64)rr")>;<br>
+<br>
+def BWWriteResGroup145 : SchedWriteRes<[BWPort1,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 10;<br>
+  let ResourceCycles = [2,3,1,4];<br>
+}<br>
+def: InstRW<[BWWriteResGroup145], (instregex "RCR8rCL")>;<br>
+<br>
+def BWWriteResGroup146 : SchedWriteRes<[BWPort0,BWPort1,BWPort6,BWPort0156]> {<br>
+  let Latency = 14;<br>
+  let NumMicroOps = 12;<br>
+  let ResourceCycles = [2,1,4,5];<br>
+}<br>
+def: InstRW<[BWWriteResGroup146], (instregex "XCH_F")>;<br>
+<br>
+def BWWriteResGroup147 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 15;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup147], (instregex "DIVR_FPrST0")>;<br>
+def: InstRW<[BWWriteResGroup147], (instregex "DIVR_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup147], (instregex "DIVR_FrST0")>;<br>
+<br>
+def BWWriteResGroup148 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 15;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup148], (instregex "PMULLDrm")>;<br>
+def: InstRW<[BWWriteResGroup148], (instregex "VPMULLDrm")>;<br>
+<br>
+def BWWriteResGroup149 : SchedWriteRes<[BWPort1,BWPort23,BWPort237,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 15;<br>
+  let NumMicroOps = 10;<br>
+  let ResourceCycles = [1,1,1,4,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup149], (instregex "RCL(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup149], (instregex "RCL8mCL")>;<br>
+<br>
+def BWWriteResGroup150 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 16;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup150], (instregex "DIVPSrm")>;<br>
+def: InstRW<[BWWriteResGroup150], (instregex "DIVSSrm")>;<br>
+def: InstRW<[BWWriteResGroup150], (instregex "VDIVPSrm")>;<br>
+def: InstRW<[BWWriteResGroup150], (instregex "VDIVSSrm")>;<br>
+<br>
+def BWWriteResGroup151 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 16;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup151], (instregex "VPMULLDYrm")>;<br>
+<br>
+def BWWriteResGroup152 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 16;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [3,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup152], (instregex "PCMPISTRIrm")>;<br>
+def: InstRW<[BWWriteResGroup152], (instregex "PCMPISTRM128rm")>;<br>
+def: InstRW<[BWWriteResGroup152], (instregex "VPCMPISTRIrm")>;<br>
+def: InstRW<[BWWriteResGroup152], (instregex "VPCMPISTRM128rm")>;<br>
+<br>
+def BWWriteResGroup153 : SchedWriteRes<[BWPort4,BWPort23,BWPort237,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 16;<br>
+  let NumMicroOps = 14;<br>
+  let ResourceCycles = [1,1,1,4,2,5];<br>
+}<br>
+def: InstRW<[BWWriteResGroup153], (instregex "CMPXCHG8B")>;<br>
+<br>
+def BWWriteResGroup154 : SchedWriteRes<[BWPort5]> {<br>
+  let Latency = 16;<br>
+  let NumMicroOps = 16;<br>
+  let ResourceCycles = [16];<br>
+}<br>
+def: InstRW<[BWWriteResGroup154], (instregex "VZEROALL")>;<br>
+<br>
+def BWWriteResGroup155 : SchedWriteRes<[BWPort0,BWPort015]> {<br>
+  let Latency = 17;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup155], (instregex "VDIVPSYrr")>;<br>
+<br>
+def BWWriteResGroup156 : SchedWriteRes<[BWPort0,BWPort23,BWPort015]> {<br>
+  let Latency = 17;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup156], (instregex "VRCPPSYm")>;<br>
+def: InstRW<[BWWriteResGroup156], (instregex "VRSQRTPSYm")>;<br>
+<br>
+def BWWriteResGroup157 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 18;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup157], (instregex "SQRTPSm")>;<br>
+def: InstRW<[BWWriteResGroup157], (instregex "SQRTSSm")>;<br>
+<br>
+def BWWriteResGroup158 : SchedWriteRes<[BWPort0,BWPort5,BWPort0156]> {<br>
+  let Latency = 18;<br>
+  let NumMicroOps = 8;<br>
+  let ResourceCycles = [4,3,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup158], (instregex "PCMPESTRIrr")>;<br>
+def: InstRW<[BWWriteResGroup158], (instregex "VPCMPESTRIrr")>;<br>
+<br>
+def BWWriteResGroup159 : SchedWriteRes<[BWPort5,BWPort6,BWPort06,BWPort0156]> {<br>
+  let Latency = 18;<br>
+  let NumMicroOps = 8;<br>
+  let ResourceCycles = [1,1,1,5];<br>
+}<br>
+def: InstRW<[BWWriteResGroup159], (instregex "CPUID")>;<br>
+def: InstRW<[BWWriteResGroup159], (instregex "RDTSC")>;<br>
+<br>
+def BWWriteResGroup160 : SchedWriteRes<[BWPort1,BWPort23,BWPort237,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 18;<br>
+  let NumMicroOps = 11;<br>
+  let ResourceCycles = [2,1,1,3,1,3];<br>
+}<br>
+def: InstRW<[BWWriteResGroup160], (instregex "RCR(16|32|64)mCL")>;<br>
+def: InstRW<[BWWriteResGroup160], (instregex "RCR8mCL")>;<br>
+<br>
+def BWWriteResGroup161 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 19;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup161], (instregex "DIVPDrm")>;<br>
+def: InstRW<[BWWriteResGroup161], (instregex "DIVSDrm")>;<br>
+def: InstRW<[BWWriteResGroup161], (instregex "VDIVPDrm")>;<br>
+def: InstRW<[BWWriteResGroup161], (instregex "VDIVSDrm")>;<br>
+def: InstRW<[BWWriteResGroup161], (instregex "VSQRTPSm")>;<br>
+def: InstRW<[BWWriteResGroup161], (instregex "VSQRTSSm")>;<br>
+<br>
+def BWWriteResGroup162 : SchedWriteRes<[BWPort5,BWPort23]> {<br>
+  let Latency = 19;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup162], (instregex "AESIMCrm")>;<br>
+def: InstRW<[BWWriteResGroup162], (instregex "VAESIMCrm")>;<br>
+<br>
+def BWWriteResGroup163 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 19;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [2,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup163], (instregex "DPPSrmi")>;<br>
+def: InstRW<[BWWriteResGroup163], (instregex "VDPPSrmi")>;<br>
+<br>
+def BWWriteResGroup164 : SchedWriteRes<[BWPort0,BWPort5,BWPort015,BWPort0156]> {<br>
+  let Latency = 19;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [4,3,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup164], (instregex "PCMPESTRM128rr")>;<br>
+def: InstRW<[BWWriteResGroup164], (instregex "VPCMPESTRM128rr")>;<br>
+<br>
+def BWWriteResGroup165 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 20;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup165], (instregex "DIV_FPrST0")>;<br>
+def: InstRW<[BWWriteResGroup165], (instregex "DIV_FST0r")>;<br>
+def: InstRW<[BWWriteResGroup165], (instregex "DIV_FrST0")>;<br>
+def: InstRW<[BWWriteResGroup165], (instregex "SQRTPDr")>;<br>
+def: InstRW<[BWWriteResGroup165], (instregex "SQRTSDr")>;<br>
+<br>
+def BWWriteResGroup166 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort23]> {<br>
+  let Latency = 20;<br>
+  let NumMicroOps = 5;<br>
+  let ResourceCycles = [2,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup166], (instregex "VDPPSYrmi")>;<br>
+<br>
+def BWWriteResGroup167 : SchedWriteRes<[BWPort4,BWPort5,BWPort6,BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 20;<br>
+  let NumMicroOps = 8;<br>
+  let ResourceCycles = [1,1,1,1,1,1,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup167], (instregex "INSB")>;<br>
+def: InstRW<[BWWriteResGroup167], (instregex "INSL")>;<br>
+def: InstRW<[BWWriteResGroup167], (instregex "INSW")>;<br>
+<br>
+def BWWriteResGroup168 : SchedWriteRes<[BWPort0]> {<br>
+  let Latency = 21;<br>
+  let NumMicroOps = 1;<br>
+  let ResourceCycles = [1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup168], (instregex "VSQRTPDr")>;<br>
+def: InstRW<[BWWriteResGroup168], (instregex "VSQRTSDr")>;<br>
+<br>
+def BWWriteResGroup169 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 21;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup169], (instregex "DIV_F32m")>;<br>
+def: InstRW<[BWWriteResGroup169], (instregex "DIV_F64m")>;<br>
+<br>
+def BWWriteResGroup170 : SchedWriteRes<[BWPort0,BWPort015]> {<br>
+  let Latency = 21;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup170], (instregex "VSQRTPSYr")>;<br>
+<br>
+def BWWriteResGroup171 : SchedWriteRes<[BWPort0,BWPort4,BWPort5,BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 21;<br>
+  let NumMicroOps = 19;<br>
+  let ResourceCycles = [2,1,4,1,1,4,6];<br>
+}<br>
+def: InstRW<[BWWriteResGroup171], (instregex "CMPXCHG16B")>;<br>
+<br>
+def BWWriteResGroup172 : SchedWriteRes<[BWPort6,BWPort23,BWPort0156]> {<br>
+  let Latency = 22;<br>
+  let NumMicroOps = 18;<br>
+  let ResourceCycles = [1,1,16];<br>
+}<br>
+def: InstRW<[BWWriteResGroup172], (instregex "POPF64")>;<br>
+<br>
+def BWWriteResGroup173 : SchedWriteRes<[BWPort0,BWPort015]> {<br>
+  let Latency = 23;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup173], (instregex "VDIVPDYrr")>;<br>
+<br>
+def BWWriteResGroup174 : SchedWriteRes<[BWPort0,BWPort23,BWPort015]> {<br>
+  let Latency = 23;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup174], (instregex "VDIVPSYrm")>;<br>
+<br>
+def BWWriteResGroup175 : SchedWriteRes<[BWPort0,BWPort5,BWPort23,BWPort0156]> {<br>
+  let Latency = 23;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [4,3,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup175], (instregex "PCMPESTRIrm")>;<br>
+def: InstRW<[BWWriteResGroup175], (instregex "VPCMPESTRIrm")>;<br>
+<br>
+def BWWriteResGroup176 : SchedWriteRes<[BWPort6,BWPort23,BWPort0156]> {<br>
+  let Latency = 23;<br>
+  let NumMicroOps = 19;<br>
+  let ResourceCycles = [3,1,15];<br>
+}<br>
+def: InstRW<[BWWriteResGroup176], (instregex "XRSTOR(64?)")>;<br>
+<br>
+def BWWriteResGroup177 : SchedWriteRes<[BWPort0,BWPort1,BWPort23]> {<br>
+  let Latency = 24;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup177], (instregex "DIV_FI16m")>;<br>
+def: InstRW<[BWWriteResGroup177], (instregex "DIV_FI32m")>;<br>
+<br>
+def BWWriteResGroup178 : SchedWriteRes<[BWPort0,BWPort5,BWPort23,BWPort015,BWPort0156]> {<br>
+  let Latency = 24;<br>
+  let NumMicroOps = 10;<br>
+  let ResourceCycles = [4,3,1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup178], (instregex "PCMPESTRM128rm")>;<br>
+def: InstRW<[BWWriteResGroup178], (instregex "VPCMPESTRM128rm")>;<br>
+<br>
+def BWWriteResGroup179 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 25;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup179], (instregex "SQRTPDm")>;<br>
+def: InstRW<[BWWriteResGroup179], (instregex "SQRTSDm")>;<br>
+<br>
+def BWWriteResGroup180 : SchedWriteRes<[BWPort0,BWPort23]> {<br>
+  let Latency = 26;<br>
+  let NumMicroOps = 2;<br>
+  let ResourceCycles = [1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup180], (instregex "DIVR_F32m")>;<br>
+def: InstRW<[BWWriteResGroup180], (instregex "DIVR_F64m")>;<br>
+def: InstRW<[BWWriteResGroup180], (instregex "VSQRTPDm")>;<br>
+def: InstRW<[BWWriteResGroup180], (instregex "VSQRTSDm")>;<br>
+<br>
+def BWWriteResGroup181 : SchedWriteRes<[BWPort0,BWPort23,BWPort015]> {<br>
+  let Latency = 27;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup181], (instregex "VSQRTPSYm")>;<br>
+<br>
+def BWWriteResGroup182 : SchedWriteRes<[BWPort0,BWPort1,BWPort23]> {<br>
+  let Latency = 29;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [1,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup182], (instregex "DIVR_FI16m")>;<br>
+def: InstRW<[BWWriteResGroup182], (instregex "DIVR_FI32m")>;<br>
+<br>
+def BWWriteResGroup183 : SchedWriteRes<[BWPort0,BWPort23,BWPort015]> {<br>
+  let Latency = 29;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183], (instregex "VDIVPDYrm")>;<br>
+<br>
+def BWWriteResGroup183_1 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 22;<br>
+  let NumMicroOps = 7;<br>
+  let ResourceCycles = [1,3,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183_1], (instregex "VGATHERQPDrm")>;<br>
+<br>
+def BWWriteResGroup183_2 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 23;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [1,3,4,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183_2], (instregex "VGATHERQPDYrm")>;<br>
+<br>
+def BWWriteResGroup183_3 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 24;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [1,5,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183_3], (instregex "VGATHERQPSYrm")>;<br>
+<br>
+def BWWriteResGroup183_4 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 25;<br>
+  let NumMicroOps = 7;<br>
+  let ResourceCycles = [1,3,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183_4], (instregex "VGATHERDPDrm")>;<br>
+def: InstRW<[BWWriteResGroup183_4], (instregex "VGATHERDPSrm")>;<br>
+<br>
+def BWWriteResGroup183_5 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 26;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [1,5,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183_5], (instregex "VGATHERDPDYrm")>;<br>
+<br>
+def BWWriteResGroup183_6 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 26;<br>
+  let NumMicroOps = 14;<br>
+  let ResourceCycles = [1,4,8,1];</p>
<p class="MsoNormal">+}</p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">+def: InstRW<[BWWriteResGroup183_6], (instregex "VGATHERDPSYrm")>;</p>
</div>
<div>
<p class="MsoNormal">+<br>
+def BWWriteResGroup183_7 : SchedWriteRes<[BWPort4, BWPort5, BWPort23, BWPort0156]> {<br>
+  let Latency = 27;<br>
+  let NumMicroOps = 9;<br>
+  let ResourceCycles = [1,5,2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup183_7], (instregex "VGATHERQPSrm")>;<br>
+<br>
+def BWWriteResGroup184 : SchedWriteRes<[BWPort0,BWPort5,BWPort015]> {<br>
+  let Latency = 29;<br>
+  let NumMicroOps = 11;<br>
+  let ResourceCycles = [2,7,2];<br>
+}<br>
+def: InstRW<[BWWriteResGroup184], (instregex "AESKEYGENASSIST128rr")>;<br>
+def: InstRW<[BWWriteResGroup184], (instregex "VAESKEYGENASSIST128rr")>;<br>
+<br>
+def BWWriteResGroup185 : SchedWriteRes<[BWPort4,BWPort6,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 29;<br>
+  let NumMicroOps = 27;<br>
+  let ResourceCycles = [1,5,1,1,19];<br>
+}<br>
+def: InstRW<[BWWriteResGroup185], (instregex "XSAVE64")>;<br>
+<br>
+def BWWriteResGroup186 : SchedWriteRes<[BWPort4,BWPort6,BWPort23,BWPort237,BWPort0156]> {<br>
+  let Latency = 30;<br>
+  let NumMicroOps = 28;<br>
+  let ResourceCycles = [1,6,1,1,19];<br>
+}<br>
+def: InstRW<[BWWriteResGroup186], (instregex "XSAVE(OPT?)")>;<br>
+<br>
+def BWWriteResGroup187 : SchedWriteRes<[BWPort01,BWPort15,BWPort015,BWPort0156]> {<br>
+  let Latency = 31;<br>
+  let NumMicroOps = 31;<br>
+  let ResourceCycles = [8,1,21,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup187], (instregex "MMX_EMMS")>;<br>
+<br>
+def BWWriteResGroup188 : SchedWriteRes<[BWPort0,BWPort5,BWPort23,BWPort015]> {<br>
+  let Latency = 33;<br>
+  let NumMicroOps = 11;<br>
+  let ResourceCycles = [2,7,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup188], (instregex "AESKEYGENASSIST128rm")>;<br>
+def: InstRW<[BWWriteResGroup188], (instregex "VAESKEYGENASSIST128rm")>;<br>
+<br>
+def BWWriteResGroup189 : SchedWriteRes<[BWPort0,BWPort015]> {<br>
+  let Latency = 34;<br>
+  let NumMicroOps = 3;<br>
+  let ResourceCycles = [2,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup189], (instregex "VSQRTPDYr")>;<br>
+<br>
+def BWWriteResGroup190 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort23,BWPort0156]> {<br>
+  let Latency = 34;<br>
+  let NumMicroOps = 8;<br>
+  let ResourceCycles = [2,2,2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup190], (instregex "DIV(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup190], (instregex "DIV8m")>;<br>
+<br>
+def BWWriteResGroup191 : SchedWriteRes<[BWPort5,BWPort6,BWPort23,BWPort06,BWPort0156]> {<br>
+  let Latency = 34;<br>
+  let NumMicroOps = 23;<br>
+  let ResourceCycles = [1,5,3,4,10];<br>
+}<br>
+def: InstRW<[BWWriteResGroup191], (instregex "IN32ri")>;<br>
+def: InstRW<[BWWriteResGroup191], (instregex "IN32rr")>;<br>
+def: InstRW<[BWWriteResGroup191], (instregex "IN8ri")>;<br>
+def: InstRW<[BWWriteResGroup191], (instregex "IN8rr")>;<br>
+<br>
+def BWWriteResGroup193 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort23,BWPort0156]> {<br>
+  let Latency = 35;<br>
+  let NumMicroOps = 8;<br>
+  let ResourceCycles = [2,2,2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup193], (instregex "IDIV(16|32|64)m")>;<br>
+def: InstRW<[BWWriteResGroup193], (instregex "IDIV8m")>;<br>
+<br>
+def BWWriteResGroup194 : SchedWriteRes<[BWPort5,BWPort6,BWPort23,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 35;<br>
+  let NumMicroOps = 23;<br>
+  let ResourceCycles = [1,5,2,1,4,10];<br>
+}<br>
+def: InstRW<[BWWriteResGroup194], (instregex "OUT32ir")>;<br>
+def: InstRW<[BWWriteResGroup194], (instregex "OUT32rr")>;<br>
+def: InstRW<[BWWriteResGroup194], (instregex "OUT8ir")>;<br>
+def: InstRW<[BWWriteResGroup194], (instregex "OUT8rr")>;<br>
+<br>
+def BWWriteResGroup195 : SchedWriteRes<[BWPort0,BWPort23,BWPort015]> {<br>
+  let Latency = 40;<br>
+  let NumMicroOps = 4;<br>
+  let ResourceCycles = [2,1,1];<br>
+}<br>
+def: InstRW<[BWWriteResGroup195], (instregex "VSQRTPDYm")>;<br>
+<br>
+def BWWriteResGroup196 : SchedWriteRes<[BWPort5,BWPort0156]> {<br>
+  let Latency = 42;<br>
+  let NumMicroOps = 22;<br>
+  let ResourceCycles = [2,20];<br>
+}<br>
+def: InstRW<[BWWriteResGroup196], (instregex "RDTSCP")>;<br>
+<br>
+def BWWriteResGroup197 : SchedWriteRes<[BWPort0,BWPort01,BWPort23,BWPort05,BWPort06,BWPort015,BWPort0156]> {<br>
+  let Latency = 60;<br>
+  let NumMicroOps = 64;<br>
+  let ResourceCycles = [2,2,8,1,10,2,39];<br>
+}<br>
+def: InstRW<[BWWriteResGroup197], (instregex "FLDENVm")>;<br>
+def: InstRW<[BWWriteResGroup197], (instregex "FLDENVm")>;<br>
+<br>
+def BWWriteResGroup198 : SchedWriteRes<[BWPort0,BWPort6,BWPort23,BWPort05,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 63;<br>
+  let NumMicroOps = 88;<br>
+  let ResourceCycles = [4,4,31,1,2,1,45];<br>
+}<br>
+def: InstRW<[BWWriteResGroup198], (instregex "FXRSTOR64")>;<br>
+<br>
+def BWWriteResGroup199 : SchedWriteRes<[BWPort0,BWPort6,BWPort23,BWPort05,BWPort06,BWPort15,BWPort0156]> {<br>
+  let Latency = 63;<br>
+  let NumMicroOps = 90;<br>
+  let ResourceCycles = [4,2,33,1,2,1,47];<br>
+}<br>
+def: InstRW<[BWWriteResGroup199], (instregex "FXRSTOR")>;<br>
+<br>
+def BWWriteResGroup200 : SchedWriteRes<[BWPort5,BWPort01,BWPort0156]> {<br>
+  let Latency = 75;<br>
+  let NumMicroOps = 15;<br>
+  let ResourceCycles = [6,3,6];<br>
+}<br>
+def: InstRW<[BWWriteResGroup200], (instregex "FNINIT")>;<br>
+<br>
+def BWWriteResGroup201 : SchedWriteRes<[BWPort0,BWPort1,BWPort5,BWPort6,BWPort01,BWPort0156]> {<br>
+  let Latency = 80;<br>
+  let NumMicroOps = 32;<br>
+  let ResourceCycles = [7,7,3,3,1,11];<br>
+}<br>
+def: InstRW<[BWWriteResGroup201], (instregex "DIV(16|32|64)r")>;<br>
+<br>
+def BWWriteResGroup202 : SchedWriteRes<[BWPort0,BWPort1,BWPort4,BWPort5,BWPort6,BWPort237,BWPort06,BWPort0156]> {<br>
+  let Latency = 115;<br>
+  let NumMicroOps = 100;<br>
+  let ResourceCycles = [9,9,11,8,1,11,21,30];<br>
+}<br>
+def: InstRW<[BWWriteResGroup202], (instregex "FSTENVm")>;<br>
+def: InstRW<[BWWriteResGroup202], (instregex "FSTENVm")>;<br>
+<br>
+} // SchedModel<br>
+<br>
<br>
Propchange: llvm/trunk/lib/Target/X86/X86SchedBroadwell.td<br>
------------------------------------------------------------------------------<br>
   svn:executable = *<br>
<br>
Modified: llvm/trunk/lib/Target/X86/X86Schedule.td<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86Schedule.td?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Target/X86/X86Schedule.td?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/lib/Target/X86/X86Schedule.td (original)<br>
+++ llvm/trunk/lib/Target/X86/X86Schedule.td Tue Oct 24 13:19:47 2017<br>
@@ -663,10 +663,10 @@ def GenericPostRAModel : GenericX86Model<br>
include "X86ScheduleAtom.td"<br>
include "X86SchedSandyBridge.td"<br>
include "X86SchedHaswell.td"<br>
+include "X86SchedBroadwell.td"<br>
include "X86ScheduleSLM.td"<br>
include "X86ScheduleZnver1.td"<br>
include "X86ScheduleBtVer2.td"<br>
include "X86SchedSkylakeClient.td"<br>
include "X86SchedSkylakeServer.td"<br>
<br>
-<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/aes-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/aes-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/aes-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/aes-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/aes-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -38,8 +38,8 @@ define <2 x i64> @test_aesdec(<2 x i64><br>
; BROADWELL-LABEL: test_aesdec:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaesdec %xmm1, %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    vaesdec (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaesdec (%rdi), %xmm0, %xmm0 # sched: [12:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_aesdec:<br>
; SKYLAKE:       # BB#0:<br>
@@ -93,8 +93,8 @@ define <2 x i64> @test_aesdeclast(<2 x i<br>
; BROADWELL-LABEL: test_aesdeclast:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaesdeclast %xmm1, %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    vaesdeclast (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaesdeclast (%rdi), %xmm0, %xmm0 # sched: [12:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_aesdeclast:<br>
; SKYLAKE:       # BB#0:<br>
@@ -148,8 +148,8 @@ define <2 x i64> @test_aesenc(<2 x i64><br>
; BROADWELL-LABEL: test_aesenc:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaesenc %xmm1, %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    vaesenc (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaesenc (%rdi), %xmm0, %xmm0 # sched: [12:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_aesenc:<br>
; SKYLAKE:       # BB#0:<br>
@@ -203,8 +203,8 @@ define <2 x i64> @test_aesenclast(<2 x i<br>
; BROADWELL-LABEL: test_aesenclast:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaesenclast %xmm1, %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    vaesenclast (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaesenclast (%rdi), %xmm0, %xmm0 # sched: [12:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_aesenclast:<br>
; SKYLAKE:       # BB#0:<br>
@@ -262,9 +262,9 @@ define <2 x i64> @test_aesimc(<2 x i64><br>
; BROADWELL-LABEL: test_aesimc:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaesimc %xmm0, %xmm0 # sched: [14:2.00]<br>
-; BROADWELL-NEXT:    vaesimc (%rdi), %xmm1 # sched: [14:2.00]<br>
+; BROADWELL-NEXT:    vaesimc (%rdi), %xmm1 # sched: [19:2.00]<br>
; BROADWELL-NEXT:    vpor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_aesimc:<br>
; SKYLAKE:       # BB#0:<br>
@@ -326,9 +326,9 @@ define <2 x i64> @test_aeskeygenassist(<<br>
; BROADWELL-LABEL: test_aeskeygenassist:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaeskeygenassist $7, %xmm0, %xmm0 # sched: [29:7.00]<br>
-; BROADWELL-NEXT:    vaeskeygenassist $7, (%rdi), %xmm1 # sched: [28:7.00]<br>
+; BROADWELL-NEXT:    vaeskeygenassist $7, (%rdi), %xmm1 # sched: [33:7.00]<br>
; BROADWELL-NEXT:    vpor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_aeskeygenassist:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/avx-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/avx-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/avx-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/avx-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/avx-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -31,8 +31,8 @@ define <4 x double> @test_addpd(<4 x dou<br>
; BROADWELL-LABEL: test_addpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddpd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddpd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -85,8 +85,8 @@ define <8 x float> @test_addps(<8 x floa<br>
; BROADWELL-LABEL: test_addps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -139,8 +139,8 @@ define <4 x double> @test_addsubpd(<4 x<br>
; BROADWELL-LABEL: test_addsubpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddsubpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddsubpd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddsubpd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addsubpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -194,8 +194,8 @@ define <8 x float> @test_addsubps(<8 x f<br>
; BROADWELL-LABEL: test_addsubps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddsubps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddsubps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddsubps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addsubps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -252,9 +252,9 @@ define <4 x double> @test_andnotpd(<4 x<br>
; BROADWELL-LABEL: test_andnotpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandnpd %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandnpd (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vandnpd (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andnotpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -321,9 +321,9 @@ define <8 x float> @test_andnotps(<8 x f<br>
; BROADWELL-LABEL: test_andnotps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandnps %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandnps (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vandnps (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andnotps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -390,9 +390,9 @@ define <4 x double> @test_andpd(<4 x dou<br>
; BROADWELL-LABEL: test_andpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandpd %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandpd (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vandpd (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -457,9 +457,9 @@ define <8 x float> @test_andps(<8 x floa<br>
; BROADWELL-LABEL: test_andps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandps %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandps (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vandps (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -525,8 +525,8 @@ define <4 x double> @test_blendpd(<4 x d<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendpd {{.*#+}} ymm0 = ymm0[0],ymm1[1,2],ymm0[3] sched: [1:0.33]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vblendpd {{.*#+}} ymm0 = ymm0[0],mem[1,2],ymm0[3] sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendpd {{.*#+}} ymm0 = ymm0[0],mem[1,2],ymm0[3] sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -584,8 +584,8 @@ define <8 x float> @test_blendps(<8 x fl<br>
; BROADWELL-LABEL: test_blendps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendps {{.*#+}} ymm0 = ymm0[0],ymm1[1,2],ymm0[3,4,5,6,7] sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vblendps {{.*#+}} ymm0 = ymm0[0,1],mem[2],ymm0[3],mem[4,5,6],ymm0[7] sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendps {{.*#+}} ymm0 = ymm0[0,1],mem[2],ymm0[3],mem[4,5,6],ymm0[7] sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -638,8 +638,8 @@ define <4 x double> @test_blendvpd(<4 x<br>
; BROADWELL-LABEL: test_blendvpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendvpd %ymm2, %ymm1, %ymm0, %ymm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vblendvpd %ymm2, (%rdi), %ymm0, %ymm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendvpd %ymm2, (%rdi), %ymm0, %ymm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendvpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -693,8 +693,8 @@ define <8 x float> @test_blendvps(<8 x f<br>
; BROADWELL-LABEL: test_blendvps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendvps %ymm2, %ymm1, %ymm0, %ymm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vblendvps %ymm2, (%rdi), %ymm0, %ymm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendvps %ymm2, (%rdi), %ymm0, %ymm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendvps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -744,8 +744,8 @@ define <8 x float> @test_broadcastf128(<<br>
;<br>
; BROADWELL-LABEL: test_broadcastf128:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vbroadcastf128 {{.*#+}} ymm0 = mem[0,1,0,1] sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vbroadcastf128 {{.*#+}} ymm0 = mem[0,1,0,1] sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastf128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -789,8 +789,8 @@ define <4 x double> @test_broadcastsd_ym<br>
;<br>
; BROADWELL-LABEL: test_broadcastsd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vbroadcastsd (%rdi), %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vbroadcastsd (%rdi), %ymm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastsd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -835,8 +835,8 @@ define <4 x float> @test_broadcastss(flo<br>
;<br>
; BROADWELL-LABEL: test_broadcastss:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vbroadcastss (%rdi), %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vbroadcastss (%rdi), %xmm0 # sched: [5:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -881,8 +881,8 @@ define <8 x float> @test_broadcastss_ymm<br>
;<br>
; BROADWELL-LABEL: test_broadcastss_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vbroadcastss (%rdi), %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vbroadcastss (%rdi), %ymm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastss_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -934,9 +934,9 @@ define <4 x double> @test_cmppd(<4 x dou<br>
; BROADWELL-LABEL: test_cmppd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcmpeqpd %ymm1, %ymm0, %ymm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcmpeqpd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcmpeqpd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vorpd %ymm0, %ymm1, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cmppd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1002,9 +1002,9 @@ define <8 x float> @test_cmpps(<8 x floa<br>
; BROADWELL-LABEL: test_cmpps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcmpeqps %ymm1, %ymm0, %ymm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcmpeqps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcmpeqps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vorps %ymm0, %ymm1, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cmpps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1070,9 +1070,9 @@ define <4 x double> @test_cvtdq2pd(<4 x<br>
; BROADWELL-LABEL: test_cvtdq2pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtdq2pd %xmm0, %ymm0 # sched: [6:1.00]<br>
-; BROADWELL-NEXT:    vcvtdq2pd (%rdi), %ymm1 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    vcvtdq2pd (%rdi), %ymm1 # sched: [11:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtdq2pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1135,9 +1135,9 @@ define <8 x float> @test_cvtdq2ps(<8 x i<br>
; BROADWELL-LABEL: test_cvtdq2ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtdq2ps %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcvtdq2ps (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcvtdq2ps (%rdi), %ymm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtdq2ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1198,9 +1198,9 @@ define <8 x i32> @test_cvtpd2dq(<4 x dou<br>
; BROADWELL-LABEL: test_cvtpd2dq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttpd2dq %ymm0, %xmm0 # sched: [6:1.00]<br>
-; BROADWELL-NEXT:    vcvttpd2dqy (%rdi), %xmm1 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcvttpd2dqy (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vinsertf128 $1, %xmm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpd2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1261,9 +1261,9 @@ define <8 x float> @test_cvtpd2ps(<4 x d<br>
; BROADWELL-LABEL: test_cvtpd2ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtpd2ps %ymm0, %xmm0 # sched: [6:1.00]<br>
-; BROADWELL-NEXT:    vcvtpd2psy (%rdi), %xmm1 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcvtpd2psy (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vinsertf128 $1, %xmm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpd2ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1324,9 +1324,9 @@ define <8 x i32> @test_cvtps2dq(<8 x flo<br>
; BROADWELL-LABEL: test_cvtps2dq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttps2dq %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcvttps2dq (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcvttps2dq (%rdi), %ymm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vorps %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtps2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1383,9 +1383,9 @@ define <4 x double> @test_divpd(<4 x dou<br>
;<br>
; BROADWELL-LABEL: test_divpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vdivpd %ymm1, %ymm0, %ymm0 # sched: [35:2.00]<br>
-; BROADWELL-NEXT:    vdivpd (%rdi), %ymm0, %ymm0 # sched: [35:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdivpd %ymm1, %ymm0, %ymm0 # sched: [23:2.00]<br>
+; BROADWELL-NEXT:    vdivpd (%rdi), %ymm0, %ymm0 # sched: [29:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_divpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1437,9 +1437,9 @@ define <8 x float> @test_divps(<8 x floa<br>
;<br>
; BROADWELL-LABEL: test_divps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vdivps %ymm1, %ymm0, %ymm0 # sched: [21:2.00]<br>
-; BROADWELL-NEXT:    vdivps (%rdi), %ymm0, %ymm0 # sched: [21:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdivps %ymm1, %ymm0, %ymm0 # sched: [17:2.00]<br>
+; BROADWELL-NEXT:    vdivps (%rdi), %ymm0, %ymm0 # sched: [23:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_divps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1492,8 +1492,8 @@ define <8 x float> @test_dpps(<8 x float<br>
; BROADWELL-LABEL: test_dpps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vdpps $7, %ymm1, %ymm0, %ymm0 # sched: [14:2.00]<br>
-; BROADWELL-NEXT:    vdpps $7, (%rdi), %ymm0, %ymm0 # sched: [14:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdpps $7, (%rdi), %ymm0, %ymm0 # sched: [20:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_dpps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1552,7 +1552,7 @@ define <4 x float> @test_extractf128(<8<br>
; BROADWELL-NEXT:    vextractf128 $1, %ymm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vextractf128 $1, %ymm1, (%rdi) # sched: [1:1.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_extractf128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1608,8 +1608,8 @@ define <4 x double> @test_haddpd(<4 x do<br>
; BROADWELL-LABEL: test_haddpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhaddpd %ymm1, %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhaddpd (%rdi), %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhaddpd (%rdi), %ymm0, %ymm0 # sched: [11:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_haddpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1663,8 +1663,8 @@ define <8 x float> @test_haddps(<8 x flo<br>
; BROADWELL-LABEL: test_haddps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhaddps %ymm1, %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhaddps (%rdi), %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhaddps (%rdi), %ymm0, %ymm0 # sched: [11:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_haddps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1718,8 +1718,8 @@ define <4 x double> @test_hsubpd(<4 x do<br>
; BROADWELL-LABEL: test_hsubpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhsubpd %ymm1, %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhsubpd (%rdi), %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhsubpd (%rdi), %ymm0, %ymm0 # sched: [11:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_hsubpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1773,8 +1773,8 @@ define <8 x float> @test_hsubps(<8 x flo<br>
; BROADWELL-LABEL: test_hsubps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhsubps %ymm1, %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhsubps (%rdi), %ymm0, %ymm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhsubps (%rdi), %ymm0, %ymm0 # sched: [11:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_hsubps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1831,9 +1831,9 @@ define <8 x float> @test_insertf128(<8 x<br>
; BROADWELL-LABEL: test_insertf128:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vinsertf128 $1, %xmm1, %ymm0, %ymm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vinsertf128 $1, (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vinsertf128 $1, (%rdi), %ymm0, %ymm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_insertf128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1889,8 +1889,8 @@ define <32 x i8> @test_lddqu(i8* %a0) {<br>
;<br>
; BROADWELL-LABEL: test_lddqu:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vlddqu (%rdi), %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vlddqu (%rdi), %ymm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lddqu:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1940,10 +1940,10 @@ define <2 x double> @test_maskmovpd(i8*<br>
;<br>
; BROADWELL-LABEL: test_maskmovpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmaskmovpd (%rdi), %xmm0, %xmm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vmaskmovpd %xmm1, %xmm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vmaskmovpd (%rdi), %xmm0, %xmm2 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    vmaskmovpd %xmm1, %xmm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovapd %xmm2, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maskmovpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2003,10 +2003,10 @@ define <4 x double> @test_maskmovpd_ymm(<br>
;<br>
; BROADWELL-LABEL: test_maskmovpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmaskmovpd (%rdi), %ymm0, %ymm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vmaskmovpd %ymm1, %ymm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vmaskmovpd (%rdi), %ymm0, %ymm2 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    vmaskmovpd %ymm1, %ymm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovapd %ymm2, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maskmovpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2066,10 +2066,10 @@ define <4 x float> @test_maskmovps(i8* %<br>
;<br>
; BROADWELL-LABEL: test_maskmovps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmaskmovps (%rdi), %xmm0, %xmm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vmaskmovps %xmm1, %xmm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vmaskmovps (%rdi), %xmm0, %xmm2 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    vmaskmovps %xmm1, %xmm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovaps %xmm2, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maskmovps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2129,10 +2129,10 @@ define <8 x float> @test_maskmovps_ymm(i<br>
;<br>
; BROADWELL-LABEL: test_maskmovps_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmaskmovps (%rdi), %ymm0, %ymm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vmaskmovps %ymm1, %ymm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vmaskmovps (%rdi), %ymm0, %ymm2 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    vmaskmovps %ymm1, %ymm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovaps %ymm2, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maskmovps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2190,8 +2190,8 @@ define <4 x double> @test_maxpd(<4 x dou<br>
; BROADWELL-LABEL: test_maxpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaxpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vmaxpd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmaxpd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maxpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2245,8 +2245,8 @@ define <8 x float> @test_maxps(<8 x floa<br>
; BROADWELL-LABEL: test_maxps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaxps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vmaxps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmaxps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maxps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2300,8 +2300,8 @@ define <4 x double> @test_minpd(<4 x dou<br>
; BROADWELL-LABEL: test_minpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vminpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vminpd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vminpd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_minpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2355,8 +2355,8 @@ define <8 x float> @test_minps(<8 x floa<br>
; BROADWELL-LABEL: test_minps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vminps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vminps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vminps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_minps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2412,10 +2412,10 @@ define <4 x double> @test_movapd(<4 x do<br>
;<br>
; BROADWELL-LABEL: test_movapd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovapd (%rdi), %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovapd (%rdi), %ymm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovapd %ymm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movapd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2474,10 +2474,10 @@ define <8 x float> @test_movaps(<8 x flo<br>
;<br>
; BROADWELL-LABEL: test_movaps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovaps (%rdi), %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovaps (%rdi), %ymm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovaps %ymm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movaps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2537,9 +2537,9 @@ define <4 x double> @test_movddup(<4 x d<br>
; BROADWELL-LABEL: test_movddup:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovddup {{.*#+}} ymm0 = ymm0[0,0,2,2] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovddup {{.*#+}} ymm1 = mem[0,0,2,2] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovddup {{.*#+}} ymm1 = mem[0,0,2,2] sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movddup:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2598,7 +2598,7 @@ define i32 @test_movmskpd(<4 x double> %<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovmskpd %ymm0, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movmskpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2650,7 +2650,7 @@ define i32 @test_movmskps(<8 x float> %a<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovmskps %ymm0, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movmskps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2702,7 +2702,7 @@ define <4 x double> @test_movntpd(<4 x d<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovntpd %ymm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2755,7 +2755,7 @@ define <8 x float> @test_movntps(<8 x fl<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovntps %ymm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2810,9 +2810,9 @@ define <8 x float> @test_movshdup(<8 x f<br>
; BROADWELL-LABEL: test_movshdup:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovshdup {{.*#+}} ymm0 = ymm0[1,1,3,3,5,5,7,7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovshdup {{.*#+}} ymm1 = mem[1,1,3,3,5,5,7,7] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovshdup {{.*#+}} ymm1 = mem[1,1,3,3,5,5,7,7] sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movshdup:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2873,9 +2873,9 @@ define <8 x float> @test_movsldup(<8 x f<br>
; BROADWELL-LABEL: test_movsldup:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovsldup {{.*#+}} ymm0 = ymm0[0,0,2,2,4,4,6,6] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovsldup {{.*#+}} ymm1 = mem[0,0,2,2,4,4,6,6] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovsldup {{.*#+}} ymm1 = mem[0,0,2,2,4,4,6,6] sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movsldup:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2937,10 +2937,10 @@ define <4 x double> @test_movupd(<4 x do<br>
;<br>
; BROADWELL-LABEL: test_movupd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovupd (%rdi), %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovupd (%rdi), %ymm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovupd %ymm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movupd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3001,10 +3001,10 @@ define <8 x float> @test_movups(<8 x flo<br>
;<br>
; BROADWELL-LABEL: test_movups:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovups (%rdi), %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovups (%rdi), %ymm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovups %ymm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movups:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3060,9 +3060,9 @@ define <4 x double> @test_mulpd(<4 x dou<br>
;<br>
; BROADWELL-LABEL: test_mulpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmulpd %ymm1, %ymm0, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vmulpd (%rdi), %ymm0, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmulpd %ymm1, %ymm0, %ymm0 # sched: [3:0.50]<br>
+; BROADWELL-NEXT:    vmulpd (%rdi), %ymm0, %ymm0 # sched: [9:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3114,9 +3114,9 @@ define <8 x float> @test_mulps(<8 x floa<br>
;<br>
; BROADWELL-LABEL: test_mulps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmulps %ymm1, %ymm0, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vmulps (%rdi), %ymm0, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmulps %ymm1, %ymm0, %ymm0 # sched: [3:0.50]<br>
+; BROADWELL-NEXT:    vmulps (%rdi), %ymm0, %ymm0 # sched: [9:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3172,9 +3172,9 @@ define <4 x double> @orpd(<4 x double> %<br>
; BROADWELL-LABEL: orpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vorpd %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vorpd (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vorpd (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: orpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3239,9 +3239,9 @@ define <8 x float> @test_orps(<8 x float<br>
; BROADWELL-LABEL: test_orps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vorps %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vorps (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vorps (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_orps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3306,9 +3306,9 @@ define <4 x double> @test_perm2f128(<4 x<br>
; BROADWELL-LABEL: test_perm2f128:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vperm2f128 {{.*#+}} ymm1 = ymm0[2,3],ymm1[0,1] sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vperm2f128 {{.*#+}} ymm0 = ymm0[2,3],mem[0,1] sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vperm2f128 {{.*#+}} ymm0 = ymm0[2,3],mem[0,1] sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_perm2f128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3369,9 +3369,9 @@ define <2 x double> @test_permilpd(<2 x<br>
; BROADWELL-LABEL: test_permilpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilpd {{.*#+}} xmm0 = xmm0[1,0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilpd {{.*#+}} xmm1 = mem[1,0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpermilpd {{.*#+}} xmm1 = mem[1,0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3432,9 +3432,9 @@ define <4 x double> @test_permilpd_ymm(<<br>
; BROADWELL-LABEL: test_permilpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilpd {{.*#+}} ymm0 = ymm0[1,0,2,3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilpd {{.*#+}} ymm1 = mem[1,0,2,3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpermilpd {{.*#+}} ymm1 = mem[1,0,2,3] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3495,9 +3495,9 @@ define <4 x float> @test_permilps(<4 x f<br>
; BROADWELL-LABEL: test_permilps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilps {{.*#+}} xmm0 = xmm0[3,2,1,0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilps {{.*#+}} xmm1 = mem[3,2,1,0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpermilps {{.*#+}} xmm1 = mem[3,2,1,0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3558,9 +3558,9 @@ define <8 x float> @test_permilps_ymm(<8<br>
; BROADWELL-LABEL: test_permilps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilps {{.*#+}} ymm0 = ymm0[3,2,1,0,7,6,5,4] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilps {{.*#+}} ymm1 = mem[3,2,1,0,7,6,5,4] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpermilps {{.*#+}} ymm1 = mem[3,2,1,0,7,6,5,4] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3618,8 +3618,8 @@ define <2 x double> @test_permilvarpd(<2<br>
; BROADWELL-LABEL: test_permilvarpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilpd %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilpd (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpermilpd (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilvarpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3673,8 +3673,8 @@ define <4 x double> @test_permilvarpd_ym<br>
; BROADWELL-LABEL: test_permilvarpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilpd %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilpd (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpermilpd (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilvarpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3728,8 +3728,8 @@ define <4 x float> @test_permilvarps(<4<br>
; BROADWELL-LABEL: test_permilvarps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilps %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilps (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpermilps (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permilvarps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3783,8 +3783,8 @@ define <8 x float> @test_permilvarps_ymm<br>
; BROADWELL-LABEL: test_permilvarps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermilps %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpermilps (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">+; BROADWELL-NEXT:    vpermilps (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">;<o:p></o:p></p>
<p class="MsoNormal">; SKYLAKE-LABEL: test_permilvarps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3840,10 +3840,10 @@ define <8 x float> @test_rcpps(<8 x floa<br>
;<br>
; BROADWELL-LABEL: test_rcpps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vrcpps (%rdi), %ymm1 # sched: [11:2.00]<br>
+; BROADWELL-NEXT:    vrcpps (%rdi), %ymm1 # sched: [17:2.00]<br>
; BROADWELL-NEXT:    vrcpps %ymm0, %ymm0 # sched: [11:2.00]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rcpps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3904,10 +3904,10 @@ define <4 x double> @test_roundpd(<4 x d<br>
;<br>
; BROADWELL-LABEL: test_roundpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vroundpd $7, %ymm0, %ymm0 # sched: [5:1.25]<br>
-; BROADWELL-NEXT:    vroundpd $7, (%rdi), %ymm1 # sched: [6:2.00]<br>
+; BROADWELL-NEXT:    vroundpd $7, %ymm0, %ymm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    vroundpd $7, (%rdi), %ymm1 # sched: [12:2.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_roundpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3968,10 +3968,10 @@ define <8 x float> @test_roundps(<8 x fl<br>
;<br>
; BROADWELL-LABEL: test_roundps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vroundps $7, %ymm0, %ymm0 # sched: [5:1.25]<br>
-; BROADWELL-NEXT:    vroundps $7, (%rdi), %ymm1 # sched: [6:2.00]<br>
+; BROADWELL-NEXT:    vroundps $7, %ymm0, %ymm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    vroundps $7, (%rdi), %ymm1 # sched: [12:2.00]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_roundps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4032,10 +4032,10 @@ define <8 x float> @test_rsqrtps(<8 x fl<br>
;<br>
; BROADWELL-LABEL: test_rsqrtps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vrsqrtps (%rdi), %ymm1 # sched: [11:2.00]<br>
+; BROADWELL-NEXT:    vrsqrtps (%rdi), %ymm1 # sched: [17:2.00]<br>
; BROADWELL-NEXT:    vrsqrtps %ymm0, %ymm0 # sched: [11:2.00]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rsqrtps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4097,9 +4097,9 @@ define <4 x double> @test_shufpd(<4 x do<br>
; BROADWELL-LABEL: test_shufpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vshufpd {{.*#+}} ymm0 = ymm0[1],ymm1[0],ymm0[2],ymm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vshufpd {{.*#+}} ymm1 = ymm1[1],mem[0],ymm1[2],mem[3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vshufpd {{.*#+}} ymm1 = ymm1[1],mem[0],ymm1[2],mem[3] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shufpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4157,8 +4157,8 @@ define <8 x float> @test_shufps(<8 x flo<br>
; BROADWELL-LABEL: test_shufps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vshufps {{.*#+}} ymm0 = ymm0[0,0],ymm1[0,0],ymm0[4,4],ymm1[4,4] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vshufps {{.*#+}} ymm0 = ymm0[0,3],mem[0,0],ymm0[4,7],mem[4,4] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vshufps {{.*#+}} ymm0 = ymm0[0,3],mem[0,0],ymm0[4,7],mem[4,4] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shufps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4213,10 +4213,10 @@ define <4 x double> @test_sqrtpd(<4 x do<br>
;<br>
; BROADWELL-LABEL: test_sqrtpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vsqrtpd (%rdi), %ymm1 # sched: [35:2.00]<br>
-; BROADWELL-NEXT:    vsqrtpd %ymm0, %ymm0 # sched: [35:2.00]<br>
+; BROADWELL-NEXT:    vsqrtpd (%rdi), %ymm1 # sched: [40:2.00]<br>
+; BROADWELL-NEXT:    vsqrtpd %ymm0, %ymm0 # sched: [34:2.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sqrtpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4277,10 +4277,10 @@ define <8 x float> @test_sqrtps(<8 x flo<br>
;<br>
; BROADWELL-LABEL: test_sqrtps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vsqrtps (%rdi), %ymm1 # sched: [21:2.00]<br>
+; BROADWELL-NEXT:    vsqrtps (%rdi), %ymm1 # sched: [27:2.00]<br>
; BROADWELL-NEXT:    vsqrtps %ymm0, %ymm0 # sched: [21:2.00]<br>
; BROADWELL-NEXT:    vaddps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sqrtps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4339,8 +4339,8 @@ define <4 x double> @test_subpd(<4 x dou<br>
; BROADWELL-LABEL: test_subpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsubpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vsubpd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vsubpd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_subpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4393,8 +4393,8 @@ define <8 x float> @test_subps(<8 x floa<br>
; BROADWELL-LABEL: test_subps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsubps %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vsubps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vsubps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_subps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4458,9 +4458,9 @@ define i32 @test_testpd(<2 x double> %a0<br>
; BROADWELL-NEXT:    xorl %eax, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    vtestpd %xmm1, %xmm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    setb %al # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vtestpd (%rdi), %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    adcl $0, %eax # sched: [2:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vtestpd (%rdi), %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    adcl $0, %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_testpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4541,10 +4541,10 @@ define i32 @test_testpd_ymm(<4 x double><br>
; BROADWELL-NEXT:    xorl %eax, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    vtestpd %ymm1, %ymm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    setb %al # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vtestpd (%rdi), %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    adcl $0, %eax # sched: [2:0.50]<br>
+; BROADWELL-NEXT:    vtestpd (%rdi), %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    adcl $0, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_testpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4625,9 +4625,9 @@ define i32 @test_testps(<4 x float> %a0,<br>
; BROADWELL-NEXT:    xorl %eax, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    vtestps %xmm1, %xmm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    setb %al # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vtestps (%rdi), %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    adcl $0, %eax # sched: [2:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vtestps (%rdi), %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    adcl $0, %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_testps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4708,10 +4708,10 @@ define i32 @test_testps_ymm(<8 x float><br>
; BROADWELL-NEXT:    xorl %eax, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    vtestps %ymm1, %ymm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    setb %al # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vtestps (%rdi), %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    adcl $0, %eax # sched: [2:0.50]<br>
+; BROADWELL-NEXT:    vtestps (%rdi), %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    adcl $0, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_testps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4784,9 +4784,9 @@ define <4 x double> @test_unpckhpd(<4 x<br>
; BROADWELL-LABEL: test_unpckhpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpckhpd {{.*#+}} ymm0 = ymm0[1],ymm1[1],ymm0[3],ymm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpckhpd {{.*#+}} ymm1 = ymm1[1],mem[1],ymm1[3],mem[3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vunpckhpd {{.*#+}} ymm1 = ymm1[1],mem[1],ymm1[3],mem[3] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpckhpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4844,8 +4844,8 @@ define <8 x float> @test_unpckhps(<8 x f<br>
; BROADWELL-LABEL: test_unpckhps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpckhps {{.*#+}} ymm0 = ymm0[2],ymm1[2],ymm0[3],ymm1[3],ymm0[6],ymm1[6],ymm0[7],ymm1[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpckhps {{.*#+}} ymm0 = ymm0[2],mem[2],ymm0[3],mem[3],ymm0[6],mem[6],ymm0[7],mem[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vunpckhps {{.*#+}} ymm0 = ymm0[2],mem[2],ymm0[3],mem[3],ymm0[6],mem[6],ymm0[7],mem[7] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpckhps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4901,9 +4901,9 @@ define <4 x double> @test_unpcklpd(<4 x<br>
; BROADWELL-LABEL: test_unpcklpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpcklpd {{.*#+}} ymm0 = ymm0[0],ymm1[0],ymm0[2],ymm1[2] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpcklpd {{.*#+}} ymm1 = ymm1[0],mem[0],ymm1[2],mem[2] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vunpcklpd {{.*#+}} ymm1 = ymm1[0],mem[0],ymm1[2],mem[2] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpcklpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4961,8 +4961,8 @@ define <8 x float> @test_unpcklps(<8 x f<br>
; BROADWELL-LABEL: test_unpcklps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpcklps {{.*#+}} ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpcklps {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[4],mem[4],ymm0[5],mem[5] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vunpcklps {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[4],mem[4],ymm0[5],mem[5] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpcklps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5018,9 +5018,9 @@ define <4 x double> @test_xorpd(<4 x dou<br>
; BROADWELL-LABEL: test_xorpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vxorpd %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vxorpd (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vxorpd (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_xorpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5085,9 +5085,9 @@ define <8 x float> @test_xorps(<8 x floa<br>
; BROADWELL-LABEL: test_xorps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vxorps %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vxorps (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vxorps (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_xorps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5146,7 +5146,7 @@ define void @test_zeroall() {<br>
; BROADWELL-LABEL: test_zeroall:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vzeroall # sched: [16:16.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_zeroall:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5191,7 +5191,7 @@ define void @test_zeroupper() {<br>
; BROADWELL-LABEL: test_zeroupper:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_zeroupper:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/avx2-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/avx2-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/avx2-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/avx2-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/avx2-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -21,9 +21,9 @@ define <8 x i32> @test_broadcasti128(<8<br>
;<br>
; BROADWELL-LABEL: test_broadcasti128:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vbroadcasti128 {{.*#+}} ymm1 = mem[0,1,0,1] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vbroadcasti128 {{.*#+}} ymm1 = mem[0,1,0,1] sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddd %ymm0, %ymm1, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcasti128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -65,7 +65,7 @@ define <4 x double> @test_broadcastsd_ym<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vbroadcastsd %xmm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastsd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -106,7 +106,7 @@ define <4 x float> @test_broadcastss(<4<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vbroadcastss %xmm0, %xmm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -147,7 +147,7 @@ define <8 x float> @test_broadcastss_ymm<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vbroadcastss %xmm0, %ymm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_broadcastss_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -197,7 +197,7 @@ define <4 x i32> @test_extracti128(<8 x<br>
; BROADWELL-NEXT:    vextracti128 $1, %ymm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vextracti128 $1, %ymm2, (%rdi) # sched: [1:1.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_extracti128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -246,8 +246,8 @@ define <2 x double> @test_gatherdpd(<2 x<br>
;<br>
; BROADWELL-LABEL: test_gatherdpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherdpd %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherdpd %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [25:3.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherdpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -281,8 +281,8 @@ define <4 x double> @test_gatherdpd_ymm(<br>
;<br>
; BROADWELL-LABEL: test_gatherdpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherdpd %ymm2, (%rdi,%xmm1,8), %ymm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherdpd %ymm2, (%rdi,%xmm1,8), %ymm0 # sched: [26:5.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherdpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -316,8 +316,8 @@ define <4 x float> @test_gatherdps(<4 x<br>
;<br>
; BROADWELL-LABEL: test_gatherdps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherdps %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherdps %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [25:3.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherdps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -351,8 +351,8 @@ define <8 x float> @test_gatherdps_ymm(<<br>
;<br>
; BROADWELL-LABEL: test_gatherdps_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherdps %ymm2, (%rdi,%ymm1,4), %ymm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherdps %ymm2, (%rdi,%ymm1,4), %ymm0 # sched: [26:4.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherdps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -386,8 +386,8 @@ define <2 x double> @test_gatherqpd(<2 x<br>
;<br>
; BROADWELL-LABEL: test_gatherqpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherqpd %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherqpd %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [22:3.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherqpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -421,8 +421,8 @@ define <4 x double> @test_gatherqpd_ymm(<br>
;<br>
; BROADWELL-LABEL: test_gatherqpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherqpd %ymm2, (%rdi,%ymm1,8), %ymm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherqpd %ymm2, (%rdi,%ymm1,8), %ymm0 # sched: [23:3.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherqpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -456,8 +456,8 @@ define <4 x float> @test_gatherqps(<4 x<br>
;<br>
; BROADWELL-LABEL: test_gatherqps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherqps %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vgatherqps %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [27:5.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherqps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -493,9 +493,9 @@ define <4 x float> @test_gatherqps_ymm(<<br>
;<br>
; BROADWELL-LABEL: test_gatherqps_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vgatherqps %xmm2, (%rdi,%ymm1,4), %xmm0 # sched: [1:?]<br>
+; BROADWELL-NEXT:    vgatherqps %xmm2, (%rdi,%ymm1,4), %xmm0 # sched: [24:5.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_gatherqps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -537,9 +537,9 @@ define <8 x i32> @test_inserti128(<8 x i<br>
; BROADWELL-LABEL: test_inserti128:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vinserti128 $1, %xmm1, %ymm0, %ymm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vinserti128 $1, (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vinserti128 $1, (%rdi), %ymm0, %ymm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddd %ymm0, %ymm1, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_inserti128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -583,8 +583,8 @@ define <4 x i64> @test_movntdqa(i8* %a0)<br>
;<br>
; BROADWELL-LABEL: test_movntdqa:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovntdqa (%rdi), %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmovntdqa (%rdi), %ymm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntdqa:<br>
; SKYLAKE:       # BB#0:<br>
@@ -621,8 +621,8 @@ define <16 x i16> @test_mpsadbw(<32 x i8<br>
; BROADWELL-LABEL: test_mpsadbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmpsadbw $7, %ymm1, %ymm0, %ymm0 # sched: [7:2.00]<br>
-; BROADWELL-NEXT:    vmpsadbw $7, (%rdi), %ymm0, %ymm0 # sched: [7:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmpsadbw $7, (%rdi), %ymm0, %ymm0 # sched: [13:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mpsadbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -667,9 +667,9 @@ define <32 x i8> @test_pabsb(<32 x i8> %<br>
; BROADWELL-LABEL: test_pabsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpabsb %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpabsb (%rdi), %ymm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpabsb (%rdi), %ymm1 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -717,9 +717,9 @@ define <8 x i32> @test_pabsd(<8 x i32> %<br>
; BROADWELL-LABEL: test_pabsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpabsd %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpabsd (%rdi), %ymm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpabsd (%rdi), %ymm1 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -767,9 +767,9 @@ define <16 x i16> @test_pabsw(<16 x i16><br>
; BROADWELL-LABEL: test_pabsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpabsw %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpabsw (%rdi), %ymm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpabsw (%rdi), %ymm1 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -815,8 +815,8 @@ define <16 x i16> @test_packssdw(<8 x i3<br>
; BROADWELL-LABEL: test_packssdw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpackssdw %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpackssdw (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpackssdw (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packssdw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -859,8 +859,8 @@ define <32 x i8> @test_packsswb(<16 x i1<br>
; BROADWELL-LABEL: test_packsswb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpacksswb %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpacksswb (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpacksswb (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packsswb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -903,8 +903,8 @@ define <16 x i16> @test_packusdw(<8 x i3<br>
; BROADWELL-LABEL: test_packusdw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpackusdw %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpackusdw (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpackusdw (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packusdw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -947,8 +947,8 @@ define <32 x i8> @test_packuswb(<16 x i1<br>
; BROADWELL-LABEL: test_packuswb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpackuswb %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpackuswb (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpackuswb (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packuswb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -991,8 +991,8 @@ define <32 x i8> @test_paddb(<32 x i8> %<br>
; BROADWELL-LABEL: test_paddb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1033,8 +1033,8 @@ define <8 x i32> @test_paddd(<8 x i32> %<br>
; BROADWELL-LABEL: test_paddd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1075,8 +1075,8 @@ define <4 x i64> @test_paddq(<4 x i64> %<br>
; BROADWELL-LABEL: test_paddq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddq (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddq (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1117,8 +1117,8 @@ define <32 x i8> @test_paddsb(<32 x i8><br>
; BROADWELL-LABEL: test_paddsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddsb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddsb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddsb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1160,8 +1160,8 @@ define <16 x i16> @test_paddsw(<16 x i16<br>
; BROADWELL-LABEL: test_paddsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddsw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddsw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddsw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1203,8 +1203,8 @@ define <32 x i8> @test_paddusb(<32 x i8><br>
; BROADWELL-LABEL: test_paddusb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddusb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddusb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddusb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddusb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1246,8 +1246,8 @@ define <16 x i16> @test_paddusw(<16 x i1<br>
; BROADWELL-LABEL: test_paddusw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddusw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddusw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddusw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddusw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1289,8 +1289,8 @@ define <16 x i16> @test_paddw(<16 x i16><br>
; BROADWELL-LABEL: test_paddw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1331,8 +1331,8 @@ define <32 x i8> @test_palignr(<32 x i8><br>
; BROADWELL-LABEL: test_palignr:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpalignr {{.*#+}} ymm0 = ymm1[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15],ymm0[0],ymm1[17,18,19,20,21,22,23,24,25,26,27,28,29,30,31],ymm0[16] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpalignr {{.*#+}} ymm0 = mem[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15],ymm0[0],mem[17,18,19,20,21,22,23,24,25,26,27,28,29,30,31],ymm0[16] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpalignr {{.*#+}} ymm0 = mem[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15],ymm0[0],mem[17,18,19,20,21,22,23,24,25,26,27,28,29,30,31],ymm0[16] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_palignr:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1375,9 +1375,9 @@ define <4 x i64> @test_pand(<4 x i64> %a<br>
; BROADWELL-LABEL: test_pand:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpand %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpand (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpand (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pand:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1424,9 +1424,9 @@ define <4 x i64> @test_pandn(<4 x i64> %<br>
; BROADWELL-LABEL: test_pandn:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpandn %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpandn (%rdi), %ymm0, %ymm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpandn (%rdi), %ymm0, %ymm1 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pandn:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1473,8 +1473,8 @@ define <32 x i8> @test_pavgb(<32 x i8> %<br>
; BROADWELL-LABEL: test_pavgb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpavgb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpavgb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpavgb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pavgb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1525,8 +1525,8 @@ define <16 x i16> @test_pavgw(<16 x i16><br>
; BROADWELL-LABEL: test_pavgw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpavgw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpavgw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpavgw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pavgw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1579,9 +1579,9 @@ define <4 x i32> @test_pblendd(<4 x i32><br>
; BROADWELL-LABEL: test_pblendd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpblendd {{.*#+}} xmm1 = xmm1[0,1,2],xmm0[3] sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpblendd {{.*#+}} xmm1 = mem[0],xmm1[1],mem[2],xmm1[3] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpblendd {{.*#+}} xmm1 = mem[0],xmm1[1],mem[2],xmm1[3] sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pblendd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1628,9 +1628,9 @@ define <8 x i32> @test_pblendd_ymm(<8 x<br>
; BROADWELL-LABEL: test_pblendd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpblendd {{.*#+}} ymm1 = ymm1[0,1,2],ymm0[3,4,5,6],ymm1[7] sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpblendd {{.*#+}} ymm1 = ymm1[0],mem[1,2],ymm1[3,4,5,6,7] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpblendd {{.*#+}} ymm1 = ymm1[0],mem[1,2],ymm1[3,4,5,6,7] sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pblendd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1675,8 +1675,8 @@ define <32 x i8> @test_pblendvb(<32 x i8<br>
; BROADWELL-LABEL: test_pblendvb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpblendvb %ymm2, %ymm1, %ymm0, %ymm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpblendvb %ymm3, (%rdi), %ymm0, %ymm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpblendvb %ymm3, (%rdi), %ymm0, %ymm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pblendvb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1718,8 +1718,8 @@ define <16 x i16> @test_pblendw(<16 x i1<br>
; BROADWELL-LABEL: test_pblendw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpblendw {{.*#+}} ymm0 = ymm0[0,1],ymm1[2,3,4],ymm0[5,6,7,8,9],ymm1[10,11,12],ymm0[13,14,15] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpblendw {{.*#+}} ymm0 = mem[0],ymm0[1],mem[2],ymm0[3],mem[4],ymm0[5],mem[6],ymm0[7],mem[8],ymm0[9],mem[10],ymm0[11],mem[12],ymm0[13],mem[14],ymm0[15] sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpblendw {{.*#+}} ymm0 = mem[0],ymm0[1],mem[2],ymm0[3],mem[4],ymm0[5],mem[6],ymm0[7],mem[8],ymm0[9],mem[10],ymm0[11],mem[12],ymm0[13],mem[14],ymm0[15] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pblendw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1762,9 +1762,9 @@ define <16 x i8> @test_pbroadcastb(<16 x<br>
; BROADWELL-LABEL: test_pbroadcastb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastb %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastb (%rdi), %xmm1 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpbroadcastb (%rdi), %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1811,9 +1811,9 @@ define <32 x i8> @test_pbroadcastb_ymm(<<br>
; BROADWELL-LABEL: test_pbroadcastb_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastb %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastb (%rdi), %ymm1 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpbroadcastb (%rdi), %ymm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastb_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1860,9 +1860,9 @@ define <4 x i32> @test_pbroadcastd(<4 x<br>
; BROADWELL-LABEL: test_pbroadcastd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastd %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastd (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpbroadcastd (%rdi), %xmm1 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1908,9 +1908,9 @@ define <8 x i32> @test_pbroadcastd_ymm(<<br>
; BROADWELL-LABEL: test_pbroadcastd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastd %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastd (%rdi), %ymm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpbroadcastd (%rdi), %ymm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1956,9 +1956,9 @@ define <2 x i64> @test_pbroadcastq(<2 x<br>
; BROADWELL-LABEL: test_pbroadcastq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastq %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastq (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpbroadcastq (%rdi), %xmm1 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2004,9 +2004,9 @@ define <4 x i64> @test_pbroadcastq_ymm(<<br>
; BROADWELL-LABEL: test_pbroadcastq_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastq %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastq (%rdi), %ymm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpbroadcastq (%rdi), %ymm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastq_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2052,9 +2052,9 @@ define <8 x i16> @test_pbroadcastw(<8 x<br>
; BROADWELL-LABEL: test_pbroadcastw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastw %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastw (%rdi), %xmm1 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpbroadcastw (%rdi), %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2101,9 +2101,9 @@ define <16 x i16> @test_pbroadcastw_ymm(<br>
; BROADWELL-LABEL: test_pbroadcastw_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpbroadcastw %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpbroadcastw (%rdi), %ymm1 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpbroadcastw (%rdi), %ymm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pbroadcastw_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2148,8 +2148,8 @@ define <32 x i8> @test_pcmpeqb(<32 x i8><br>
; BROADWELL-LABEL: test_pcmpeqb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpeqb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2194,8 +2194,8 @@ define <8 x i32> @test_pcmpeqd(<8 x i32><br>
; BROADWELL-LABEL: test_pcmpeqd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpeqd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2240,8 +2240,8 @@ define <4 x i64> @test_pcmpeqq(<4 x i64><br>
; BROADWELL-LABEL: test_pcmpeqq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqq (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpeqq (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2286,8 +2286,8 @@ define <16 x i16> @test_pcmpeqw(<16 x i1<br>
; BROADWELL-LABEL: test_pcmpeqw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpeqw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2332,8 +2332,8 @@ define <32 x i8> @test_pcmpgtb(<32 x i8><br>
; BROADWELL-LABEL: test_pcmpgtb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpgtb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpgtb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2378,8 +2378,8 @@ define <8 x i32> @test_pcmpgtd(<8 x i32><br>
; BROADWELL-LABEL: test_pcmpgtd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpgtd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpgtd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2424,8 +2424,8 @@ define <4 x i64> @test_pcmpgtq(<4 x i64><br>
; BROADWELL-LABEL: test_pcmpgtq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtq %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpcmpgtq (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpgtq (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2470,8 +2470,8 @@ define <16 x i16> @test_pcmpgtw(<16 x i1<br>
; BROADWELL-LABEL: test_pcmpgtw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpgtw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpgtw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2518,9 +2518,9 @@ define <4 x i64> @test_perm2i128(<4 x i6<br>
; BROADWELL-LABEL: test_perm2i128:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vperm2i128 {{.*#+}} ymm1 = ymm0[2,3],ymm1[0,1] sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vperm2i128 {{.*#+}} ymm0 = ymm0[2,3],mem[0,1] sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vperm2i128 {{.*#+}} ymm0 = ymm0[2,3],mem[0,1] sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm0, %ymm1, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_perm2i128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2567,9 +2567,9 @@ define <8 x i32> @test_permd(<8 x i32> %<br>
; BROADWELL-LABEL: test_permd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermd %ymm1, %ymm0, %ymm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpermd (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpermd (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddd %ymm0, %ymm1, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2617,9 +2617,9 @@ define <4 x double> @test_permpd(<4 x do<br>
; BROADWELL-LABEL: test_permpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermpd {{.*#+}} ymm0 = ymm0[3,2,2,3] sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpermpd {{.*#+}} ymm1 = mem[0,2,2,3] sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpermpd {{.*#+}} ymm1 = mem[0,2,2,3] sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddpd %ymm1, %ymm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2666,9 +2666,9 @@ define <8 x float> @test_permps(<8 x i32<br>
; BROADWELL-LABEL: test_permps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermps %ymm1, %ymm0, %ymm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpermps (%rdi), %ymm0, %ymm0 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpermps (%rdi), %ymm0, %ymm0 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2716,9 +2716,9 @@ define <4 x i64> @test_permq(<4 x i64> %<br>
; BROADWELL-LABEL: test_permq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpermq {{.*#+}} ymm0 = ymm0[3,2,2,3] sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpermq {{.*#+}} ymm1 = mem[0,2,2,3] sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpermq {{.*#+}} ymm1 = mem[0,2,2,3] sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_permq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2760,8 +2760,8 @@ define <4 x i32> @test_pgatherdd(<4 x i3<br>
;<br>
; BROADWELL-LABEL: test_pgatherdd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherdd %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpgatherdd %xmm2, (%rdi,%xmm1,2), %xmm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherdd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2795,8 +2795,8 @@ define <8 x i32> @test_pgatherdd_ymm(<8<br>
;<br>
; BROADWELL-LABEL: test_pgatherdd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherdd %ymm2, (%rdi,%ymm1,2), %ymm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpgatherdd %ymm2, (%rdi,%ymm1,2), %ymm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherdd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2830,8 +2830,8 @@ define <2 x i64> @test_pgatherdq(<2 x i6<br>
;<br>
; BROADWELL-LABEL: test_pgatherdq:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherdq %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpgatherdq %xmm2, (%rdi,%xmm1,2), %xmm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2865,8 +2865,8 @@ define <4 x i64> @test_pgatherdq_ymm(<4<br>
;<br>
; BROADWELL-LABEL: test_pgatherdq_ymm:<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherdq %ymm2, (%rdi,%xmm1,2), %ymm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">+; BROADWELL-NEXT:    vpgatherdq %ymm2, (%rdi,%xmm1,2), %ymm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<o:p></o:p></p>
<p class="MsoNormal">;<br>
; SKYLAKE-LABEL: test_pgatherdq_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2900,8 +2900,8 @@ define <4 x i32> @test_pgatherqd(<4 x i3<br>
;<br>
; BROADWELL-LABEL: test_pgatherqd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherqd %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpgatherqd %xmm2, (%rdi,%xmm1,2), %xmm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherqd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2937,9 +2937,9 @@ define <4 x i32> @test_pgatherqd_ymm(<4<br>
;<br>
; BROADWELL-LABEL: test_pgatherqd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherqd %xmm2, (%rdi,%ymm1,2), %xmm0 # sched: [1:?]<br>
+; BROADWELL-NEXT:    vpgatherqd %xmm2, (%rdi,%ymm1,2), %xmm0<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherqd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2976,8 +2976,8 @@ define <2 x i64> @test_pgatherqq(<2 x i6<br>
;<br>
; BROADWELL-LABEL: test_pgatherqq:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherqq %xmm2, (%rdi,%xmm1,2), %xmm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpgatherqq %xmm2, (%rdi,%xmm1,2), %xmm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherqq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3011,8 +3011,8 @@ define <4 x i64> @test_pgatherqq_ymm(<4<br>
;<br>
; BROADWELL-LABEL: test_pgatherqq_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpgatherqq %ymm2, (%rdi,%ymm1,2), %ymm0 # sched: [1:?]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpgatherqq %ymm2, (%rdi,%ymm1,2), %ymm0<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pgatherqq_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3049,8 +3049,8 @@ define <8 x i32> @test_phaddd(<8 x i32><br>
; BROADWELL-LABEL: test_phaddd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphaddd %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphaddd (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphaddd (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3092,8 +3092,8 @@ define <16 x i16> @test_phaddsw(<16 x i1<br>
; BROADWELL-LABEL: test_phaddsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphaddsw %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphaddsw (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphaddsw (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3135,8 +3135,8 @@ define <16 x i16> @test_phaddw(<16 x i16<br>
; BROADWELL-LABEL: test_phaddw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphaddw %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphaddw (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphaddw (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3178,8 +3178,8 @@ define <8 x i32> @test_phsubd(<8 x i32><br>
; BROADWELL-LABEL: test_phsubd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphsubd %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphsubd (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphsubd (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3221,8 +3221,8 @@ define <16 x i16> @test_phsubsw(<16 x i1<br>
; BROADWELL-LABEL: test_phsubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphsubsw %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphsubsw (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphsubsw (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3264,8 +3264,8 @@ define <16 x i16> @test_phsubw(<16 x i16<br>
; BROADWELL-LABEL: test_phsubw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphsubw %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphsubw (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphsubw (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3307,8 +3307,8 @@ define <16 x i16> @test_pmaddubsw(<32 x<br>
; BROADWELL-LABEL: test_pmaddubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaddubsw %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmaddubsw (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaddubsw (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaddubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3351,8 +3351,8 @@ define <8 x i32> @test_pmaddwd(<16 x i16<br>
; BROADWELL-LABEL: test_pmaddwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaddwd %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmaddwd (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaddwd (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaddwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3396,10 +3396,10 @@ define <4 x i32> @test_pmaskmovd(i8* %a0<br>
;<br>
; BROADWELL-LABEL: test_pmaskmovd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpmaskmovd (%rdi), %xmm0, %xmm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpmaskmovd %xmm1, %xmm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpmaskmovd (%rdi), %xmm0, %xmm2 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    vpmaskmovd %xmm1, %xmm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovdqa %xmm2, %xmm0 # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaskmovd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3445,10 +3445,10 @@ define <8 x i32> @test_pmaskmovd_ymm(i8*<br>
;<br>
; BROADWELL-LABEL: test_pmaskmovd_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpmaskmovd (%rdi), %ymm0, %ymm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpmaskmovd %ymm1, %ymm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpmaskmovd (%rdi), %ymm0, %ymm2 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    vpmaskmovd %ymm1, %ymm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovdqa %ymm2, %ymm0 # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaskmovd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3494,10 +3494,10 @@ define <2 x i64> @test_pmaskmovq(i8* %a0<br>
;<br>
; BROADWELL-LABEL: test_pmaskmovq:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpmaskmovq (%rdi), %xmm0, %xmm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpmaskmovq %xmm1, %xmm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpmaskmovq (%rdi), %xmm0, %xmm2 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    vpmaskmovq %xmm1, %xmm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovdqa %xmm2, %xmm0 # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaskmovq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3543,10 +3543,10 @@ define <4 x i64> @test_pmaskmovq_ymm(i8*<br>
;<br>
; BROADWELL-LABEL: test_pmaskmovq_ymm:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpmaskmovq (%rdi), %ymm0, %ymm2 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpmaskmovq %ymm1, %ymm0, (%rdi) # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vpmaskmovq (%rdi), %ymm0, %ymm2 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    vpmaskmovq %ymm1, %ymm0, (%rdi) # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vmovdqa %ymm2, %ymm0 # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaskmovq_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3591,8 +3591,8 @@ define <32 x i8> @test_pmaxsb(<32 x i8><br>
; BROADWELL-LABEL: test_pmaxsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxsb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxsb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxsb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3634,8 +3634,8 @@ define <8 x i32> @test_pmaxsd(<8 x i32><br>
; BROADWELL-LABEL: test_pmaxsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxsd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxsd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxsd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3677,8 +3677,8 @@ define <16 x i16> @test_pmaxsw(<16 x i16<br>
; BROADWELL-LABEL: test_pmaxsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxsw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxsw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxsw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3720,8 +3720,8 @@ define <32 x i8> @test_pmaxub(<32 x i8><br>
; BROADWELL-LABEL: test_pmaxub:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxub %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxub (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxub (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxub:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3763,8 +3763,8 @@ define <8 x i32> @test_pmaxud(<8 x i32><br>
; BROADWELL-LABEL: test_pmaxud:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxud %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxud (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxud (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxud:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3806,8 +3806,8 @@ define <16 x i16> @test_pmaxuw(<16 x i16<br>
; BROADWELL-LABEL: test_pmaxuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxuw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxuw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxuw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3849,8 +3849,8 @@ define <32 x i8> @test_pminsb(<32 x i8><br>
; BROADWELL-LABEL: test_pminsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminsb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminsb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminsb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3892,8 +3892,8 @@ define <8 x i32> @test_pminsd(<8 x i32><br>
; BROADWELL-LABEL: test_pminsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminsd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminsd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminsd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3935,8 +3935,8 @@ define <16 x i16> @test_pminsw(<16 x i16<br>
; BROADWELL-LABEL: test_pminsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminsw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminsw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminsw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3978,8 +3978,8 @@ define <32 x i8> @test_pminub(<32 x i8><br>
; BROADWELL-LABEL: test_pminub:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminub %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminub (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminub (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminub:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4021,8 +4021,8 @@ define <8 x i32> @test_pminud(<8 x i32><br>
; BROADWELL-LABEL: test_pminud:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminud %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminud (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminud (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminud:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4064,8 +4064,8 @@ define <16 x i16> @test_pminuw(<16 x i16<br>
; BROADWELL-LABEL: test_pminuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminuw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminuw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminuw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4108,7 +4108,7 @@ define i32 @test_pmovmskb(<32 x i8> %a0)<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovmskb %ymm0, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovmskb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4150,9 +4150,9 @@ define <8 x i32> @test_pmovsxbd(<16 x i8<br>
; BROADWELL-LABEL: test_pmovsxbd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxbd %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxbd (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxbd (%rdi), %ymm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxbd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4201,9 +4201,9 @@ define <4 x i64> @test_pmovsxbq(<16 x i8<br>
; BROADWELL-LABEL: test_pmovsxbq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxbq %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxbq (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxbq (%rdi), %ymm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxbq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4252,9 +4252,9 @@ define <16 x i16> @test_pmovsxbw(<16 x i<br>
; BROADWELL-LABEL: test_pmovsxbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxbw %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxbw (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxbw (%rdi), %ymm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4301,9 +4301,9 @@ define <4 x i64> @test_pmovsxdq(<4 x i32<br>
; BROADWELL-LABEL: test_pmovsxdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxdq %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxdq (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxdq (%rdi), %ymm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4350,9 +4350,9 @@ define <8 x i32> @test_pmovsxwd(<8 x i16<br>
; BROADWELL-LABEL: test_pmovsxwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxwd %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxwd (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxwd (%rdi), %ymm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4399,9 +4399,9 @@ define <4 x i64> @test_pmovsxwq(<8 x i16<br>
; BROADWELL-LABEL: test_pmovsxwq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxwq %xmm0, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxwq (%rdi), %ymm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxwq (%rdi), %ymm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxwq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4450,9 +4450,9 @@ define <8 x i32> @test_pmovzxbd(<16 x i8<br>
; BROADWELL-LABEL: test_pmovzxbd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxbd {{.*#+}} ymm0 = xmm0[0],zero,zero,zero,xmm0[1],zero,zero,zero,xmm0[2],zero,zero,zero,xmm0[3],zero,zero,zero,xmm0[4],zero,zero,zero,xmm0[5],zero,zero,zero,xmm0[6],zero,zero,zero,xmm0[7],zero,zero,zero sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxbd {{.*#+}} ymm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero,mem[4],zero,zero,zero,mem[5],zero,zero,zero,mem[6],zero,zero,zero,mem[7],zero,zero,zero sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxbd {{.*#+}} ymm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero,mem[4],zero,zero,zero,mem[5],zero,zero,zero,mem[6],zero,zero,zero,mem[7],zero,zero,zero sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxbd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4501,9 +4501,9 @@ define <4 x i64> @test_pmovzxbq(<16 x i8<br>
; BROADWELL-LABEL: test_pmovzxbq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxbq {{.*#+}} ymm0 = xmm0[0],zero,zero,zero,zero,zero,zero,zero,xmm0[1],zero,zero,zero,zero,zero,zero,zero,xmm0[2],zero,zero,zero,zero,zero,zero,zero,xmm0[3],zero,zero,zero,zero,zero,zero,zero sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxbq {{.*#+}} ymm1 = mem[0],zero,zero,zero,zero,zero,zero,zero,mem[1],zero,zero,zero,zero,zero,zero,zero,mem[2],zero,zero,zero,zero,zero,zero,zero,mem[3],zero,zero,zero,zero,zero,zero,zero sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxbq {{.*#+}} ymm1 = mem[0],zero,zero,zero,zero,zero,zero,zero,mem[1],zero,zero,zero,zero,zero,zero,zero,mem[2],zero,zero,zero,zero,zero,zero,zero,mem[3],zero,zero,zero,zero,zero,zero,zero sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxbq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4552,9 +4552,9 @@ define <16 x i16> @test_pmovzxbw(<16 x i<br>
; BROADWELL-LABEL: test_pmovzxbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxbw {{.*#+}} ymm0 = xmm0[0],zero,xmm0[1],zero,xmm0[2],zero,xmm0[3],zero,xmm0[4],zero,xmm0[5],zero,xmm0[6],zero,xmm0[7],zero,xmm0[8],zero,xmm0[9],zero,xmm0[10],zero,xmm0[11],zero,xmm0[12],zero,xmm0[13],zero,xmm0[14],zero,xmm0[15],zero sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxbw {{.*#+}} ymm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero,mem[4],zero,mem[5],zero,mem[6],zero,mem[7],zero,mem[8],zero,mem[9],zero,mem[10],zero,mem[11],zero,mem[12],zero,mem[13],zero,mem[14],zero,mem[15],zero sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxbw {{.*#+}} ymm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero,mem[4],zero,mem[5],zero,mem[6],zero,mem[7],zero,mem[8],zero,mem[9],zero,mem[10],zero,mem[11],zero,mem[12],zero,mem[13],zero,mem[14],zero,mem[15],zero sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4601,9 +4601,9 @@ define <4 x i64> @test_pmovzxdq(<4 x i32<br>
; BROADWELL-LABEL: test_pmovzxdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxdq {{.*#+}} ymm0 = xmm0[0],zero,xmm0[1],zero,xmm0[2],zero,xmm0[3],zero sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxdq {{.*#+}} ymm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxdq {{.*#+}} ymm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4650,9 +4650,9 @@ define <8 x i32> @test_pmovzxwd(<8 x i16<br>
; BROADWELL-LABEL: test_pmovzxwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxwd {{.*#+}} ymm0 = xmm0[0],zero,xmm0[1],zero,xmm0[2],zero,xmm0[3],zero,xmm0[4],zero,xmm0[5],zero,xmm0[6],zero,xmm0[7],zero sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxwd {{.*#+}} ymm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero,mem[4],zero,mem[5],zero,mem[6],zero,mem[7],zero sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxwd {{.*#+}} ymm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero,mem[4],zero,mem[5],zero,mem[6],zero,mem[7],zero sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4699,9 +4699,9 @@ define <4 x i64> @test_pmovzxwq(<8 x i16<br>
; BROADWELL-LABEL: test_pmovzxwq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxwq {{.*#+}} ymm0 = xmm0[0],zero,zero,zero,xmm0[1],zero,zero,zero,xmm0[2],zero,zero,zero,xmm0[3],zero,zero,zero sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxwq {{.*#+}} ymm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxwq {{.*#+}} ymm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero sched: [9:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxwq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4748,8 +4748,8 @@ define <4 x i64> @test_pmuldq(<8 x i32><br>
; BROADWELL-LABEL: test_pmuldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmuldq %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmuldq (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmuldq (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmuldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4792,8 +4792,8 @@ define <16 x i16> @test_pmulhrsw(<16 x i<br>
; BROADWELL-LABEL: test_pmulhrsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulhrsw %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmulhrsw (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulhrsw (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhrsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4835,8 +4835,8 @@ define <16 x i16> @test_pmulhuw(<16 x i1<br>
; BROADWELL-LABEL: test_pmulhuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulhuw %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmulhuw (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulhuw (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4878,8 +4878,8 @@ define <16 x i16> @test_pmulhw(<16 x i16<br>
; BROADWELL-LABEL: test_pmulhw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulhw %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmulhw (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulhw (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4921,8 +4921,8 @@ define <8 x i32> @test_pmulld(<8 x i32><br>
; BROADWELL-LABEL: test_pmulld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulld %ymm1, %ymm0, %ymm0 # sched: [10:2.00]<br>
-; BROADWELL-NEXT:    vpmulld (%rdi), %ymm0, %ymm0 # sched: [10:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulld (%rdi), %ymm0, %ymm0 # sched: [16:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4963,8 +4963,8 @@ define <16 x i16> @test_pmullw(<16 x i16<br>
; BROADWELL-LABEL: test_pmullw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmullw %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmullw (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmullw (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmullw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5005,8 +5005,8 @@ define <4 x i64> @test_pmuludq(<8 x i32><br>
; BROADWELL-LABEL: test_pmuludq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmuludq %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmuludq (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmuludq (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmuludq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5051,9 +5051,9 @@ define <4 x i64> @test_por(<4 x i64> %a0<br>
; BROADWELL-LABEL: test_por:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpor (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpor (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_por:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5098,8 +5098,8 @@ define <4 x i64> @test_psadbw(<32 x i8><br>
; BROADWELL-LABEL: test_psadbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsadbw %ymm1, %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpsadbw (%rdi), %ymm0, %ymm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsadbw (%rdi), %ymm0, %ymm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psadbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5142,8 +5142,8 @@ define <32 x i8> @test_pshufb(<32 x i8><br>
; BROADWELL-LABEL: test_pshufb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshufb %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshufb (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpshufb (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5187,9 +5187,9 @@ define <8 x i32> @test_pshufd(<8 x i32><br>
; BROADWELL-LABEL: test_pshufd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshufd {{.*#+}} ymm0 = ymm0[3,2,1,0,7,6,5,4] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshufd {{.*#+}} ymm1 = mem[1,0,3,2,5,4,7,6] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpshufd {{.*#+}} ymm1 = mem[1,0,3,2,5,4,7,6] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpaddd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5236,9 +5236,9 @@ define <16 x i16> @test_pshufhw(<16 x i1<br>
; BROADWELL-LABEL: test_pshufhw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshufhw {{.*#+}} ymm0 = ymm0[0,1,2,3,7,6,5,4,8,9,10,11,15,14,13,12] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshufhw {{.*#+}} ymm1 = mem[0,1,2,3,5,4,7,6,8,9,10,11,13,12,15,14] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpshufhw {{.*#+}} ymm1 = mem[0,1,2,3,5,4,7,6,8,9,10,11,13,12,15,14] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufhw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5285,9 +5285,9 @@ define <16 x i16> @test_pshuflw(<16 x i1<br>
; BROADWELL-LABEL: test_pshuflw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshuflw {{.*#+}} ymm0 = ymm0[3,2,1,0,4,5,6,7,11,10,9,8,12,13,14,15] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshuflw {{.*#+}} ymm1 = mem[1,0,3,2,4,5,6,7,9,8,11,10,12,13,14,15] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpshuflw {{.*#+}} ymm1 = mem[1,0,3,2,4,5,6,7,9,8,11,10,12,13,14,15] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshuflw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5332,8 +5332,8 @@ define <32 x i8> @test_psignb(<32 x i8><br>
; BROADWELL-LABEL: test_psignb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsignb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsignb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsignb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5375,8 +5375,8 @@ define <8 x i32> @test_psignd(<8 x i32><br>
; BROADWELL-LABEL: test_psignd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsignd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsignd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsignd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5418,8 +5418,8 @@ define <16 x i16> @test_psignw(<16 x i16<br>
; BROADWELL-LABEL: test_psignw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsignw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsignw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsignw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5463,9 +5463,9 @@ define <8 x i32> @test_pslld(<8 x i32> %<br>
; BROADWELL-LABEL: test_pslld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpslld %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpslld (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpslld (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpslld $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pslld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5509,7 +5509,7 @@ define <32 x i8> @test_pslldq(<32 x i8><br>
; BROADWELL-LABEL: test_pslldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpslldq {{.*#+}} ymm0 = zero,zero,zero,ymm0[0,1,2,3,4,5,6,7,8,9,10,11,12],zero,zero,zero,ymm0[16,17,18,19,20,21,22,23,24,25,26,27,28] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pslldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5547,9 +5547,9 @@ define <4 x i64> @test_psllq(<4 x i64> %<br>
; BROADWELL-LABEL: test_psllq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllq %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsllq (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsllq (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsllq $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5595,8 +5595,8 @@ define <4 x i32> @test_psllvd(<4 x i32><br>
; BROADWELL-LABEL: test_psllvd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllvd %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vpsllvd (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsllvd (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllvd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5638,8 +5638,8 @@ define <8 x i32> @test_psllvd_ymm(<8 x i<br>
; BROADWELL-LABEL: test_psllvd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllvd %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vpsllvd (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsllvd (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllvd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5681,8 +5681,8 @@ define <2 x i64> @test_psllvq(<2 x i64><br>
; BROADWELL-LABEL: test_psllvq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllvq %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpsllvq (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsllvq (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllvq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5724,8 +5724,8 @@ define <4 x i64> @test_psllvq_ymm(<4 x i<br>
; BROADWELL-LABEL: test_psllvq_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllvq %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpsllvq (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsllvq (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllvq_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5769,9 +5769,9 @@ define <16 x i16> @test_psllw(<16 x i16><br>
; BROADWELL-LABEL: test_psllw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllw %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsllw (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsllw (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsllw $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5819,9 +5819,9 @@ define <8 x i32> @test_psrad(<8 x i32> %<br>
; BROADWELL-LABEL: test_psrad:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrad %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsrad (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsrad (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrad $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrad:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5867,8 +5867,8 @@ define <4 x i32> @test_psravd(<4 x i32><br>
; BROADWELL-LABEL: test_psravd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsravd %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vpsravd (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsravd (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psravd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5910,8 +5910,8 @@ define <8 x i32> @test_psravd_ymm(<8 x i<br>
; BROADWELL-LABEL: test_psravd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsravd %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vpsravd (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsravd (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psravd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5955,9 +5955,9 @@ define <16 x i16> @test_psraw(<16 x i16><br>
; BROADWELL-LABEL: test_psraw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsraw %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsraw (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsraw (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsraw $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psraw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6005,9 +6005,9 @@ define <8 x i32> @test_psrld(<8 x i32> %<br>
; BROADWELL-LABEL: test_psrld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrld %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsrld (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsrld (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrld $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6051,7 +6051,7 @@ define <32 x i8> @test_psrldq(<32 x i8><br>
; BROADWELL-LABEL: test_psrldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrldq {{.*#+}} ymm0 = ymm0[3,4,5,6,7,8,9,10,11,12,13,14,15],zero,zero,zero,ymm0[19,20,21,22,23,24,25,26,27,28,29,30,31],zero,zero,zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6089,9 +6089,9 @@ define <4 x i64> @test_psrlq(<4 x i64> %<br>
; BROADWELL-LABEL: test_psrlq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlq %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsrlq (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsrlq (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrlq $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6137,8 +6137,8 @@ define <4 x i32> @test_psrlvd(<4 x i32><br>
; BROADWELL-LABEL: test_psrlvd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlvd %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vpsrlvd (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrlvd (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlvd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6180,8 +6180,8 @@ define <8 x i32> @test_psrlvd_ymm(<8 x i<br>
; BROADWELL-LABEL: test_psrlvd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlvd %ymm1, %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vpsrlvd (%rdi), %ymm0, %ymm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrlvd (%rdi), %ymm0, %ymm0 # sched: [9:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlvd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6223,8 +6223,8 @@ define <2 x i64> @test_psrlvq(<2 x i64><br>
; BROADWELL-LABEL: test_psrlvq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlvq %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpsrlvq (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrlvq (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlvq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6266,8 +6266,8 @@ define <4 x i64> @test_psrlvq_ymm(<4 x i<br>
; BROADWELL-LABEL: test_psrlvq_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlvq %ymm1, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpsrlvq (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrlvq (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlvq_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6311,9 +6311,9 @@ define <16 x i16> @test_psrlw(<16 x i16><br>
; BROADWELL-LABEL: test_psrlw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlw %xmm1, %ymm0, %ymm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vpsrlw (%rdi), %ymm0, %ymm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpsrlw (%rdi), %ymm0, %ymm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrlw $2, %ymm0, %ymm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6359,8 +6359,8 @@ define <32 x i8> @test_psubb(<32 x i8> %<br>
; BROADWELL-LABEL: test_psubb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6401,8 +6401,8 @@ define <8 x i32> @test_psubd(<8 x i32> %<br>
; BROADWELL-LABEL: test_psubd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubd (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubd (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6443,8 +6443,8 @@ define <4 x i64> @test_psubq(<4 x i64> %<br>
; BROADWELL-LABEL: test_psubq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubq (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubq (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6485,8 +6485,8 @@ define <32 x i8> @test_psubsb(<32 x i8><br>
; BROADWELL-LABEL: test_psubsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubsb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubsb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubsb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6528,8 +6528,8 @@ define <16 x i16> @test_psubsw(<16 x i16<br>
; BROADWELL-LABEL: test_psubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubsw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubsw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubsw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6571,8 +6571,8 @@ define <32 x i8> @test_psubusb(<32 x i8><br>
; BROADWELL-LABEL: test_psubusb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubusb %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubusb (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubusb (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubusb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6614,8 +6614,8 @@ define <16 x i16> @test_psubusw(<16 x i1<br>
; BROADWELL-LABEL: test_psubusw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubusw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubusw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubusw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubusw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6657,8 +6657,8 @@ define <16 x i16> @test_psubw(<16 x i16><br>
; BROADWELL-LABEL: test_psubw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubw %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubw (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubw (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6699,8 +6699,8 @@ define <32 x i8> @test_punpckhbw(<32 x i<br>
; BROADWELL-LABEL: test_punpckhbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhbw {{.*#+}} ymm0 = ymm0[8],ymm1[8],ymm0[9],ymm1[9],ymm0[10],ymm1[10],ymm0[11],ymm1[11],ymm0[12],ymm1[12],ymm0[13],ymm1[13],ymm0[14],ymm1[14],ymm0[15],ymm1[15],ymm0[24],ymm1[24],ymm0[25],ymm1[25],ymm0[26],ymm1[26],ymm0[27],ymm1[27],ymm0[28],ymm1[28],ymm0[29],ymm1[29],ymm0[30],ymm1[30],ymm0[31],ymm1[31] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhbw {{.*#+}} ymm0 = ymm0[8],mem[8],ymm0[9],mem[9],ymm0[10],mem[10],ymm0[11],mem[11],ymm0[12],mem[12],ymm0[13],mem[13],ymm0[14],mem[14],ymm0[15],mem[15],ymm0[24],mem[24],ymm0[25],mem[25],ymm0[26],mem[26],ymm0[27],mem[27],ymm0[28],mem[28],ymm0[29],mem[29],ymm0[30],mem[30],ymm0[31],mem[31] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhbw {{.*#+}} ymm0 = ymm0[8],mem[8],ymm0[9],mem[9],ymm0[10],mem[10],ymm0[11],mem[11],ymm0[12],mem[12],ymm0[13],mem[13],ymm0[14],mem[14],ymm0[15],mem[15],ymm0[24],mem[24],ymm0[25],mem[25],ymm0[26],mem[26],ymm0[27],mem[27],ymm0[28],mem[28],ymm0[29],mem[29],ymm0[30],mem[30],ymm0[31],mem[31] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6745,10 +6745,10 @@ define <8 x i32> @test_punpckhdq(<8 x i3<br>
; BROADWELL-LABEL: test_punpckhdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhdq {{.*#+}} ymm0 = ymm0[2],ymm1[2],ymm0[3],ymm1[3],ymm0[6],ymm1[6],ymm0[7],ymm1[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhdq {{.*#+}} ymm0 = ymm0[2],mem[2],ymm0[3],mem[3],ymm0[6],mem[6],ymm0[7],mem[7] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhdq {{.*#+}} ymm0 = ymm0[2],mem[2],ymm0[3],mem[3],ymm0[6],mem[6],ymm0[7],mem[7] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpcmpeqd %ymm1, %ymm1, %ymm1 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vpsubd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6798,9 +6798,9 @@ define <4 x i64> @test_punpckhqdq(<4 x i<br>
; BROADWELL-LABEL: test_punpckhqdq:<br>
; BROADWELL:       # BB#0:<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">; BROADWELL-NEXT:    vpunpckhqdq {{.*#+}} ymm1 = ymm0[1],ymm1[1],ymm0[3],ymm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhqdq {{.*#+}} ymm0 = ymm0[1],mem[1],ymm0[3],mem[3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhqdq {{.*#+}} ymm0 = ymm0[1],mem[1],ymm0[3],mem[3] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm0, %ymm1, %ymm0 # sched: [1:0.50]<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<o:p></o:p></p>
<p class="MsoNormal">; SKYLAKE-LABEL: test_punpckhqdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6845,8 +6845,8 @@ define <16 x i16> @test_punpckhwd(<16 x<br>
; BROADWELL-LABEL: test_punpckhwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhwd {{.*#+}} ymm0 = ymm0[4],ymm1[4],ymm0[5],ymm1[5],ymm0[6],ymm1[6],ymm0[7],ymm1[7],ymm0[12],ymm1[12],ymm0[13],ymm1[13],ymm0[14],ymm1[14],ymm0[15],ymm1[15] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhwd {{.*#+}} ymm0 = ymm0[4],mem[4],ymm0[5],mem[5],ymm0[6],mem[6],ymm0[7],mem[7],ymm0[12],mem[12],ymm0[13],mem[13],ymm0[14],mem[14],ymm0[15],mem[15] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhwd {{.*#+}} ymm0 = ymm0[4],mem[4],ymm0[5],mem[5],ymm0[6],mem[6],ymm0[7],mem[7],ymm0[12],mem[12],ymm0[13],mem[13],ymm0[14],mem[14],ymm0[15],mem[15] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6887,8 +6887,8 @@ define <32 x i8> @test_punpcklbw(<32 x i<br>
; BROADWELL-LABEL: test_punpcklbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpcklbw {{.*#+}} ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[2],ymm1[2],ymm0[3],ymm1[3],ymm0[4],ymm1[4],ymm0[5],ymm1[5],ymm0[6],ymm1[6],ymm0[7],ymm1[7],ymm0[16],ymm1[16],ymm0[17],ymm1[17],ymm0[18],ymm1[18],ymm0[19],ymm1[19],ymm0[20],ymm1[20],ymm0[21],ymm1[21],ymm0[22],ymm1[22],ymm0[23],ymm1[23] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpcklbw {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[2],mem[2],ymm0[3],mem[3],ymm0[4],mem[4],ymm0[5],mem[5],ymm0[6],mem[6],ymm0[7],mem[7],ymm0[16],mem[16],ymm0[17],mem[17],ymm0[18],mem[18],ymm0[19],mem[19],ymm0[20],mem[20],ymm0[21],mem[21],ymm0[22],mem[22],ymm0[23],mem[23] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpcklbw {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[2],mem[2],ymm0[3],mem[3],ymm0[4],mem[4],ymm0[5],mem[5],ymm0[6],mem[6],ymm0[7],mem[7],ymm0[16],mem[16],ymm0[17],mem[17],ymm0[18],mem[18],ymm0[19],mem[19],ymm0[20],mem[20],ymm0[21],mem[21],ymm0[22],mem[22],ymm0[23],mem[23] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6933,10 +6933,10 @@ define <8 x i32> @test_punpckldq(<8 x i3<br>
; BROADWELL-LABEL: test_punpckldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckldq {{.*#+}} ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckldq {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[4],mem[4],ymm0[5],mem[5] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpckldq {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[4],mem[4],ymm0[5],mem[5] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpcmpeqd %ymm1, %ymm1, %ymm1 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vpsubd %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6986,9 +6986,9 @@ define <4 x i64> @test_punpcklqdq(<4 x i<br>
; BROADWELL-LABEL: test_punpcklqdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpcklqdq {{.*#+}} ymm1 = ymm0[0],ymm1[0],ymm0[2],ymm1[2] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpcklqdq {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[2],mem[2] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpcklqdq {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[2],mem[2] sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpaddq %ymm0, %ymm1, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklqdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7033,8 +7033,8 @@ define <16 x i16> @test_punpcklwd(<16 x<br>
; BROADWELL-LABEL: test_punpcklwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpcklwd {{.*#+}} ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[2],ymm1[2],ymm0[3],ymm1[3],ymm0[8],ymm1[8],ymm0[9],ymm1[9],ymm0[10],ymm1[10],ymm0[11],ymm1[11] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpcklwd {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[2],mem[2],ymm0[3],mem[3],ymm0[8],mem[8],ymm0[9],mem[9],ymm0[10],mem[10],ymm0[11],mem[11] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpcklwd {{.*#+}} ymm0 = ymm0[0],mem[0],ymm0[1],mem[1],ymm0[2],mem[2],ymm0[3],mem[3],ymm0[8],mem[8],ymm0[9],mem[9],ymm0[10],mem[10],ymm0[11],mem[11] sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7077,9 +7077,9 @@ define <4 x i64> @test_pxor(<4 x i64> %a<br>
; BROADWELL-LABEL: test_pxor:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpxor %ymm1, %ymm0, %ymm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpxor (%rdi), %ymm0, %ymm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpxor (%rdi), %ymm0, %ymm0 # sched: [7:0.50]<br>
; BROADWELL-NEXT:    vpaddq %ymm1, %ymm0, %ymm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pxor:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/bmi-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/bmi-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/bmi-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/bmi-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/bmi-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -30,10 +30,10 @@ define i16 @test_andn_i16(i16 zeroext %a<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    andnl %esi, %edi, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    notl %edi # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    andw (%rdx), %di # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    andw (%rdx), %di # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addl %edi, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    # kill: %AX<def> %AX<kill> %EAX<kill><br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andn_i16:<br>
; SKYLAKE:       # BB#0:<br>
@@ -87,9 +87,9 @@ define i32 @test_andn_i32(i32 %a0, i32 %<br>
; BROADWELL-LABEL: test_andn_i32:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    andnl %esi, %edi, %ecx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    andnl (%rdx), %edi, %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    andnl (%rdx), %edi, %eax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andn_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -137,9 +137,9 @@ define i64 @test_andn_i64(i64 %a0, i64 %<br>
; BROADWELL-LABEL: test_andn_i64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    andnq %rsi, %rdi, %rcx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    andnq (%rdx), %rdi, %rax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    andnq (%rdx), %rdi, %rax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andn_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -186,10 +186,10 @@ define i32 @test_bextr_i32(i32 %a0, i32<br>
;<br>
; BROADWELL-LABEL: test_bextr_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    bextrl %edi, (%rdx), %ecx # sched: [2:0.50]<br>
+; BROADWELL-NEXT:    bextrl %edi, (%rdx), %ecx # sched: [7:0.50]<br>
; BROADWELL-NEXT:    bextrl %edi, %esi, %eax # sched: [2:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_bextr_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -236,10 +236,10 @@ define i64 @test_bextr_i64(i64 %a0, i64<br>
;<br>
; BROADWELL-LABEL: test_bextr_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    bextrq %rdi, (%rdx), %rcx # sched: [2:0.50]<br>
+; BROADWELL-NEXT:    bextrq %rdi, (%rdx), %rcx # sched: [7:0.50]<br>
; BROADWELL-NEXT:    bextrq %rdi, %rsi, %rax # sched: [2:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_bextr_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -286,10 +286,10 @@ define i32 @test_blsi_i32(i32 %a0, i32 *<br>
;<br>
; BROADWELL-LABEL: test_blsi_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    blsil (%rsi), %ecx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    blsil (%rsi), %ecx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    blsil %edi, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blsi_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -337,10 +337,10 @@ define i64 @test_blsi_i64(i64 %a0, i64 *<br>
;<br>
; BROADWELL-LABEL: test_blsi_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    blsiq (%rsi), %rcx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    blsiq (%rsi), %rcx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    blsiq %rdi, %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blsi_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -388,10 +388,10 @@ define i32 @test_blsmsk_i32(i32 %a0, i32<br>
;<br>
; BROADWELL-LABEL: test_blsmsk_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    blsmskl (%rsi), %ecx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    blsmskl (%rsi), %ecx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    blsmskl %edi, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blsmsk_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -439,10 +439,10 @@ define i64 @test_blsmsk_i64(i64 %a0, i64<br>
;<br>
; BROADWELL-LABEL: test_blsmsk_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    blsmskq (%rsi), %rcx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    blsmskq (%rsi), %rcx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    blsmskq %rdi, %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blsmsk_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -490,10 +490,10 @@ define i32 @test_blsr_i32(i32 %a0, i32 *<br>
;<br>
; BROADWELL-LABEL: test_blsr_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    blsrl (%rsi), %ecx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    blsrl (%rsi), %ecx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    blsrl %edi, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blsr_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -541,10 +541,10 @@ define i64 @test_blsr_i64(i64 %a0, i64 *<br>
;<br>
; BROADWELL-LABEL: test_blsr_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    blsrq (%rsi), %rcx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    blsrq (%rsi), %rcx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    blsrq %rdi, %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blsr_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -594,11 +594,11 @@ define i16 @test_cttz_i16(i16 zeroext %a<br>
;<br>
; BROADWELL-LABEL: test_cttz_i16:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    tzcntw (%rsi), %cx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    tzcntw (%rsi), %cx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    tzcntw %di, %ax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orl %ecx, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    # kill: %AX<def> %AX<kill> %EAX<kill><br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cttz_i16:<br>
; SKYLAKE:       # BB#0:<br>
@@ -648,10 +648,10 @@ define i32 @test_cttz_i32(i32 %a0, i32 *<br>
;<br>
; BROADWELL-LABEL: test_cttz_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    tzcntl (%rsi), %ecx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    tzcntl (%rsi), %ecx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    tzcntl %edi, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cttz_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -698,10 +698,10 @@ define i64 @test_cttz_i64(i64 %a0, i64 *<br>
;<br>
; BROADWELL-LABEL: test_cttz_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    tzcntq (%rsi), %rcx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    tzcntq (%rsi), %rcx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    tzcntq %rdi, %rax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cttz_i64:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/bmi2-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/bmi2-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/bmi2-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/bmi2-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/bmi2-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -23,10 +23,10 @@ define i32 @test_bzhi_i32(i32 %a0, i32 %<br>
;<br>
; BROADWELL-LABEL: test_bzhi_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    bzhil %edi, (%rdx), %ecx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    bzhil %edi, (%rdx), %ecx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    bzhil %edi, %esi, %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_bzhi_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -73,10 +73,10 @@ define i64 @test_bzhi_i64(i64 %a0, i64 %<br>
;<br>
; BROADWELL-LABEL: test_bzhi_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    bzhiq %rdi, (%rdx), %rcx # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    bzhiq %rdi, (%rdx), %rcx # sched: [6:0.50]<br>
; BROADWELL-NEXT:    bzhiq %rdi, %rsi, %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_bzhi_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -132,9 +132,9 @@ define i64 @test_mulx_i64(i64 %a0, i64 %<br>
; BROADWELL-NEXT:    movq %rdx, %rax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movq %rdi, %rdx # sched: [1:0.25]<br>
; BROADWELL-NEXT:    mulxq %rsi, %rsi, %rcx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    mulxq (%rax), %rdx, %rax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    mulxq (%rax), %rdx, %rax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    orq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulx_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -193,10 +193,10 @@ define i32 @test_pdep_i32(i32 %a0, i32 %<br>
;<br>
; BROADWELL-LABEL: test_pdep_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pdepl (%rdx), %edi, %ecx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    pdepl (%rdx), %edi, %ecx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    pdepl %esi, %edi, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pdep_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -243,10 +243,10 @@ define i64 @test_pdep_i64(i64 %a0, i64 %<br>
;<br>
; BROADWELL-LABEL: test_pdep_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pdepq (%rdx), %rdi, %rcx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    pdepq (%rdx), %rdi, %rcx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    pdepq %rsi, %rdi, %rax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pdep_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -293,10 +293,10 @@ define i32 @test_pext_i32(i32 %a0, i32 %<br>
;<br>
; BROADWELL-LABEL: test_pext_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pextl (%rdx), %edi, %ecx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    pextl (%rdx), %edi, %ecx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    pextl %esi, %edi, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pext_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -343,10 +343,10 @@ define i64 @test_pext_i64(i64 %a0, i64 %<br>
;<br>
; BROADWELL-LABEL: test_pext_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pextq (%rdx), %rdi, %rcx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    pextq (%rdx), %rdi, %rcx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    pextq %rsi, %rdi, %rax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pext_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -394,9 +394,9 @@ define i32 @test_rorx_i32(i32 %a0, i32 %<br>
; BROADWELL-LABEL: test_rorx_i32:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    rorxl $5, %edi, %ecx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    rorxl $5, (%rdx), %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    rorxl $5, (%rdx), %eax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rorx_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -447,9 +447,9 @@ define i64 @test_rorx_i64(i64 %a0, i64 %<br>
; BROADWELL-LABEL: test_rorx_i64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    rorxq $5, %rdi, %rcx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    rorxq $5, (%rdx), %rax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    rorxq $5, (%rdx), %rax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rorx_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -500,9 +500,9 @@ define i32 @test_sarx_i32(i32 %a0, i32 %<br>
; BROADWELL-LABEL: test_sarx_i32:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    sarxl %esi, %edi, %ecx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    sarxl %esi, (%rdx), %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    sarxl %esi, (%rdx), %eax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sarx_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -549,9 +549,9 @@ define i64 @test_sarx_i64(i64 %a0, i64 %<br>
; BROADWELL-LABEL: test_sarx_i64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    sarxq %rsi, %rdi, %rcx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    sarxq %rsi, (%rdx), %rax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    sarxq %rsi, (%rdx), %rax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sarx_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -598,9 +598,9 @@ define i32 @test_shlx_i32(i32 %a0, i32 %<br>
; BROADWELL-LABEL: test_shlx_i32:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    shlxl %esi, %edi, %ecx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    shlxl %esi, (%rdx), %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    shlxl %esi, (%rdx), %eax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shlx_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -647,9 +647,9 @@ define i64 @test_shlx_i64(i64 %a0, i64 %<br>
; BROADWELL-LABEL: test_shlx_i64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    shlxq %rsi, %rdi, %rcx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    shlxq %rsi, (%rdx), %rax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    shlxq %rsi, (%rdx), %rax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shlx_i64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -696,9 +696,9 @@ define i32 @test_shrx_i32(i32 %a0, i32 %<br>
; BROADWELL-LABEL: test_shrx_i32:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    shrxl %esi, %edi, %ecx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    shrxl %esi, (%rdx), %eax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    shrxl %esi, (%rdx), %eax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shrx_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -745,9 +745,9 @@ define i64 @test_shrx_i64(i64 %a0, i64 %<br>
; BROADWELL-LABEL: test_shrx_i64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    shrxq %rsi, %rdi, %rcx # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    shrxq %rsi, (%rdx), %rax # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    shrxq %rsi, (%rdx), %rax # sched: [6:0.50]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shrx_i64:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/f16c-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/f16c-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/f16c-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/f16c-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/f16c-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -31,10 +31,10 @@ define <4 x float> @test_vcvtph2ps_128(<<br>
;<br>
; BROADWELL-LABEL: test_vcvtph2ps_128:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vcvtph2ps (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vcvtph2ps (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vcvtph2ps %xmm0, %xmm0 # sched: [2:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vcvtph2ps_128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -88,10 +88,10 @@ define <8 x float> @test_vcvtph2ps_256(<<br>
;<br>
; BROADWELL-LABEL: test_vcvtph2ps_256:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vcvtph2ps (%rdi), %ymm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vcvtph2ps (%rdi), %ymm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vcvtph2ps %xmm0, %ymm0 # sched: [2:1.00]<br>
; BROADWELL-NEXT:    vaddps %ymm0, %ymm1, %ymm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vcvtph2ps_256:<br>
; SKYLAKE:       # BB#0:<br>
@@ -144,7 +144,7 @@ define <8 x i16> @test_vcvtps2ph_128(<4<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtps2ph $0, %xmm0, %xmm0 # sched: [4:1.00]<br>
; BROADWELL-NEXT:    vcvtps2ph $0, %xmm1, (%rdi) # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vcvtps2ph_128:<br>
; SKYLAKE:       # BB#0:<br>
@@ -196,9 +196,9 @@ define <8 x i16> @test_vcvtps2ph_256(<8<br>
; BROADWELL-LABEL: test_vcvtps2ph_256:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtps2ph $0, %ymm0, %xmm0 # sched: [6:1.00]<br>
-; BROADWELL-NEXT:    vcvtps2ph $0, %ymm1, (%rdi) # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    vcvtps2ph $0, %ymm1, (%rdi) # sched: [4:1.00]<br>
; BROADWELL-NEXT:    vzeroupper # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vcvtps2ph_256:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/fma-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/fma-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/fma-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/fma-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/fma-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -31,8 +31,8 @@ define <2 x double> @test_vfmadd213pd(<2<br>
; BROADWELL-LABEL: test_vfmadd213pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmadd213pd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmadd213pd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmadd213pd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmadd213pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -79,8 +79,8 @@ define <4 x double> @test_vfmadd213pd_ym<br>
; BROADWELL-LABEL: test_vfmadd213pd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmadd213pd %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmadd213pd (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmadd213pd (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmadd213pd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -127,8 +127,8 @@ define <4 x float> @test_vfmadd213ps(<4<br>
; BROADWELL-LABEL: test_vfmadd213ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmadd213ps %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmadd213ps (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmadd213ps (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmadd213ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -175,8 +175,8 @@ define <8 x float> @test_vfmadd213ps_ymm<br>
; BROADWELL-LABEL: test_vfmadd213ps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmadd213ps %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmadd213ps (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmadd213ps (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmadd213ps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -223,8 +223,8 @@ define <2 x double> @test_vfmadd213sd(<2<br>
; BROADWELL-LABEL: test_vfmadd213sd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmadd213sd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmadd213sd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmadd213sd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmadd213sd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -271,8 +271,8 @@ define <4 x float> @test_vfmadd213ss(<4<br>
; BROADWELL-LABEL: test_vfmadd213ss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmadd213ss %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmadd213ss (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmadd213ss (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmadd213ss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -331,8 +331,8 @@ define <2 x double> @test_vfmaddsubpd(<2<br>
; BROADWELL-LABEL: test_vfmaddsubpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmaddsub213pd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmaddsub213pd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmaddsub213pd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmaddsubpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -379,8 +379,8 @@ define <4 x double> @test_vfmaddsubpd_ym<br>
; BROADWELL-LABEL: test_vfmaddsubpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmaddsub213pd %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmaddsub213pd (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmaddsub213pd (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmaddsubpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -427,8 +427,8 @@ define <4 x float> @test_vfmaddsubps(<4<br>
; BROADWELL-LABEL: test_vfmaddsubps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmaddsub213ps %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmaddsub213ps (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmaddsub213ps (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmaddsubps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -475,8 +475,8 @@ define <8 x float> @test_vfmaddsubps_ymm<br>
; BROADWELL-LABEL: test_vfmaddsubps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmaddsub213ps %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmaddsub213ps (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmaddsub213ps (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmaddsubps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -535,8 +535,8 @@ define <2 x double> @test_vfmsubaddpd(<2<br>
; BROADWELL-LABEL: test_vfmsubaddpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsubadd213pd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsubadd213pd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsubadd213pd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsubaddpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -583,8 +583,8 @@ define <4 x double> @test_vfmsubaddpd_ym<br>
; BROADWELL-LABEL: test_vfmsubaddpd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsubadd213pd %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsubadd213pd (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsubadd213pd (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsubaddpd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -631,8 +631,8 @@ define <4 x float> @test_vfmsubaddps(<4<br>
; BROADWELL-LABEL: test_vfmsubaddps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsubadd213ps %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsubadd213ps (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsubadd213ps (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsubaddps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -679,8 +679,8 @@ define <8 x float> @test_vfmsubaddps_ymm<br>
; BROADWELL-LABEL: test_vfmsubaddps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsubadd213ps %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsubadd213ps (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsubadd213ps (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsubaddps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -739,8 +739,8 @@ define <2 x double> @test_vfmsub213pd(<2<br>
; BROADWELL-LABEL: test_vfmsub213pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsub213pd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsub213pd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsub213pd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsub213pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -787,8 +787,8 @@ define <4 x double> @test_vfmsub213pd_ym<br>
; BROADWELL-LABEL: test_vfmsub213pd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsub213pd %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsub213pd (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsub213pd (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsub213pd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -835,8 +835,8 @@ define <4 x float> @test_vfmsub213ps(<4<br>
; BROADWELL-LABEL: test_vfmsub213ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsub213ps %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsub213ps (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsub213ps (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsub213ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -883,8 +883,8 @@ define <8 x float> @test_vfmsub213ps_ymm<br>
; BROADWELL-LABEL: test_vfmsub213ps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsub213ps %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsub213ps (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsub213ps (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsub213ps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -931,8 +931,8 @@ define <2 x double> @test_vfmsub213sd(<2<br>
; BROADWELL-LABEL: test_vfmsub213sd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsub213sd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsub213sd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsub213sd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsub213sd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -979,8 +979,8 @@ define <4 x float> @test_vfmsub213ss(<4<br>
; BROADWELL-LABEL: test_vfmsub213ss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfmsub213ss %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfmsub213ss (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfmsub213ss (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfmsub213ss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1039,8 +1039,8 @@ define <2 x double> @test_vfnmadd213pd(<<br>
; BROADWELL-LABEL: test_vfnmadd213pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmadd213pd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmadd213pd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmadd213pd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmadd213pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1087,8 +1087,8 @@ define <4 x double> @test_vfnmadd213pd_y<br>
; BROADWELL-LABEL: test_vfnmadd213pd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmadd213pd %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmadd213pd (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmadd213pd (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmadd213pd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1135,8 +1135,8 @@ define <4 x float> @test_vfnmadd213ps(<4<br>
; BROADWELL-LABEL: test_vfnmadd213ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmadd213ps %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmadd213ps (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmadd213ps (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmadd213ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1183,8 +1183,8 @@ define <8 x float> @test_vfnmadd213ps_ym<br>
; BROADWELL-LABEL: test_vfnmadd213ps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmadd213ps %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmadd213ps (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmadd213ps (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmadd213ps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1231,8 +1231,8 @@ define <2 x double> @test_vfnmadd213sd(<<br>
; BROADWELL-LABEL: test_vfnmadd213sd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmadd213sd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmadd213sd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmadd213sd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmadd213sd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1279,8 +1279,8 @@ define <4 x float> @test_vfnmadd213ss(<4<br>
; BROADWELL-LABEL: test_vfnmadd213ss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmadd213ss %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmadd213ss (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmadd213ss (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmadd213ss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1339,8 +1339,8 @@ define <2 x double> @test_vfnmsub213pd(<<br>
; BROADWELL-LABEL: test_vfnmsub213pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmsub213pd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmsub213pd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmsub213pd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmsub213pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1387,8 +1387,8 @@ define <4 x double> @test_vfnmsub213pd_y<br>
; BROADWELL-LABEL: test_vfnmsub213pd_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmsub213pd %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmsub213pd (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmsub213pd (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmsub213pd_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1435,8 +1435,8 @@ define <4 x float> @test_vfnmsub213ps(<4<br>
; BROADWELL-LABEL: test_vfnmsub213ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmsub213ps %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmsub213ps (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmsub213ps (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmsub213ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1483,8 +1483,8 @@ define <8 x float> @test_vfnmsub213ps_ym<br>
; BROADWELL-LABEL: test_vfnmsub213ps_ymm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmsub213ps %ymm2, %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmsub213ps (%rdi), %ymm1, %ymm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmsub213ps (%rdi), %ymm1, %ymm0 # sched: [11:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmsub213ps_ymm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1531,8 +1531,8 @@ define <2 x double> @test_vfnmsub213sd(<<br>
; BROADWELL-LABEL: test_vfnmsub213sd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmsub213sd %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmsub213sd (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmsub213sd (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmsub213sd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1579,8 +1579,8 @@ define <4 x float> @test_vfnmsub213ss(<4<br>
; BROADWELL-LABEL: test_vfnmsub213ss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vfnmsub213ss %xmm2, %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vfnmsub213ss (%rdi), %xmm1, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vfnmsub213ss (%rdi), %xmm1, %xmm0 # sched: [10:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_vfnmsub213ss:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/lea32-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/lea32-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/lea32-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/lea32-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/lea32-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -52,7 +52,7 @@ define i32 @test_lea_offset(i32) {<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal -24(%rdi), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -116,7 +116,7 @@ define i32 @test_lea_offset_big(i32) {<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal 1024(%rdi), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
@@ -187,7 +187,7 @@ define i32 @test_lea_add(i32, i32) {<br>
; BROADWELL-NEXT:    # kill: %ESI<def> %ESI<kill> %RSI<def><br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal (%rdi,%rsi), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add:<br>
; SKYLAKE:       # BB#0:<br>
@@ -264,7 +264,7 @@ define i32 @test_lea_add_offset(i32, i32<br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal (%rdi,%rsi), %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl $16, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -347,7 +347,7 @@ define i32 @test_lea_add_offset_big(i32,<br>
; BROADWELL-NEXT:    leal (%rdi,%rsi), %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl $-4096, %eax # imm = 0xF000<br>
; BROADWELL-NEXT:    # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
@@ -417,7 +417,7 @@ define i32 @test_lea_mul(i32) {<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal (%rdi,%rdi,2), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_mul:<br>
; SKYLAKE:       # BB#0:<br>
@@ -485,7 +485,7 @@ define i32 @test_lea_mul_offset(i32) {<br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal (%rdi,%rdi,2), %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl $-32, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_mul_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -559,7 +559,7 @@ define i32 @test_lea_mul_offset_big(i32)<br>
; BROADWELL-NEXT:    leal (%rdi,%rdi,8), %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl $10000, %eax # imm = 0x2710<br>
; BROADWELL-NEXT:    # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_mul_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
@@ -632,7 +632,7 @@ define i32 @test_lea_add_scale(i32, i32)<br>
; BROADWELL-NEXT:    # kill: %ESI<def> %ESI<kill> %RSI<def><br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal (%rdi,%rsi,2), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_scale:<br>
; SKYLAKE:       # BB#0:<br>
@@ -710,7 +710,7 @@ define i32 @test_lea_add_scale_offset(i3<br>
; BROADWELL-NEXT:    # kill: %EDI<def> %EDI<kill> %RDI<def><br>
; BROADWELL-NEXT:    leal (%rdi,%rsi,4), %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl $96, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_scale_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -794,7 +794,7 @@ define i32 @test_lea_add_scale_offset_bi<br>
; BROADWELL-NEXT:    leal (%rdi,%rsi,8), %eax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addl $-1200, %eax # imm = 0xFB50<br>
; BROADWELL-NEXT:    # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_scale_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/lea64-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/lea64-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/lea64-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/lea64-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/lea64-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -46,7 +46,7 @@ define i64 @test_lea_offset(i64) {<br>
; BROADWELL-LABEL: test_lea_offset:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq -24(%rdi), %rax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -101,7 +101,7 @@ define i64 @test_lea_offset_big(i64) {<br>
; BROADWELL-LABEL: test_lea_offset_big:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq 1024(%rdi), %rax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
@@ -157,7 +157,7 @@ define i64 @test_lea_add(i64, i64) {<br>
; BROADWELL-LABEL: test_lea_add:<br>
; BROADWELL:       # BB#0:<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">; BROADWELL-NEXT:    leaq (%rdi,%rsi), %rax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add:<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">; SKYLAKE:       # BB#0:<br>
@@ -216,7 +216,7 @@ define i64 @test_lea_add_offset(i64, i64<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq (%rdi,%rsi), %rax # sched: [1:0.50]<o:p></o:p></p>
<p class="MsoNormal">; BROADWELL-NEXT:    addq $16, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -281,7 +281,7 @@ define i64 @test_lea_add_offset_big(i64,<br>
; BROADWELL-NEXT:    leaq (%rdi,%rsi), %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq $-4096, %rax # imm = 0xF000<br>
; BROADWELL-NEXT:    # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
@@ -339,7 +339,7 @@ define i64 @test_lea_mul(i64) {<br>
; BROADWELL-LABEL: test_lea_mul:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq (%rdi,%rdi,2), %rax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_mul:<br>
; SKYLAKE:       # BB#0:<br>
@@ -398,7 +398,7 @@ define i64 @test_lea_mul_offset(i64) {<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq (%rdi,%rdi,2), %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq $-32, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_mul_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -463,7 +463,7 @@ define i64 @test_lea_mul_offset_big(i64)<br>
; BROADWELL-NEXT:    leaq (%rdi,%rdi,8), %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq $10000, %rax # imm = 0x2710<br>
; BROADWELL-NEXT:    # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_mul_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
@@ -521,7 +521,7 @@ define i64 @test_lea_add_scale(i64, i64)<br>
; BROADWELL-LABEL: test_lea_add_scale:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq (%rdi,%rsi,2), %rax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_scale:<br>
; SKYLAKE:       # BB#0:<br>
@@ -581,7 +581,7 @@ define i64 @test_lea_add_scale_offset(i6<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    leaq (%rdi,%rsi,4), %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq $96, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_scale_offset:<br>
; SKYLAKE:       # BB#0:<br>
@@ -647,7 +647,7 @@ define i64 @test_lea_add_scale_offset_bi<br>
; BROADWELL-NEXT:    leaq (%rdi,%rsi,8), %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    addq $-1200, %rax # imm = 0xFB50<br>
; BROADWELL-NEXT:    # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lea_add_scale_offset_big:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/lzcnt-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/lzcnt-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/lzcnt-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/lzcnt-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/lzcnt-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -26,11 +26,11 @@ define i16 @test_ctlz_i16(i16 zeroext %a<br>
;<br>
; BROADWELL-LABEL: test_ctlz_i16:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    lzcntw (%rsi), %cx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    lzcntw (%rsi), %cx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    lzcntw %di, %ax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orl %ecx, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    # kill: %AX<def> %AX<kill> %EAX<kill><br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ctlz_i16:<br>
; SKYLAKE:       # BB#0:<br>
@@ -80,10 +80,10 @@ define i32 @test_ctlz_i32(i32 %a0, i32 *<br>
;<br>
; BROADWELL-LABEL: test_ctlz_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    lzcntl (%rsi), %ecx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    lzcntl (%rsi), %ecx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    lzcntl %edi, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ctlz_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -130,10 +130,10 @@ define i64 @test_ctlz_i64(i64 %a0, i64 *<br>
;<br>
; BROADWELL-LABEL: test_ctlz_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    lzcntq (%rsi), %rcx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    lzcntq (%rsi), %rcx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    lzcntq %rdi, %rax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ctlz_i64:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/mmx-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/mmx-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/mmx-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/mmx-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/mmx-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -54,11 +54,11 @@ define i64 @test_cvtpd2pi(<2 x double> %<br>
;<br>
; BROADWELL-LABEL: test_cvtpd2pi:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    cvtpd2pi (%rdi), %mm0 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    cvtpd2pi (%rdi), %mm0 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    cvtpd2pi %xmm0, %mm1 # sched: [4:1.00]<br>
; BROADWELL-NEXT:    por %mm1, %mm0 # sched: [1:0.33]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpd2pi:<br>
; SKYLAKE:       # BB#0:<br>
@@ -139,9 +139,9 @@ define <2 x double> @test_cvtpi2pd(x86_m<br>
; BROADWELL-LABEL: test_cvtpi2pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    cvtpi2pd %mm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    cvtpi2pd (%rdi), %xmm1 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    cvtpi2pd (%rdi), %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpi2pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -217,9 +217,9 @@ define <4 x float> @test_cvtpi2ps(x86_mm<br>
; BROADWELL-LABEL: test_cvtpi2ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    cvtpi2ps %mm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    cvtpi2ps (%rdi), %xmm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    cvtpi2ps (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpi2ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -300,10 +300,10 @@ define i64 @test_cvtps2pi(<4 x float> %a<br>
; BROADWELL-LABEL: test_cvtps2pi:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    cvtps2pi %xmm0, %mm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    cvtps2pi (%rdi), %mm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    cvtps2pi (%rdi), %mm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    por %mm0, %mm1 # sched: [1:0.33]<br>
; BROADWELL-NEXT:    movd %mm1, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtps2pi:<br>
; SKYLAKE:       # BB#0:<br>
@@ -388,11 +388,11 @@ define i64 @test_cvttpd2pi(<2 x double><br>
;<br>
; BROADWELL-LABEL: test_cvttpd2pi:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    cvttpd2pi (%rdi), %mm0 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    cvttpd2pi (%rdi), %mm0 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    cvttpd2pi %xmm0, %mm1 # sched: [4:1.00]<br>
; BROADWELL-NEXT:    por %mm1, %mm0 # sched: [1:0.33]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttpd2pi:<br>
; SKYLAKE:       # BB#0:<br>
@@ -478,10 +478,10 @@ define i64 @test_cvttps2pi(<4 x float> %<br>
; BROADWELL-LABEL: test_cvttps2pi:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    cvttps2pi %xmm0, %mm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    cvttps2pi (%rdi), %mm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    cvttps2pi (%rdi), %mm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    por %mm0, %mm1 # sched: [1:0.33]<br>
; BROADWELL-NEXT:    movd %mm1, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttps2pi:<br>
; SKYLAKE:       # BB#0:<br>
@@ -552,7 +552,7 @@ define void @test_emms() optsize {<br>
; BROADWELL-LABEL: test_emms:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    emms # sched: [31:10.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_emms:<br>
; SKYLAKE:       # BB#0:<br>
@@ -607,7 +607,7 @@ define void @test_maskmovq(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_maskmovq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    maskmovq %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maskmovq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -708,15 +708,15 @@ define i32 @test_movd(x86_mmx %a0, i32 %<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovd %edi, %xmm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    vmovq %xmm0, -{{[0-9]+}}(%rsp) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    movq -{{[0-9]+}}(%rsp), %mm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    movq -{{[0-9]+}}(%rsp), %mm1 # sched: [5:0.50]<br>
+; BROADWELL-NEXT:    vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vmovlps %xmm0, -{{[0-9]+}}(%rsp) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    paddd -{{[0-9]+}}(%rsp), %mm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddd -{{[0-9]+}}(%rsp), %mm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    paddd %mm1, %mm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movd %mm1, %ecx # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %eax # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movl %ecx, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -829,7 +829,7 @@ define i64 @test_movdq2q(<2 x i64> %a0)<br>
; BROADWELL-NEXT:    movdq2q %xmm0, %mm0 # sched: [2:0.67]<br>
; BROADWELL-NEXT:    paddd %mm0, %mm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movdq2q:<br>
; SKYLAKE:       # BB#0:<br>
@@ -894,7 +894,7 @@ define void @test_movntq(x86_mmx* %a0, x<br>
; BROADWELL-LABEL: test_movntq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    movntq %mm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -960,10 +960,10 @@ define void @test_movq(i64 *%a0) {<br>
;<br>
; BROADWELL-LABEL: test_movq:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    movq (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    movq (%rdi), %mm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    paddd %mm0, %mm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movq %mm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1029,7 +1029,7 @@ define <2 x i64> @test_movq2dq(x86_mmx %<br>
; BROADWELL-LABEL: test_movq2dq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    movq2dq %mm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movq2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1093,10 +1093,10 @@ define i64 @test_pabsb(x86_mmx *%a0) opt<br>
;<br>
; BROADWELL-LABEL: test_pabsb:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pabsb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pabsb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    pabsb %mm0, %mm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1171,10 +1171,10 @@ define i64 @test_pabsd(x86_mmx *%a0) opt<br>
;<br>
; BROADWELL-LABEL: test_pabsd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pabsd (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pabsd (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    pabsd %mm0, %mm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1249,10 +1249,10 @@ define i64 @test_pabsw(x86_mmx *%a0) opt<br>
;<br>
; BROADWELL-LABEL: test_pabsw:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pabsw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pabsw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    pabsw %mm0, %mm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1328,9 +1328,9 @@ define i64 @test_packssdw(x86_mmx %a0, x<br>
; BROADWELL-LABEL: test_packssdw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    packssdw %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    packssdw (%rdi), %mm0 # sched: [2:2.00]<br>
+; BROADWELL-NEXT:    packssdw (%rdi), %mm0 # sched: [7:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packssdw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1406,9 +1406,9 @@ define i64 @test_packsswb(x86_mmx %a0, x<br>
; BROADWELL-LABEL: test_packsswb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    packsswb %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    packsswb (%rdi), %mm0 # sched: [2:2.00]<br>
+; BROADWELL-NEXT:    packsswb (%rdi), %mm0 # sched: [7:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packsswb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1484,9 +1484,9 @@ define i64 @test_packuswb(x86_mmx %a0, x<br>
; BROADWELL-LABEL: test_packuswb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    packuswb %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    packuswb (%rdi), %mm0 # sched: [2:2.00]<br>
+; BROADWELL-NEXT:    packuswb (%rdi), %mm0 # sched: [7:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packuswb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1562,9 +1562,9 @@ define i64 @test_paddb(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_paddb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1640,9 +1640,9 @@ define i64 @test_paddd(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_paddd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddd %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddd (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddd (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1718,9 +1718,9 @@ define i64 @test_paddq(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_paddq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddq %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddq (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddq (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1796,9 +1796,9 @@ define i64 @test_paddsb(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_paddsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddsb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddsb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddsb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1874,9 +1874,9 @@ define i64 @test_paddsw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_paddsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddsw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddsw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddsw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1952,9 +1952,9 @@ define i64 @test_paddusb(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_paddusb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddusb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddusb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddusb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddusb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2030,9 +2030,9 @@ define i64 @test_paddusw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_paddusw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddusw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddusw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddusw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddusw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2108,9 +2108,9 @@ define i64 @test_paddw(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_paddw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    paddw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    paddw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    paddw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2186,9 +2186,9 @@ define i64 @test_palignr(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_palignr:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    palignr $1, %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    palignr $1, (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    palignr $1, (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_palignr:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2264,9 +2264,9 @@ define i64 @test_pand(x86_mmx %a0, x86_m<br>
; BROADWELL-LABEL: test_pand:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pand %mm1, %mm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    pand (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pand (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pand:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2342,9 +2342,9 @@ define i64 @test_pandn(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_pandn:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pandn %mm1, %mm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    pandn (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pandn (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pandn:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2420,9 +2420,9 @@ define i64 @test_pavgb(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_pavgb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pavgb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pavgb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pavgb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pavgb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2498,9 +2498,9 @@ define i64 @test_pavgw(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_pavgw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pavgw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pavgw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pavgw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pavgw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2576,9 +2576,9 @@ define i64 @test_pcmpeqb(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pcmpeqb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pcmpeqb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pcmpeqb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pcmpeqb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2654,9 +2654,9 @@ define i64 @test_pcmpeqd(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pcmpeqd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pcmpeqd %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pcmpeqd (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pcmpeqd (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2732,9 +2732,9 @@ define i64 @test_pcmpeqw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pcmpeqw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pcmpeqw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pcmpeqw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pcmpeqw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2810,9 +2810,9 @@ define i64 @test_pcmpgtb(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pcmpgtb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pcmpgtb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pcmpgtb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pcmpgtb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2888,9 +2888,9 @@ define i64 @test_pcmpgtd(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pcmpgtd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pcmpgtd %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pcmpgtd (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pcmpgtd (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2966,9 +2966,9 @@ define i64 @test_pcmpgtw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pcmpgtw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pcmpgtw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pcmpgtw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pcmpgtw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3034,7 +3034,7 @@ define i32 @test_pextrw(x86_mmx %a0) opt<br>
; BROADWELL-LABEL: test_pextrw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pextrw $0, %mm0, %eax # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pextrw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3099,9 +3099,9 @@ define i64 @test_phaddd(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_phaddd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    phaddd %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    phaddd (%rdi), %mm0 # sched: [3:2.00]<br>
+; BROADWELL-NEXT:    phaddd (%rdi), %mm0 # sched: [8:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3177,9 +3177,9 @@ define i64 @test_phaddsw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_phaddsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    phaddsw %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    phaddsw (%rdi), %mm0 # sched: [3:2.00]<br>
+; BROADWELL-NEXT:    phaddsw (%rdi), %mm0 # sched: [8:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3255,9 +3255,9 @@ define i64 @test_phaddw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_phaddw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    phaddw %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    phaddw (%rdi), %mm0 # sched: [3:2.00]<br>
+; BROADWELL-NEXT:    phaddw (%rdi), %mm0 # sched: [8:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3333,9 +3333,9 @@ define i64 @test_phsubd(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_phsubd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    phsubd %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    phsubd (%rdi), %mm0 # sched: [3:2.00]<br>
+; BROADWELL-NEXT:    phsubd (%rdi), %mm0 # sched: [8:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3411,9 +3411,9 @@ define i64 @test_phsubsw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_phsubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    phsubsw %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    phsubsw (%rdi), %mm0 # sched: [3:2.00]<br>
+; BROADWELL-NEXT:    phsubsw (%rdi), %mm0 # sched: [8:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3489,9 +3489,9 @@ define i64 @test_phsubw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_phsubw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    phsubw %mm1, %mm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    phsubw (%rdi), %mm0 # sched: [3:2.00]<br>
+; BROADWELL-NEXT:    phsubw (%rdi), %mm0 # sched: [8:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3572,10 +3572,10 @@ define i64 @test_pinsrw(x86_mmx %a0, i32<br>
; BROADWELL-LABEL: test_pinsrw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pinsrw $0, %edi, %mm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    movswl (%rsi), %eax # sched: [4:0.50]<br>
+; BROADWELL-NEXT:    movswl (%rsi), %eax # sched: [5:0.50]<br>
; BROADWELL-NEXT:    pinsrw $1, %eax, %mm0 # sched: [2:2.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pinsrw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3656,9 +3656,9 @@ define i64 @test_pmaddwd(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pmaddwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmaddwd %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmaddwd (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmaddwd (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaddwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3734,9 +3734,9 @@ define i64 @test_pmaddubsw(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_pmaddubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmaddubsw %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmaddubsw (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmaddubsw (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaddubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3812,9 +3812,9 @@ define i64 @test_pmaxsw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pmaxsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmaxsw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pmaxsw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pmaxsw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3890,9 +3890,9 @@ define i64 @test_pmaxub(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pmaxub:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmaxub %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pmaxub (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pmaxub (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxub:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3968,9 +3968,9 @@ define i64 @test_pminsw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pminsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pminsw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pminsw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pminsw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4046,9 +4046,9 @@ define i64 @test_pminub(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pminub:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pminub %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    pminub (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pminub (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminub:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4113,8 +4113,8 @@ define i32 @test_pmovmskb(x86_mmx %a0) o<br>
;<br>
; BROADWELL-LABEL: test_pmovmskb:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pmovmskb %mm0, %eax # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    pmovmskb %mm0, %eax # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovmskb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4179,9 +4179,9 @@ define i64 @test_pmulhrsw(x86_mmx %a0, x<br>
; BROADWELL-LABEL: test_pmulhrsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmulhrsw %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmulhrsw (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmulhrsw (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhrsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4257,9 +4257,9 @@ define i64 @test_pmulhw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pmulhw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmulhw %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmulhw (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmulhw (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4335,9 +4335,9 @@ define i64 @test_pmulhuw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pmulhuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmulhuw %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmulhuw (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmulhuw (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4413,9 +4413,9 @@ define i64 @test_pmullw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pmullw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmullw %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmullw (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmullw (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmullw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4491,9 +4491,9 @@ define i64 @test_pmuludq(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_pmuludq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pmuludq %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    pmuludq (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    pmuludq (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmuludq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4569,9 +4569,9 @@ define i64 @test_por(x86_mmx %a0, x86_mm<br>
; BROADWELL-LABEL: test_por:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    por %mm1, %mm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    por (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    por (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_por:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4647,9 +4647,9 @@ define i64 @test_psadbw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_psadbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psadbw %mm1, %mm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    psadbw (%rdi), %mm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    psadbw (%rdi), %mm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psadbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4725,9 +4725,9 @@ define i64 @test_pshufb(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_pshufb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pshufb %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    pshufb (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    pshufb (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4802,10 +4802,10 @@ define i64 @test_pshufw(x86_mmx *%a0) op<br>
;<br>
; BROADWELL-LABEL: test_pshufw:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    pshufw $0, (%rdi), %mm0 # mm0 = mem[0,0,0,0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    pshufw $0, (%rdi), %mm0 # mm0 = mem[0,0,0,0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    pshufw $0, %mm0, %mm0 # mm0 = mm0[0,0,0,0] sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4881,9 +4881,9 @@ define i64 @test_psignb(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_psignb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psignb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psignb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psignb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4959,9 +4959,9 @@ define i64 @test_psignd(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_psignd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psignd %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psignd (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psignd (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5037,9 +5037,9 @@ define i64 @test_psignw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_psignw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psignw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psignw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psignw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5120,10 +5120,10 @@ define i64 @test_pslld(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_pslld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pslld %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    pslld (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    pslld (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    pslld $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pslld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5210,10 +5210,10 @@ define i64 @test_psllq(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psllq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psllq %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psllq (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    psllq (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psllq $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5300,10 +5300,10 @@ define i64 @test_psllw(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psllw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psllw %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psllw (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    psllw (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psllw $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5390,10 +5390,10 @@ define i64 @test_psrad(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psrad:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psrad %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psrad (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    psrad (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psrad $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrad:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5480,10 +5480,10 @@ define i64 @test_psraw(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psraw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psraw %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psraw (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    psraw (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psraw $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psraw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5570,10 +5570,10 @@ define i64 @test_psrld(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psrld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psrld %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psrld (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    psrld (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psrld $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5660,10 +5660,10 @@ define i64 @test_psrlq(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psrlq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psrlq %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psrlq (%rdi), %mm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    psrlq (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psrlq $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlq:</p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">; SKYLAKE:       # BB#0:<br>
@@ -5750,10 +5750,10 @@ define i64 @test_psrlw(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psrlw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psrlw %mm1, %mm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    psrlw (%rdi), %mm0 # sched: [1:1.00]</p>
</div>
<div>
<p class="MsoNormal">+; BROADWELL-NEXT:    psrlw (%rdi), %mm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    psrlw $7, %mm0 # sched: [1:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]</p>
<p class="MsoNormal">;<br>
; SKYLAKE-LABEL: test_psrlw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5835,9 +5835,9 @@ define i64 @test_psubb(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psubb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5913,9 +5913,9 @@ define i64 @test_psubd(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psubd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubd %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubd (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubd (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5991,9 +5991,9 @@ define i64 @test_psubq(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psubq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubq %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubq (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubq (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6069,9 +6069,9 @@ define i64 @test_psubsb(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_psubsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubsb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubsb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubsb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6147,9 +6147,9 @@ define i64 @test_psubsw(x86_mmx %a0, x86<br>
; BROADWELL-LABEL: test_psubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubsw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubsw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubsw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6225,9 +6225,9 @@ define i64 @test_psubusb(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_psubusb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubusb %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubusb (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubusb (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubusb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6303,9 +6303,9 @@ define i64 @test_psubusw(x86_mmx %a0, x8<br>
; BROADWELL-LABEL: test_psubusw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubusw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubusw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubusw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubusw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6381,9 +6381,9 @@ define i64 @test_psubw(x86_mmx %a0, x86_<br>
; BROADWELL-LABEL: test_psubw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    psubw %mm1, %mm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    psubw (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    psubw (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6459,9 +6459,9 @@ define i64 @test_punpckhbw(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_punpckhbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    punpckhbw %mm1, %mm0 # mm0 = mm0[4],mm1[4],mm0[5],mm1[5],mm0[6],mm1[6],mm0[7],mm1[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    punpckhbw (%rdi), %mm0 # mm0 = mm0[4],mem[4],mm0[5],mem[5],mm0[6],mem[6],mm0[7],mem[7] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    punpckhbw (%rdi), %mm0 # mm0 = mm0[4],mem[4],mm0[5],mem[5],mm0[6],mem[6],mm0[7],mem[7] sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6537,9 +6537,9 @@ define i64 @test_punpckhdq(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_punpckhdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    punpckhdq %mm1, %mm0 # mm0 = mm0[1],mm1[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    punpckhdq (%rdi), %mm0 # mm0 = mm0[1],mem[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    punpckhdq (%rdi), %mm0 # mm0 = mm0[1],mem[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6615,9 +6615,9 @@ define i64 @test_punpckhwd(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_punpckhwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    punpckhwd %mm1, %mm0 # mm0 = mm0[2],mm1[2],mm0[3],mm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    punpckhwd (%rdi), %mm0 # mm0 = mm0[2],mem[2],mm0[3],mem[3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    punpckhwd (%rdi), %mm0 # mm0 = mm0[2],mem[2],mm0[3],mem[3] sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6693,9 +6693,9 @@ define i64 @test_punpcklbw(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_punpcklbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    punpcklbw %mm1, %mm0 # mm0 = mm0[0],mm1[0],mm0[1],mm1[1],mm0[2],mm1[2],mm0[3],mm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    punpcklbw (%rdi), %mm0 # mm0 = mm0[0],mem[0],mm0[1],mem[1],mm0[2],mem[2],mm0[3],mem[3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    punpcklbw (%rdi), %mm0 # mm0 = mm0[0],mem[0],mm0[1],mem[1],mm0[2],mem[2],mm0[3],mem[3] sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6771,9 +6771,9 @@ define i64 @test_punpckldq(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_punpckldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    punpckldq %mm1, %mm0 # mm0 = mm0[0],mm1[0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    punpckldq (%rdi), %mm0 # mm0 = mm0[0],mem[0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    punpckldq (%rdi), %mm0 # mm0 = mm0[0],mem[0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6849,9 +6849,9 @@ define i64 @test_punpcklwd(x86_mmx %a0,<br>
; BROADWELL-LABEL: test_punpcklwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    punpcklwd %mm1, %mm0 # mm0 = mm0[0],mm1[0],mm0[1],mm1[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    punpcklwd (%rdi), %mm0 # mm0 = mm0[0],mem[0],mm0[1],mem[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    punpcklwd (%rdi), %mm0 # mm0 = mm0[0],mem[0],mm0[1],mem[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6927,9 +6927,9 @@ define i64 @test_pxor(x86_mmx %a0, x86_m<br>
; BROADWELL-LABEL: test_pxor:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    pxor %mm1, %mm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    pxor (%rdi), %mm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    pxor (%rdi), %mm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    movd %mm0, %rax # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pxor:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/movbe-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/movbe-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/movbe-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/movbe-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/movbe-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -40,9 +40,9 @@ define i16 @test_movbe_i16(i16 *%a0, i16<br>
;<br>
; BROADWELL-LABEL: test_movbe_i16:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    movbew (%rdi), %ax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    movbew %si, (%rdx) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    movbew (%rdi), %ax # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    movbew %si, (%rdx) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movbe_i16:<br>
; SKYLAKE:       # BB#0:<br>
@@ -100,9 +100,9 @@ define i32 @test_movbe_i32(i32 *%a0, i32<br>
;<br>
; BROADWELL-LABEL: test_movbe_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    movbel (%rdi), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    movbel %esi, (%rdx) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    movbel (%rdi), %eax # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    movbel %esi, (%rdx) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movbe_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -160,9 +160,9 @@ define i64 @test_movbe_i64(i64 *%a0, i64<br>
;<br>
; BROADWELL-LABEL: test_movbe_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    movbeq (%rdi), %rax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    movbeq %rsi, (%rdx) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    movbeq (%rdi), %rax # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    movbeq %rsi, (%rdx) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movbe_i64:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/popcnt-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/popcnt-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/popcnt-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/popcnt-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/popcnt-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -46,11 +46,11 @@ define i16 @test_ctpop_i16(i16 zeroext %<br>
;<br>
; BROADWELL-LABEL: test_ctpop_i16:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    popcntw (%rsi), %cx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    popcntw (%rsi), %cx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    popcntw %di, %ax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orl %ecx, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    # kill: %AX&lt;def&gt; %AX&lt;kill&gt; %EAX&lt;kill&gt;<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ctpop_i16:<br>
; SKYLAKE:       # BB#0:<br>
@@ -114,10 +114,10 @@ define i32 @test_ctpop_i32(i32 %a0, i32<br>
;<br>
; BROADWELL-LABEL: test_ctpop_i32:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    popcntl (%rsi), %ecx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    popcntl (%rsi), %ecx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    popcntl %edi, %eax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ctpop_i32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -178,10 +178,10 @@ define i64 @test_ctpop_i64(i64 %a0, i64<br>
;<br>
; BROADWELL-LABEL: test_ctpop_i64:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    popcntq (%rsi), %rcx # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    popcntq (%rsi), %rcx # sched: [8:1.00]<br>
; BROADWELL-NEXT:    popcntq %rdi, %rax # sched: [3:1.00]<br>
; BROADWELL-NEXT:    orq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ctpop_i64:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/sse-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/sse-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/sse-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -45,8 +45,8 @@ define <4 x float> @test_addps(<4 x floa<br>
; BROADWELL-LABEL: test_addps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddps (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddps (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -111,8 +111,8 @@ define float @test_addss(float %a0, floa<br>
; BROADWELL-LABEL: test_addss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddss (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddss (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -181,8 +181,8 @@ define <4 x float> @test_andps(<4 x floa<br>
; BROADWELL-LABEL: test_andps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandps %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandps (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vandps (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -255,8 +255,8 @@ define <4 x float> @test_andnotps(<4 x f<br>
; BROADWELL-LABEL: test_andnotps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandnps %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandnps (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vandnps (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andnotps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -332,9 +332,9 @@ define <4 x float> @test_cmpps(<4 x floa<br>
; BROADWELL-LABEL: test_cmpps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcmpeqps %xmm1, %xmm0, %xmm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcmpeqps (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcmpeqps (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vorps %xmm0, %xmm1, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cmpps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -407,8 +407,8 @@ define float @test_cmpss(float %a0, floa<br>
; BROADWELL-LABEL: test_cmpss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcmpeqss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcmpeqss (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vcmpeqss (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cmpss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -521,13 +521,13 @@ define i32 @test_comiss(<4 x float> %a0,<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %cl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %cl # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vcomiss (%rdi), %xmm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcomiss (%rdi), %xmm0 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %dl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    orb %cl, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movzbl %dl, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_comiss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -631,9 +631,9 @@ define float @test_cvtsi2ss(i32 %a0, i32<br>
; BROADWELL-LABEL: test_cvtsi2ss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsi2ssl %edi, %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtsi2ssl (%rsi), %xmm1, %xmm1 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    vcvtsi2ssl (%rsi), %xmm1, %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsi2ss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -708,9 +708,9 @@ define float @test_cvtsi2ssq(i64 %a0, i6<br>
; BROADWELL-LABEL: test_cvtsi2ssq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsi2ssq %rdi, %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vcvtsi2ssq (%rsi), %xmm1, %xmm1 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    vcvtsi2ssq (%rsi), %xmm1, %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsi2ssq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -785,9 +785,9 @@ define i32 @test_cvtss2si(float %a0, flo<br>
; BROADWELL-LABEL: test_cvtss2si:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtss2si %xmm0, %ecx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtss2si (%rdi), %eax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvtss2si (%rdi), %eax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtss2si:<br>
; SKYLAKE:       # BB#0:<br>
@@ -865,9 +865,9 @@ define i64 @test_cvtss2siq(float %a0, fl<br>
; BROADWELL-LABEL: test_cvtss2siq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtss2si %xmm0, %rcx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtss2si (%rdi), %rax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvtss2si (%rdi), %rax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtss2siq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -945,9 +945,9 @@ define i32 @test_cvttss2si(float %a0, fl<br>
; BROADWELL-LABEL: test_cvttss2si:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttss2si %xmm0, %ecx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvttss2si (%rdi), %eax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvttss2si (%rdi), %eax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttss2si:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1022,9 +1022,9 @@ define i64 @test_cvttss2siq(float %a0, f<br>
; BROADWELL-LABEL: test_cvttss2siq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttss2si %xmm0, %rcx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvttss2si (%rdi), %rax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvttss2si (%rdi), %rax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttss2siq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1093,9 +1093,9 @@ define <4 x float> @test_divps(<4 x floa<br>
;<br>
; BROADWELL-LABEL: test_divps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vdivps %xmm1, %xmm0, %xmm0 # sched: [13:1.00]<br>
-; BROADWELL-NEXT:    vdivps (%rdi), %xmm0, %xmm0 # sched: [13:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdivps %xmm1, %xmm0, %xmm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    vdivps (%rdi), %xmm0, %xmm0 # sched: [16:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_divps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1159,9 +1159,9 @@ define float @test_divss(float %a0, floa<br>
;<br>
; BROADWELL-LABEL: test_divss:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vdivss %xmm1, %xmm0, %xmm0 # sched: [13:1.00]<br>
-; BROADWELL-NEXT:    vdivss (%rdi), %xmm0, %xmm0 # sched: [13:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdivss %xmm1, %xmm0, %xmm0 # sched: [11:1.00]<br>
+; BROADWELL-NEXT:    vdivss (%rdi), %xmm0, %xmm0 # sched: [16:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_divss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1226,8 +1226,8 @@ define void @test_ldmxcsr(i32 %a0) {<br>
; BROADWELL-LABEL: test_ldmxcsr:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    movl %edi, -{{[0-9]+}}(%rsp) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vldmxcsr -{{[0-9]+}}(%rsp) # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vldmxcsr -{{[0-9]+}}(%rsp) # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ldmxcsr:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1294,8 +1294,8 @@ define <4 x float> @test_maxps(<4 x floa<br>
; BROADWELL-LABEL: test_maxps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaxps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vmaxps (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmaxps (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maxps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1361,8 +1361,8 @@ define <4 x float> @test_maxss(<4 x floa<br>
; BROADWELL-LABEL: test_maxss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaxss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vmaxss (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmaxss (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maxss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1428,8 +1428,8 @@ define <4 x float> @test_minps(<4 x floa<br>
; BROADWELL-LABEL: test_minps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vminps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vminps (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vminps (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_minps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1495,8 +1495,8 @@ define <4 x float> @test_minss(<4 x floa<br>
; BROADWELL-LABEL: test_minss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vminss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vminss (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vminss (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_minss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1566,10 +1566,10 @@ define void @test_movaps(<4 x float> *%a<br>
;<br>
; BROADWELL-LABEL: test_movaps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovaps (%rdi), %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovaps (%rdi), %xmm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddps %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovaps %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movaps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1641,7 +1641,7 @@ define <4 x float> @test_movhlps(<4 x fl<br>
; BROADWELL-LABEL: test_movhlps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpckhpd {{.*#+}} xmm0 = xmm1[1],xmm0[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movhlps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1708,10 +1708,10 @@ define void @test_movhps(<4 x float> %a0<br>
;<br>
; BROADWELL-LABEL: test_movhps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovhpd {{.*#+}} xmm1 = xmm1[0],mem[0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vmovhpd {{.*#+}} xmm1 = xmm1[0],mem[0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vpextrq $1, %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpextrq $1, %xmm0, (%rdi) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movhps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1787,7 +1787,7 @@ define <4 x float> @test_movlhps(<4 x fl<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovlhps {{.*#+}} xmm0 = xmm0[0],xmm1[0] sched: [1:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movlhps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1855,10 +1855,10 @@ define void @test_movlps(<4 x float> %a0<br>
;<br>
; BROADWELL-LABEL: test_movlps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovlpd {{.*#+}} xmm1 = mem[0],xmm1[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vmovlpd {{.*#+}} xmm1 = mem[0],xmm1[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovlps %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movlps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1928,7 +1928,7 @@ define i32 @test_movmskps(<4 x float> %a<br>
; BROADWELL-LABEL: test_movmskps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovmskps %xmm0, %eax # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movmskps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1989,7 +1989,7 @@ define void @test_movntps(<4 x float> %a<br>
; BROADWELL-LABEL: test_movntps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovntps %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2052,10 +2052,10 @@ define void @test_movss_mem(float* %a0,<br>
;<br>
; BROADWELL-LABEL: test_movss_mem:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovss {{.*#+}} xmm0 = mem[0],zero,zero,zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddss %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovss %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movss_mem:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2125,7 +2125,7 @@ define <4 x float> @test_movss_reg(<4 x<br>
; BROADWELL-LABEL: test_movss_reg:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendps {{.*#+}} xmm0 = xmm1[0],xmm0[1,2,3] sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movss_reg:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2188,10 +2188,10 @@ define void @test_movups(<4 x float> *%a<br>
;<br>
; BROADWELL-LABEL: test_movups:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovups (%rdi), %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovups (%rdi), %xmm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddps %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovups %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movups:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2259,9 +2259,9 @@ define <4 x float> @test_mulps(<4 x floa<br>
;<br>
; BROADWELL-LABEL: test_mulps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmulps %xmm1, %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vmulps (%rdi), %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmulps %xmm1, %xmm0, %xmm0 # sched: [3:0.50]<br>
+; BROADWELL-NEXT:    vmulps (%rdi), %xmm0, %xmm0 # sched: [8:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2325,9 +2325,9 @@ define float @test_mulss(float %a0, floa<br>
;<br>
; BROADWELL-LABEL: test_mulss:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmulss %xmm1, %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vmulss (%rdi), %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmulss %xmm1, %xmm0, %xmm0 # sched: [3:0.50]<br>
+; BROADWELL-NEXT:    vmulss (%rdi), %xmm0, %xmm0 # sched: [8:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2396,8 +2396,8 @@ define <4 x float> @test_orps(<4 x float<br>
; BROADWELL-LABEL: test_orps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vorps %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vorps (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vorps (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_orps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2466,8 +2466,8 @@ define void @test_prefetchnta(i8* %a0) {<br>
;<br>
; BROADWELL-LABEL: test_prefetchnta:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    prefetchnta (%rdi) # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    prefetchnta (%rdi) # sched: [5:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_prefetchnta:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2534,9 +2534,9 @@ define <4 x float> @test_rcpps(<4 x floa<br>
; BROADWELL-LABEL: test_rcpps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vrcpps %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vrcpps (%rdi), %xmm1 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    vrcpps (%rdi), %xmm1 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rcpps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2619,10 +2619,10 @@ define <4 x float> @test_rcpss(float %a0<br>
; BROADWELL-LABEL: test_rcpss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vrcpss %xmm0, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vrcpss %xmm1, %xmm1, %xmm1 # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rcpss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2706,9 +2706,9 @@ define <4 x float> @test_rsqrtps(<4 x fl<br>
; BROADWELL-LABEL: test_rsqrtps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vrsqrtps %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vrsqrtps (%rdi), %xmm1 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    vrsqrtps (%rdi), %xmm1 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rsqrtps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2791,10 +2791,10 @@ define <4 x float> @test_rsqrtss(float %<br>
; BROADWELL-LABEL: test_rsqrtss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vrsqrtss %xmm0, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vrsqrtss %xmm1, %xmm1, %xmm1 # sched: [5:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_rsqrtss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2871,8 +2871,8 @@ define void @test_sfence() {<br>
;<br>
; BROADWELL-LABEL: test_sfence:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    sfence # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    sfence # sched: [2:0.33]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sfence:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2936,8 +2936,8 @@ define <4 x float> @test_shufps(<4 x flo<br>
; BROADWELL-LABEL: test_shufps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vshufps {{.*#+}} xmm0 = xmm0[0,0],xmm1[0,0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vshufps {{.*#+}} xmm0 = xmm0[0,3],mem[0,0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vshufps {{.*#+}} xmm0 = xmm0[0,3],mem[0,0] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shufps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3008,9 +3008,9 @@ define <4 x float> @test_sqrtps(<4 x flo<br>
; BROADWELL-LABEL: test_sqrtps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsqrtps %xmm0, %xmm0 # sched: [14:1.00]<br>
-; BROADWELL-NEXT:    vsqrtps (%rdi), %xmm1 # sched: [14:1.00]<br>
+; BROADWELL-NEXT:    vsqrtps (%rdi), %xmm1 # sched: [19:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sqrtps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3093,10 +3093,10 @@ define <4 x float> @test_sqrtss(<4 x flo<br>
; BROADWELL-LABEL: test_sqrtss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsqrtss %xmm0, %xmm0, %xmm0 # sched: [14:1.00]<br>
-; BROADWELL-NEXT:    vmovaps (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovaps (%rdi), %xmm1 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vsqrtss %xmm1, %xmm1, %xmm1 # sched: [14:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sqrtss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3170,9 +3170,9 @@ define i32 @test_stmxcsr() {<br>
;<br>
; BROADWELL-LABEL: test_stmxcsr:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vstmxcsr -{{[0-9]+}}(%rsp) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    movl -{{[0-9]+}}(%rsp), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vstmxcsr -{{[0-9]+}}(%rsp) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    movl -{{[0-9]+}}(%rsp), %eax # sched: [5:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_stmxcsr:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3239,8 +3239,8 @@ define <4 x float> @test_subps(<4 x floa<br>
; BROADWELL-LABEL: test_subps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsubps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vsubps (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vsubps (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_subps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3305,8 +3305,8 @@ define float @test_subss(float %a0, floa<br>
; BROADWELL-LABEL: test_subss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsubss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vsubss (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vsubss (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_subss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3414,13 +3414,13 @@ define i32 @test_ucomiss(<4 x float> %a0<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %cl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %cl # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vucomiss (%rdi), %xmm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vucomiss (%rdi), %xmm0 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %dl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    orb %cl, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movzbl %dl, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ucomiss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3523,8 +3523,8 @@ define <4 x float> @test_unpckhps(<4 x f<br>
; BROADWELL-LABEL: test_unpckhps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpckhps {{.*#+}} xmm0 = xmm0[2],xmm1[2],xmm0[3],xmm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpckhps {{.*#+}} xmm0 = xmm0[2],mem[2],xmm0[3],mem[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vunpckhps {{.*#+}} xmm0 = xmm0[2],mem[2],xmm0[3],mem[3] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpckhps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3593,8 +3593,8 @@ define <4 x float> @test_unpcklps(<4 x f<br>
; BROADWELL-LABEL: test_unpcklps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpcklps {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpcklps {{.*#+}} xmm0 = xmm0[0],mem[0],xmm0[1],mem[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vunpcklps {{.*#+}} xmm0 = xmm0[0],mem[0],xmm0[1],mem[1] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpcklps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3663,8 +3663,8 @@ define <4 x float> @test_xorps(<4 x floa<br>
; BROADWELL-LABEL: test_xorps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vxorps %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vxorps (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vxorps (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_xorps:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/sse2-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse2-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse2-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/sse2-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/sse2-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -45,8 +45,8 @@ define <2 x double> @test_addpd(<2 x dou<br>
; BROADWELL-LABEL: test_addpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddpd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddpd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -111,8 +111,8 @@ define double @test_addsd(double %a0, do<br>
; BROADWELL-LABEL: test_addsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddsd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddsd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -182,9 +182,9 @@ define <2 x double> @test_andpd(<2 x dou<br>
; BROADWELL-LABEL: test_andpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandpd %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandpd (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vandpd (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -263,9 +263,9 @@ define <2 x double> @test_andnotpd(<2 x<br>
; BROADWELL-LABEL: test_andnotpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vandnpd %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vandnpd (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vandnpd (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_andnotpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -346,9 +346,9 @@ define <2 x double> @test_cmppd(<2 x dou<br>
; BROADWELL-LABEL: test_cmppd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcmpeqpd %xmm1, %xmm0, %xmm1 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcmpeqpd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcmpeqpd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vorpd %xmm0, %xmm1, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cmppd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -421,8 +421,8 @@ define double @test_cmpsd(double %a0, do<br>
; BROADWELL-LABEL: test_cmpsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcmpeqsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcmpeqsd (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vcmpeqsd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cmpsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -535,13 +535,13 @@ define i32 @test_comisd(<2 x double> %a0<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %cl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %cl # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vcomisd (%rdi), %xmm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcomisd (%rdi), %xmm0 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %dl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    orb %cl, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movzbl %dl, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_comisd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -645,9 +645,9 @@ define <2 x double> @test_cvtdq2pd(<4 x<br>
; BROADWELL-LABEL: test_cvtdq2pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtdq2pd %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtdq2pd (%rdi), %xmm1 # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvtdq2pd (%rdi), %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtdq2pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -725,9 +725,9 @@ define <4 x float> @test_cvtdq2ps(<4 x i<br>
; BROADWELL-LABEL: test_cvtdq2ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtdq2ps %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcvtdq2ps (%rdi), %xmm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcvtdq2ps (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtdq2ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -803,9 +803,9 @@ define <4 x i32> @test_cvtpd2dq(<2 x dou<br>
; BROADWELL-LABEL: test_cvtpd2dq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtpd2dq %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtpd2dqx (%rdi), %xmm1 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcvtpd2dqx (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpd2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -882,9 +882,9 @@ define <4 x float> @test_cvtpd2ps(<2 x d<br>
; BROADWELL-LABEL: test_cvtpd2ps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtpd2ps %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtpd2psx (%rdi), %xmm1 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcvtpd2psx (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]</p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtpd2ps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -961,9 +961,9 @@ define <4 x i32> @test_cvtps2dq(<4 x flo<br>
; BROADWELL-LABEL: test_cvtps2dq:</p>
</div>
<div>
<p class="MsoNormal">; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtps2dq %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcvtps2dq (%rdi), %xmm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcvtps2dq (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]</p>
<p class="MsoNormal">+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtps2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1040,9 +1040,9 @@ define <2 x double> @test_cvtps2pd(<4 x<br>
; BROADWELL-LABEL: test_cvtps2pd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtps2pd %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vcvtps2pd (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vcvtps2pd (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtps2pd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1119,9 +1119,9 @@ define i32 @test_cvtsd2si(double %a0, do<br>
; BROADWELL-LABEL: test_cvtsd2si:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsd2si %xmm0, %ecx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtsd2si (%rdi), %eax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvtsd2si (%rdi), %eax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsd2si:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1199,9 +1199,9 @@ define i64 @test_cvtsd2siq(double %a0, d<br>
; BROADWELL-LABEL: test_cvtsd2siq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsd2si %xmm0, %rcx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtsd2si (%rdi), %rax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvtsd2si (%rdi), %rax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsd2siq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1285,10 +1285,10 @@ define float @test_cvtsd2ss(double %a0,<br>
; BROADWELL-LABEL: test_cvtsd2ss:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsd2ss %xmm0, %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vmovsd {{.*#+}} xmm1 = mem[0],zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovsd {{.*#+}} xmm1 = mem[0],zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vcvtsd2ss %xmm1, %xmm1, %xmm1 # sched: [4:1.00]<br>
; BROADWELL-NEXT:    vaddss %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsd2ss:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1367,9 +1367,9 @@ define double @test_cvtsi2sd(i32 %a0, i3<br>
; BROADWELL-LABEL: test_cvtsi2sd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsi2sdl %edi, %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtsi2sdl (%rsi), %xmm1, %xmm1 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    vcvtsi2sdl (%rsi), %xmm1, %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsi2sd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1444,9 +1444,9 @@ define double @test_cvtsi2sdq(i64 %a0, i<br>
; BROADWELL-LABEL: test_cvtsi2sdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtsi2sdq %rdi, %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvtsi2sdq (%rsi), %xmm1, %xmm1 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    vcvtsi2sdq (%rsi), %xmm1, %xmm1 # sched: [9:1.00]<br>
; BROADWELL-NEXT:    vaddsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtsi2sdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1529,10 +1529,10 @@ define double @test_cvtss2sd(float %a0,<br>
; BROADWELL-LABEL: test_cvtss2sd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvtss2sd %xmm0, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovss {{.*#+}} xmm1 = mem[0],zero,zero,zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vcvtss2sd %xmm1, %xmm1, %xmm1 # sched: [2:1.00]<br>
; BROADWELL-NEXT:    vaddsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvtss2sd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1612,9 +1612,9 @@ define <4 x i32> @test_cvttpd2dq(<2 x do<br>
; BROADWELL-LABEL: test_cvttpd2dq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttpd2dq %xmm0, %xmm0 # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvttpd2dqx (%rdi), %xmm1 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vcvttpd2dqx (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttpd2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1692,9 +1692,9 @@ define <4 x i32> @test_cvttps2dq(<4 x fl<br>
; BROADWELL-LABEL: test_cvttps2dq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttps2dq %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vcvttps2dq (%rdi), %xmm1 # sched: [3:1.00]<br>
+; BROADWELL-NEXT:    vcvttps2dq (%rdi), %xmm1 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttps2dq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1769,9 +1769,9 @@ define i32 @test_cvttsd2si(double %a0, d<br>
; BROADWELL-LABEL: test_cvttsd2si:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttsd2si %xmm0, %ecx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvttsd2si (%rdi), %eax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvttsd2si (%rdi), %eax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttsd2si:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1846,9 +1846,9 @@ define i64 @test_cvttsd2siq(double %a0,<br>
; BROADWELL-LABEL: test_cvttsd2siq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vcvttsd2si %xmm0, %rcx # sched: [4:1.00]<br>
-; BROADWELL-NEXT:    vcvttsd2si (%rdi), %rax # sched: [4:1.00]<br>
+; BROADWELL-NEXT:    vcvttsd2si (%rdi), %rax # sched: [9:1.00]<br>
; BROADWELL-NEXT:    addq %rcx, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_cvttsd2siq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1917,9 +1917,9 @@ define <2 x double> @test_divpd(<2 x dou<br>
;<br>
; BROADWELL-LABEL: test_divpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vdivpd %xmm1, %xmm0, %xmm0 # sched: [20:1.00]<br>
-; BROADWELL-NEXT:    vdivpd (%rdi), %xmm0, %xmm0 # sched: [20:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdivpd %xmm1, %xmm0, %xmm0 # sched: [14:1.00]<br>
+; BROADWELL-NEXT:    vdivpd (%rdi), %xmm0, %xmm0 # sched: [19:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_divpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1983,9 +1983,9 @@ define double @test_divsd(double %a0, do<br>
;<br>
; BROADWELL-LABEL: test_divsd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vdivsd %xmm1, %xmm0, %xmm0 # sched: [20:1.00]<br>
-; BROADWELL-NEXT:    vdivsd (%rdi), %xmm0, %xmm0 # sched: [20:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdivsd %xmm1, %xmm0, %xmm0 # sched: [14:1.00]<br>
+; BROADWELL-NEXT:    vdivsd (%rdi), %xmm0, %xmm0 # sched: [19:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_divsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2051,7 +2051,7 @@ define void @test_lfence() {<br>
; BROADWELL-LABEL: test_lfence:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    lfence # sched: [2:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lfence:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2112,7 +2112,7 @@ define void @test_mfence() {<br>
; BROADWELL-LABEL: test_mfence:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    mfence # sched: [2:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mfence:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2171,7 +2171,7 @@ define void @test_maskmovdqu(<16 x i8> %<br>
; BROADWELL-LABEL: test_maskmovdqu:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaskmovdqu %xmm1, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maskmovdqu:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2231,8 +2231,8 @@ define <2 x double> @test_maxpd(<2 x dou<br>
; BROADWELL-LABEL: test_maxpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaxpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vmaxpd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmaxpd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maxpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2298,8 +2298,8 @@ define <2 x double> @test_maxsd(<2 x dou<br>
; BROADWELL-LABEL: test_maxsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmaxsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vmaxsd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmaxsd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_maxsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2365,8 +2365,8 @@ define <2 x double> @test_minpd(<2 x dou<br>
; BROADWELL-LABEL: test_minpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vminpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vminpd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vminpd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_minpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2432,8 +2432,8 @@ define <2 x double> @test_minsd(<2 x dou<br>
; BROADWELL-LABEL: test_minsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vminsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vminsd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vminsd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_minsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2503,10 +2503,10 @@ define void @test_movapd(<2 x double> *%<br>
;<br>
; BROADWELL-LABEL: test_movapd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovapd (%rdi), %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovapd (%rdi), %xmm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovapd %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movapd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2579,10 +2579,10 @@ define void @test_movdqa(<2 x i64> *%a0,<br>
;<br>
; BROADWELL-LABEL: test_movdqa:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovdqa (%rdi), %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovdqa (%rdi), %xmm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm0, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vmovdqa %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movdqa:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2655,10 +2655,10 @@ define void @test_movdqu(<2 x i64> *%a0,<br>
;<br>
; BROADWELL-LABEL: test_movdqu:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovdqu (%rdi), %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovdqu (%rdi), %xmm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm0, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vmovdqu %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movdqu:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2747,12 +2747,12 @@ define i32 @test_movd(<4 x i32> %a0, i32<br>
; BROADWELL-LABEL: test_movd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovd %edi, %xmm1 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovd {{.*#+}} xmm2 = mem[0],zero,zero,zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovd {{.*#+}} xmm2 = mem[0],zero,zero,zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vpaddd %xmm2, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vmovd %xmm0, %eax # sched: [1:1.00]<br>
; BROADWELL-NEXT:    vmovd %xmm1, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2858,12 +2858,12 @@ define i64 @test_movd_64(<2 x i64> %a0,<br>
; BROADWELL-LABEL: test_movd_64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovq %rdi, %xmm1 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovq {{.*#+}} xmm2 = mem[0],zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovq {{.*#+}} xmm2 = mem[0],zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm2, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vmovq %xmm0, %rax # sched: [1:1.00]<br>
; BROADWELL-NEXT:    vmovq %xmm1, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movd_64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2953,10 +2953,10 @@ define void @test_movhpd(<2 x double> %a<br>
;<br>
; BROADWELL-LABEL: test_movhpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovhpd {{.*#+}} xmm1 = xmm1[0],mem[0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vmovhpd {{.*#+}} xmm1 = xmm1[0],mem[0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovhpd %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movhpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3032,10 +3032,10 @@ define void @test_movlpd(<2 x double> %a<br>
;<br>
; BROADWELL-LABEL: test_movlpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovlpd {{.*#+}} xmm1 = mem[0],xmm1[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vmovlpd {{.*#+}} xmm1 = mem[0],xmm1[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovlpd %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movlpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3104,7 +3104,7 @@ define i32 @test_movmskpd(<2 x double> %<br>
; BROADWELL-LABEL: test_movmskpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovmskpd %xmm0, %eax # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movmskpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3167,7 +3167,7 @@ define void @test_movntdqa(<2 x i64> %a0<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddq %xmm0, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vmovntdq %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntdqa:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3232,7 +3232,7 @@ define void @test_movntpd(<2 x double> %<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovntpd %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3300,10 +3300,10 @@ define <2 x i64> @test_movq_mem(<2 x i64<br>
;<br>
; BROADWELL-LABEL: test_movq_mem:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovq {{.*#+}} xmm1 = mem[0],zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovq {{.*#+}} xmm1 = mem[0],zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vmovq %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movq_mem:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3377,7 +3377,7 @@ define <2 x i64> @test_movq_reg(<2 x i64<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovq {{.*#+}} xmm0 = xmm0[0],zero sched: [1:0.33]<br>
; BROADWELL-NEXT:    vpaddq %xmm0, %xmm1, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movq_reg:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3445,10 +3445,10 @@ define void @test_movsd_mem(double* %a0,<br>
;<br>
; BROADWELL-LABEL: test_movsd_mem:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovsd {{.*#+}} xmm0 = mem[0],zero sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovsd {{.*#+}} xmm0 = mem[0],zero sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddsd %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovsd %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movsd_mem:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3519,7 +3519,7 @@ define <2 x double> @test_movsd_reg(<2 x<br>
; BROADWELL-LABEL: test_movsd_reg:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovlhps {{.*#+}} xmm0 = xmm1[0],xmm0[0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movsd_reg:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3582,10 +3582,10 @@ define void @test_movupd(<2 x double> *%<br>
;<br>
; BROADWELL-LABEL: test_movupd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovupd (%rdi), %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovupd (%rdi), %xmm0 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm0, %xmm0 # sched: [3:1.00]<br>
; BROADWELL-NEXT:    vmovupd %xmm0, (%rsi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movupd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3653,9 +3653,9 @@ define <2 x double> @test_mulpd(<2 x dou<br>
;<br>
; BROADWELL-LABEL: test_mulpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmulpd %xmm1, %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vmulpd (%rdi), %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmulpd %xmm1, %xmm0, %xmm0 # sched: [3:0.50]<br>
+; BROADWELL-NEXT:    vmulpd (%rdi), %xmm0, %xmm0 # sched: [8:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3719,9 +3719,9 @@ define double @test_mulsd(double %a0, do<br>
;<br>
; BROADWELL-LABEL: test_mulsd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmulsd %xmm1, %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    vmulsd (%rdi), %xmm0, %xmm0 # sched: [5:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmulsd %xmm1, %xmm0, %xmm0 # sched: [3:0.50]<br>
+; BROADWELL-NEXT:    vmulsd (%rdi), %xmm0, %xmm0 # sched: [8:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mulsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3791,9 +3791,9 @@ define <2 x double> @test_orpd(<2 x doub<br>
; BROADWELL-LABEL: test_orpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vorpd %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vorpd (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vorpd (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_orpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3871,8 +3871,8 @@ define <8 x i16> @test_packssdw(<4 x i32<br>
; BROADWELL-LABEL: test_packssdw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpackssdw %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpackssdw (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpackssdw (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packssdw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3943,8 +3943,8 @@ define <16 x i8> @test_packsswb(<8 x i16<br>
; BROADWELL-LABEL: test_packsswb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpacksswb %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpacksswb (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpacksswb (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packsswb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4015,8 +4015,8 @@ define <16 x i8> @test_packuswb(<8 x i16<br>
; BROADWELL-LABEL: test_packuswb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpackuswb %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpackuswb (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpackuswb (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packuswb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4087,8 +4087,8 @@ define <16 x i8> @test_paddb(<16 x i8> %<br>
; BROADWELL-LABEL: test_paddb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4157,8 +4157,8 @@ define <4 x i32> @test_paddd(<4 x i32> %<br>
; BROADWELL-LABEL: test_paddd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4223,8 +4223,8 @@ define <2 x i64> @test_paddq(<2 x i64> %<br>
; BROADWELL-LABEL: test_paddq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddq (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddq (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4293,8 +4293,8 @@ define <16 x i8> @test_paddsb(<16 x i8><br>
; BROADWELL-LABEL: test_paddsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddsb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddsb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddsb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4364,8 +4364,8 @@ define <8 x i16> @test_paddsw(<8 x i16><br>
; BROADWELL-LABEL: test_paddsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddsw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddsw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddsw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4435,8 +4435,8 @@ define <16 x i8> @test_paddusb(<16 x i8><br>
; BROADWELL-LABEL: test_paddusb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddusb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddusb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddusb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddusb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4506,8 +4506,8 @@ define <8 x i16> @test_paddusw(<8 x i16><br>
; BROADWELL-LABEL: test_paddusw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddusw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddusw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddusw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddusw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4577,8 +4577,8 @@ define <8 x i16> @test_paddw(<8 x i16> %<br>
; BROADWELL-LABEL: test_paddw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpaddw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpaddw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_paddw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4648,9 +4648,9 @@ define <2 x i64> @test_pand(<2 x i64> %a<br>
; BROADWELL-LABEL: test_pand:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpand %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpand (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpand (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pand:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4731,9 +4731,9 @@ define <2 x i64> @test_pandn(<2 x i64> %<br>
; BROADWELL-LABEL: test_pandn:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpandn %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpandn (%rdi), %xmm0, %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpandn (%rdi), %xmm0, %xmm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pandn:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4809,8 +4809,8 @@ define <16 x i8> @test_pavgb(<16 x i8> %<br>
; BROADWELL-LABEL: test_pavgb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpavgb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpavgb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpavgb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pavgb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4889,8 +4889,8 @@ define <8 x i16> @test_pavgw(<8 x i16> %<br>
; BROADWELL-LABEL: test_pavgw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpavgw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpavgw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpavgw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pavgw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -4972,9 +4972,9 @@ define <16 x i8> @test_pcmpeqb(<16 x i8><br>
; BROADWELL-LABEL: test_pcmpeqb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqb %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpcmpeqb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm0, %xmm1, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5053,9 +5053,9 @@ define <4 x i32> @test_pcmpeqd(<4 x i32><br>
; BROADWELL-LABEL: test_pcmpeqd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqd %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpcmpeqd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm0, %xmm1, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5134,9 +5134,9 @@ define <8 x i16> @test_pcmpeqw(<8 x i16><br>
; BROADWELL-LABEL: test_pcmpeqw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqw %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpcmpeqw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm0, %xmm1, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5216,9 +5216,9 @@ define <16 x i8> @test_pcmpgtb(<16 x i8><br>
; BROADWELL-LABEL: test_pcmpgtb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtb %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpgtb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpcmpgtb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm0, %xmm1, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5298,9 +5298,9 @@ define <4 x i32> @test_pcmpgtd(<4 x i32><br>
; BROADWELL-LABEL: test_pcmpgtd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtd %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpcmpeqd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm0, %xmm1, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5380,9 +5380,9 @@ define <8 x i16> @test_pcmpgtw(<8 x i16><br>
; BROADWELL-LABEL: test_pcmpgtw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtw %xmm1, %xmm0, %xmm1 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpgtw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpcmpgtw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm0, %xmm1, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5455,7 +5455,7 @@ define i16 @test_pextrw(<8 x i16> %a0) {<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpextrw $6, %xmm0, %eax # sched: [2:1.00]<br>
; BROADWELL-NEXT:    # kill: %AX<def> %AX<kill> %EAX<kill><br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pextrw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5522,8 +5522,8 @@ define <8 x i16> @test_pinsrw(<8 x i16><br>
; BROADWELL-LABEL: test_pinsrw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpinsrw $1, %edi, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpinsrw $3, (%rsi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpinsrw $3, (%rsi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pinsrw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5596,8 +5596,8 @@ define <4 x i32> @test_pmaddwd(<8 x i16><br>
; BROADWELL-LABEL: test_pmaddwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaddwd %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmaddwd (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaddwd (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaddwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5668,8 +5668,8 @@ define <8 x i16> @test_pmaxsw(<8 x i16><br>
; BROADWELL-LABEL: test_pmaxsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxsw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxsw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxsw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5739,8 +5739,8 @@ define <16 x i8> @test_pmaxub(<16 x i8><br>
; BROADWELL-LABEL: test_pmaxub:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxub %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxub (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxub (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxub:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5810,8 +5810,8 @@ define <8 x i16> @test_pminsw(<8 x i16><br>
; BROADWELL-LABEL: test_pminsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminsw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminsw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminsw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5881,8 +5881,8 @@ define <16 x i8> @test_pminub(<16 x i8><br>
; BROADWELL-LABEL: test_pminub:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminub %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminub (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminub (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminub:<br>
; SKYLAKE:       # BB#0:<br>
@@ -5945,7 +5945,7 @@ define i32 @test_pmovmskb(<16 x i8> %a0)<br>
; BROADWELL-LABEL: test_pmovmskb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovmskb %xmm0, %eax # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovmskb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6005,8 +6005,8 @@ define <8 x i16> @test_pmulhuw(<8 x i16><br>
; BROADWELL-LABEL: test_pmulhuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulhuw %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmulhuw (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulhuw (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6072,8 +6072,8 @@ define <8 x i16> @test_pmulhw(<8 x i16><br>
; BROADWELL-LABEL: test_pmulhw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulhw %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmulhw (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulhw (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6139,8 +6139,8 @@ define <8 x i16> @test_pmullw(<8 x i16><br>
; BROADWELL-LABEL: test_pmullw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmullw %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmullw (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmullw (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmullw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6213,8 +6213,8 @@ define <2 x i64> @test_pmuludq(<4 x i32><br>
; BROADWELL-LABEL: test_pmuludq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmuludq %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmuludq (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmuludq (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmuludq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6286,9 +6286,9 @@ define <2 x i64> @test_por(<2 x i64> %a0<br>
; BROADWELL-LABEL: test_por:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpor (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpor (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_por:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6366,8 +6366,8 @@ define <2 x i64> @test_psadbw(<16 x i8><br>
; BROADWELL-LABEL: test_psadbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsadbw %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpsadbw (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsadbw (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psadbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6441,9 +6441,9 @@ define <4 x i32> @test_pshufd(<4 x i32><br>
; BROADWELL-LABEL: test_pshufd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshufd {{.*#+}} xmm0 = xmm0[1,0,3,2] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshufd {{.*#+}} xmm1 = mem[3,2,1,0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpshufd {{.*#+}} xmm1 = mem[3,2,1,0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6520,9 +6520,9 @@ define <8 x i16> @test_pshufhw(<8 x i16><br>
; BROADWELL-LABEL: test_pshufhw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshufhw {{.*#+}} xmm0 = xmm0[0,1,2,3,5,4,7,6] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshufhw {{.*#+}} xmm1 = mem[0,1,2,3,7,6,5,4] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpshufhw {{.*#+}} xmm1 = mem[0,1,2,3,7,6,5,4] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufhw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6599,9 +6599,9 @@ define <8 x i16> @test_pshuflw(<8 x i16><br>
; BROADWELL-LABEL: test_pshuflw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshuflw {{.*#+}} xmm0 = xmm0[1,0,3,2,4,5,6,7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshuflw {{.*#+}} xmm1 = mem[3,2,1,0,4,5,6,7] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpshuflw {{.*#+}} xmm1 = mem[3,2,1,0,4,5,6,7] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshuflw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6676,9 +6676,9 @@ define <4 x i32> @test_pslld(<4 x i32> %<br>
; BROADWELL-LABEL: test_pslld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpslld %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpslld (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpslld (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpslld $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pslld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6751,7 +6751,7 @@ define <4 x i32> @test_pslldq(<4 x i32><br>
; BROADWELL-LABEL: test_pslldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpslldq {{.*#+}} xmm0 = zero,zero,zero,zero,xmm0[0,1,2,3,4,5,6,7,8,9,10,11] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pslldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6815,9 +6815,9 @@ define <2 x i64> @test_psllq(<2 x i64> %<br>
; BROADWELL-LABEL: test_psllq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllq %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsllq (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsllq (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsllq $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6894,9 +6894,9 @@ define <8 x i16> @test_psllw(<8 x i16> %<br>
; BROADWELL-LABEL: test_psllw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsllw %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsllw (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsllw (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsllw $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psllw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -6973,9 +6973,9 @@ define <4 x i32> @test_psrad(<4 x i32> %<br>
; BROADWELL-LABEL: test_psrad:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrad %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsrad (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrad (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrad $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrad:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7052,9 +7052,9 @@ define <8 x i16> @test_psraw(<8 x i16> %<br>
; BROADWELL-LABEL: test_psraw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsraw %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsraw (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsraw (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsraw $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psraw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7131,9 +7131,9 @@ define <4 x i32> @test_psrld(<4 x i32> %<br>
; BROADWELL-LABEL: test_psrld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrld %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsrld (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrld (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrld $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7206,7 +7206,7 @@ define <4 x i32> @test_psrldq(<4 x i32><br>
; BROADWELL-LABEL: test_psrldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrldq {{.*#+}} xmm0 = xmm0[4,5,6,7,8,9,10,11,12,13,14,15],zero,zero,zero,zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7270,9 +7270,9 @@ define <2 x i64> @test_psrlq(<2 x i64> %<br>
; BROADWELL-LABEL: test_psrlq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlq %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsrlq (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrlq (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrlq $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7349,9 +7349,9 @@ define <8 x i16> @test_psrlw(<8 x i16> %<br>
; BROADWELL-LABEL: test_psrlw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsrlw %xmm1, %xmm0, %xmm0 # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpsrlw (%rdi), %xmm0, %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsrlw (%rdi), %xmm0, %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    vpsrlw $2, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]</p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psrlw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7427,8 +7427,8 @@ define <16 x i8> @test_psubb(<16 x i8> %<br>
; BROADWELL-LABEL: test_psubb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]</p>
</div>
<div>
<p class="MsoNormal">-; BROADWELL-NEXT:    vpsubb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubb:<br>
; SKYLAKE:       # BB#0:</p>
<p class="MsoNormal">@@ -7497,8 +7497,8 @@ define <4 x i32> @test_psubd(<4 x i32> %<br>
; BROADWELL-LABEL: test_psubd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7563,8 +7563,8 @@ define <2 x i64> @test_psubq(<2 x i64> %<br>
; BROADWELL-LABEL: test_psubq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubq (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubq (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7633,8 +7633,8 @@ define <16 x i8> @test_psubsb(<16 x i8><br>
; BROADWELL-LABEL: test_psubsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubsb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubsb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubsb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7704,8 +7704,8 @@ define <8 x i16> @test_psubsw(<8 x i16><br>
; BROADWELL-LABEL: test_psubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubsw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubsw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubsw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7775,8 +7775,8 @@ define <16 x i8> @test_psubusb(<16 x i8><br>
; BROADWELL-LABEL: test_psubusb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubusb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubusb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubusb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubusb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7846,8 +7846,8 @@ define <8 x i16> @test_psubusw(<8 x i16><br>
; BROADWELL-LABEL: test_psubusw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubusw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubusw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubusw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubusw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7917,8 +7917,8 @@ define <8 x i16> @test_psubw(<8 x i16> %<br>
; BROADWELL-LABEL: test_psubw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsubw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsubw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsubw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psubw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -7987,8 +7987,8 @@ define <16 x i8> @test_punpckhbw(<16 x i<br>
; BROADWELL-LABEL: test_punpckhbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhbw {{.*#+}} xmm0 = xmm0[8],xmm1[8],xmm0[9],xmm1[9],xmm0[10],xmm1[10],xmm0[11],xmm1[11],xmm0[12],xmm1[12],xmm0[13],xmm1[13],xmm0[14],xmm1[14],xmm0[15],xmm1[15] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhbw {{.*#+}} xmm0 = xmm0[8],mem[8],xmm0[9],mem[9],xmm0[10],mem[10],xmm0[11],mem[11],xmm0[12],mem[12],xmm0[13],mem[13],xmm0[14],mem[14],xmm0[15],mem[15] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhbw {{.*#+}} xmm0 = xmm0[8],mem[8],xmm0[9],mem[9],xmm0[10],mem[10],xmm0[11],mem[11],xmm0[12],mem[12],xmm0[13],mem[13],xmm0[14],mem[14],xmm0[15],mem[15] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8060,9 +8060,9 @@ define <4 x i32> @test_punpckhdq(<4 x i3<br>
; BROADWELL-LABEL: test_punpckhdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhdq {{.*#+}} xmm0 = xmm0[2],xmm1[2],xmm0[3],xmm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhdq {{.*#+}} xmm1 = xmm1[2],mem[2],xmm1[3],mem[3] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhdq {{.*#+}} xmm1 = xmm1[2],mem[2],xmm1[3],mem[3] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8137,9 +8137,9 @@ define <2 x i64> @test_punpckhqdq(<2 x i<br>
; BROADWELL-LABEL: test_punpckhqdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhqdq {{.*#+}} xmm0 = xmm0[1],xmm1[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhqdq {{.*#+}} xmm1 = xmm1[1],mem[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhqdq {{.*#+}} xmm1 = xmm1[1],mem[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhqdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8213,8 +8213,8 @@ define <8 x i16> @test_punpckhwd(<8 x i1<br>
; BROADWELL-LABEL: test_punpckhwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckhwd {{.*#+}} xmm0 = xmm0[4],xmm1[4],xmm0[5],xmm1[5],xmm0[6],xmm1[6],xmm0[7],xmm1[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckhwd {{.*#+}} xmm0 = xmm0[4],mem[4],xmm0[5],mem[5],xmm0[6],mem[6],xmm0[7],mem[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpckhwd {{.*#+}} xmm0 = xmm0[4],mem[4],xmm0[5],mem[5],xmm0[6],mem[6],xmm0[7],mem[7] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckhwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8283,8 +8283,8 @@ define <16 x i8> @test_punpcklbw(<16 x i<br>
; BROADWELL-LABEL: test_punpcklbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpcklbw {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1],xmm0[2],xmm1[2],xmm0[3],xmm1[3],xmm0[4],xmm1[4],xmm0[5],xmm1[5],xmm0[6],xmm1[6],xmm0[7],xmm1[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpcklbw {{.*#+}} xmm0 = xmm0[0],mem[0],xmm0[1],mem[1],xmm0[2],mem[2],xmm0[3],mem[3],xmm0[4],mem[4],xmm0[5],mem[5],xmm0[6],mem[6],xmm0[7],mem[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpcklbw {{.*#+}} xmm0 = xmm0[0],mem[0],xmm0[1],mem[1],xmm0[2],mem[2],xmm0[3],mem[3],xmm0[4],mem[4],xmm0[5],mem[5],xmm0[6],mem[6],xmm0[7],mem[7] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8356,9 +8356,9 @@ define <4 x i32> @test_punpckldq(<4 x i3<br>
; BROADWELL-LABEL: test_punpckldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpckldq {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpckldq {{.*#+}} xmm1 = xmm1[0],mem[0],xmm1[1],mem[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpckldq {{.*#+}} xmm1 = xmm1[0],mem[0],xmm1[1],mem[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpckldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8433,9 +8433,9 @@ define <2 x i64> @test_punpcklqdq(<2 x i<br>
; BROADWELL-LABEL: test_punpcklqdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpcklqdq {{.*#+}} xmm0 = xmm0[0],xmm1[0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpcklqdq {{.*#+}} xmm1 = xmm1[0],mem[0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpunpcklqdq {{.*#+}} xmm1 = xmm1[0],mem[0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklqdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8509,8 +8509,8 @@ define <8 x i16> @test_punpcklwd(<8 x i1<br>
; BROADWELL-LABEL: test_punpcklwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpunpcklwd {{.*#+}} xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1],xmm0[2],xmm1[2],xmm0[3],xmm1[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpunpcklwd {{.*#+}} xmm0 = xmm0[0],mem[0],xmm0[1],mem[1],xmm0[2],mem[2],xmm0[3],mem[3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpunpcklwd {{.*#+}} xmm0 = xmm0[0],mem[0],xmm0[1],mem[1],xmm0[2],mem[2],xmm0[3],mem[3] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_punpcklwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8580,9 +8580,9 @@ define <2 x i64> @test_pxor(<2 x i64> %a<br>
; BROADWELL-LABEL: test_pxor:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpxor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vpxor (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpxor (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pxor:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8657,9 +8657,9 @@ define <2 x double> @test_shufpd(<2 x do<br>
; BROADWELL-LABEL: test_shufpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vshufpd {{.*#+}} xmm0 = xmm0[1],xmm1[0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vshufpd {{.*#+}} xmm1 = xmm1[1],mem[0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vshufpd {{.*#+}} xmm1 = xmm1[1],mem[0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_shufpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8735,9 +8735,9 @@ define <2 x double> @test_sqrtpd(<2 x do<br>
; BROADWELL-LABEL: test_sqrtpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsqrtpd %xmm0, %xmm0 # sched: [21:1.00]<br>
-; BROADWELL-NEXT:    vsqrtpd (%rdi), %xmm1 # sched: [21:1.00]<br>
+; BROADWELL-NEXT:    vsqrtpd (%rdi), %xmm1 # sched: [26:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sqrtpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8820,10 +8820,10 @@ define <2 x double> @test_sqrtsd(<2 x do<br>
; BROADWELL-LABEL: test_sqrtsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsqrtsd %xmm0, %xmm0, %xmm0 # sched: [21:1.00]<br>
-; BROADWELL-NEXT:    vmovapd (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovapd (%rdi), %xmm1 # sched: [5:0.50]<br>
; BROADWELL-NEXT:    vsqrtsd %xmm1, %xmm1, %xmm1 # sched: [21:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_sqrtsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8898,8 +8898,8 @@ define <2 x double> @test_subpd(<2 x dou<br>
; BROADWELL-LABEL: test_subpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsubpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vsubpd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vsubpd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_subpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -8964,8 +8964,8 @@ define double @test_subsd(double %a0, do<br>
; BROADWELL-LABEL: test_subsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vsubsd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vsubsd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vsubsd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_subsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -9073,13 +9073,13 @@ define i32 @test_ucomisd(<2 x double> %a<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %cl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %cl # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vucomisd (%rdi), %xmm0 # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    vucomisd (%rdi), %xmm0 # sched: [8:1.00]<br>
; BROADWELL-NEXT:    setnp %al # sched: [1:0.50]<br>
; BROADWELL-NEXT:    sete %dl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    orb %cl, %dl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movzbl %dl, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ucomisd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -9183,9 +9183,9 @@ define <2 x double> @test_unpckhpd(<2 x<br>
; BROADWELL-LABEL: test_unpckhpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpckhpd {{.*#+}} xmm0 = xmm0[1],xmm1[1] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpckhpd {{.*#+}} xmm1 = xmm1[1],mem[1] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vunpckhpd {{.*#+}} xmm1 = xmm1[1],mem[1] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpckhpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -9266,9 +9266,9 @@ define <2 x double> @test_unpcklpd(<2 x<br>
; BROADWELL-LABEL: test_unpcklpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vunpcklpd {{.*#+}} xmm0 = xmm0[0],xmm1[0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vunpcklpd {{.*#+}} xmm1 = xmm0[0],mem[0] sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vunpcklpd {{.*#+}} xmm1 = xmm0[0],mem[0] sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_unpcklpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -9343,9 +9343,9 @@ define <2 x double> @test_xorpd(<2 x dou<br>
; BROADWELL-LABEL: test_xorpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vxorpd %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vxorpd (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vxorpd (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_xorpd:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/sse3-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse3-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse3-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/sse3-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/sse3-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -45,8 +45,8 @@ define <2 x double> @test_addsubpd(<2 x<br>
; BROADWELL-LABEL: test_addsubpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddsubpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddsubpd (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddsubpd (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addsubpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -112,8 +112,8 @@ define <4 x float> @test_addsubps(<4 x f<br>
; BROADWELL-LABEL: test_addsubps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vaddsubps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vaddsubps (%rdi), %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vaddsubps (%rdi), %xmm0, %xmm0 # sched: [8:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_addsubps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -179,8 +179,8 @@ define <2 x double> @test_haddpd(<2 x do<br>
; BROADWELL-LABEL: test_haddpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhaddpd %xmm1, %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhaddpd (%rdi), %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhaddpd (%rdi), %xmm0, %xmm0 # sched: [10:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_haddpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -246,8 +246,8 @@ define <4 x float> @test_haddps(<4 x flo<br>
; BROADWELL-LABEL: test_haddps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhaddps %xmm1, %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhaddps (%rdi), %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhaddps (%rdi), %xmm0, %xmm0 # sched: [10:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_haddps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -313,8 +313,8 @@ define <2 x double> @test_hsubpd(<2 x do<br>
; BROADWELL-LABEL: test_hsubpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhsubpd %xmm1, %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhsubpd (%rdi), %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhsubpd (%rdi), %xmm0, %xmm0 # sched: [10:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_hsubpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -380,8 +380,8 @@ define <4 x float> @test_hsubps(<4 x flo<br>
; BROADWELL-LABEL: test_hsubps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vhsubps %xmm1, %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    vhsubps (%rdi), %xmm0, %xmm0 # sched: [5:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vhsubps (%rdi), %xmm0, %xmm0 # sched: [10:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_hsubps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -443,8 +443,8 @@ define <16 x i8> @test_lddqu(i8* %a0) {<br>
;<br>
; BROADWELL-LABEL: test_lddqu:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vlddqu (%rdi), %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vlddqu (%rdi), %xmm0 # sched: [5:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_lddqu:<br>
; SKYLAKE:       # BB#0:<br>
@@ -511,7 +511,7 @@ define void @test_monitor(i8* %a0, i32 %<br>
; BROADWELL-NEXT:    leaq (%rdi), %rax # sched: [1:0.50]<br>
; BROADWELL-NEXT:    movl %esi, %ecx # sched: [1:0.25]<br>
; BROADWELL-NEXT:    monitor # sched: [100:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_monitor:<br>
; SKYLAKE:       # BB#0:<br>
@@ -585,9 +585,9 @@ define <2 x double> @test_movddup(<2 x d<br>
; BROADWELL-LABEL: test_movddup:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovddup {{.*#+}} xmm0 = xmm0[0,0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovddup {{.*#+}} xmm1 = mem[0,0] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovddup {{.*#+}} xmm1 = mem[0,0] sched: [5:0.50]<br>
; BROADWELL-NEXT:    vsubpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movddup:<br>
; SKYLAKE:       # BB#0:<br>
@@ -663,9 +663,9 @@ define <4 x float> @test_movshdup(<4 x f<br>
; BROADWELL-LABEL: test_movshdup:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovshdup {{.*#+}} xmm0 = xmm0[1,1,3,3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovshdup {{.*#+}} xmm1 = mem[1,1,3,3] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovshdup {{.*#+}} xmm1 = mem[1,1,3,3] sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movshdup:<br>
; SKYLAKE:       # BB#0:<br>
@@ -741,9 +741,9 @@ define <4 x float> @test_movsldup(<4 x f<br>
; BROADWELL-LABEL: test_movsldup:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmovsldup {{.*#+}} xmm0 = xmm0[0,0,2,2] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vmovsldup {{.*#+}} xmm1 = mem[0,0,2,2] sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vmovsldup {{.*#+}} xmm1 = mem[0,0,2,2] sched: [5:0.50]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movsldup:<br>
; SKYLAKE:       # BB#0:<br>
@@ -819,8 +819,8 @@ define void @test_mwait(i32 %a0, i32 %a1<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    movl %edi, %ecx # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movl %esi, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    mwait # sched: [20:2.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    mwait # sched: [100:0.25]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mwait:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/sse41-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse41-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse41-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/sse41-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/sse41-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -43,8 +43,8 @@ define <2 x double> @test_blendpd(<2 x d<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendpd {{.*#+}} xmm0 = xmm0[0],xmm1[1] sched: [1:0.33]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    vblendpd {{.*#+}} xmm0 = xmm0[0],mem[1] sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendpd {{.*#+}} xmm0 = xmm0[0],mem[1] sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -109,8 +109,8 @@ define <4 x float> @test_blendps(<4 x fl<br>
; BROADWELL-LABEL: test_blendps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendps {{.*#+}} xmm0 = xmm0[0],xmm1[1,2],xmm0[3] sched: [1:0.33]<br>
-; BROADWELL-NEXT:    vblendps {{.*#+}} xmm0 = xmm0[0],mem[1],xmm0[2,3] sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendps {{.*#+}} xmm0 = xmm0[0],mem[1],xmm0[2,3] sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -175,8 +175,8 @@ define <2 x double> @test_blendvpd(<2 x<br>
; BROADWELL-LABEL: test_blendvpd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendvpd %xmm2, %xmm1, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vblendvpd %xmm2, (%rdi), %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendvpd %xmm2, (%rdi), %xmm0, %xmm0 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendvpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -242,8 +242,8 @@ define <4 x float> @test_blendvps(<4 x f<br>
; BROADWELL-LABEL: test_blendvps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vblendvps %xmm2, %xmm1, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vblendvps %xmm2, (%rdi), %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vblendvps %xmm2, (%rdi), %xmm0, %xmm0 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_blendvps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -303,8 +303,8 @@ define <2 x double> @test_dppd(<2 x doub<br>
; BROADWELL-LABEL: test_dppd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vdppd $7, %xmm1, %xmm0, %xmm0 # sched: [9:1.00]<br>
-; BROADWELL-NEXT:    vdppd $7, (%rdi), %xmm0, %xmm0 # sched: [9:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdppd $7, (%rdi), %xmm0, %xmm0 # sched: [14:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_dppd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -364,8 +364,8 @@ define <4 x float> @test_dpps(<4 x float<br>
; BROADWELL-LABEL: test_dpps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vdpps $7, %xmm1, %xmm0, %xmm0 # sched: [14:2.00]<br>
-; BROADWELL-NEXT:    vdpps $7, (%rdi), %xmm0, %xmm0 # sched: [14:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vdpps $7, (%rdi), %xmm0, %xmm0 # sched: [19:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_dpps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -425,8 +425,8 @@ define i32 @test_extractps(<4 x float> %<br>
; BROADWELL-LABEL: test_extractps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vextractps $3, %xmm0, %eax # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vextractps $1, %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vextractps $1, %xmm0, (%rdi) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_extractps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -487,8 +487,8 @@ define <4 x float> @test_insertps(<4 x f<br>
; BROADWELL-LABEL: test_insertps:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vinsertps {{.*#+}} xmm0 = zero,xmm1[0],xmm0[2,3] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vinsertps {{.*#+}} xmm0 = xmm0[0,1,2],mem[0] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vinsertps {{.*#+}} xmm0 = xmm0[0,1,2],mem[0] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_insertps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -543,8 +543,8 @@ define <2 x i64> @test_movntdqa(i8* %a0)<br>
;<br>
; BROADWELL-LABEL: test_movntdqa:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vmovntdqa (%rdi), %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmovntdqa (%rdi), %xmm0 # sched: [5:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_movntdqa:<br>
; SKYLAKE:       # BB#0:<br>
@@ -598,8 +598,8 @@ define <8 x i16> @test_mpsadbw(<16 x i8><br>
; BROADWELL-LABEL: test_mpsadbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vmpsadbw $7, %xmm1, %xmm0, %xmm0 # sched: [7:2.00]<br>
-; BROADWELL-NEXT:    vmpsadbw $7, (%rdi), %xmm0, %xmm0 # sched: [7:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vmpsadbw $7, (%rdi), %xmm0, %xmm0 # sched: [12:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_mpsadbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -660,8 +660,8 @@ define <8 x i16> @test_packusdw(<4 x i32<br>
; BROADWELL-LABEL: test_packusdw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpackusdw %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpackusdw (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpackusdw (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_packusdw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -728,8 +728,8 @@ define <16 x i8> @test_pblendvb(<16 x i8<br>
; BROADWELL-LABEL: test_pblendvb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpblendvb %xmm2, %xmm1, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpblendvb %xmm2, (%rdi), %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpblendvb %xmm2, (%rdi), %xmm0, %xmm0 # sched: [7:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pblendvb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -789,8 +789,8 @@ define <8 x i16> @test_pblendw(<8 x i16><br>
; BROADWELL-LABEL: test_pblendw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpblendw {{.*#+}} xmm0 = xmm0[0],xmm1[1],xmm0[2],xmm1[3],xmm0[4],xmm1[5],xmm0[6],xmm1[7] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpblendw {{.*#+}} xmm0 = xmm0[0,1],mem[2,3],xmm0[4,5,6],mem[7] sched: [4:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpblendw {{.*#+}} xmm0 = xmm0[0,1],mem[2,3],xmm0[4,5,6],mem[7] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pblendw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -849,8 +849,8 @@ define <2 x i64> @test_pcmpeqq(<2 x i64><br>
; BROADWELL-LABEL: test_pcmpeqq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpeqq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpcmpeqq (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpeqq (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpeqq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -913,8 +913,8 @@ define i32 @test_pextrb(<16 x i8> %a0, i<br>
; BROADWELL-LABEL: test_pextrb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpextrb $3, %xmm0, %eax # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpextrb $1, %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpextrb $1, %xmm0, (%rdi) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pextrb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -979,8 +979,8 @@ define i32 @test_pextrd(<4 x i32> %a0, i<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpaddd %xmm0, %xmm0, %xmm0 # sched: [1:0.50]<br>
; BROADWELL-NEXT:    vpextrd $3, %xmm0, %eax # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpextrd $1, %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpextrd $1, %xmm0, (%rdi) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pextrd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1044,8 +1044,8 @@ define i64 @test_pextrq(<2 x i64> %a0, <<br>
; BROADWELL-LABEL: test_pextrq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpextrq $1, %xmm0, %rax # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpextrq $1, %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpextrq $1, %xmm0, (%rdi) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pextrq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1104,8 +1104,8 @@ define i32 @test_pextrw(<8 x i16> %a0, i<br>
; BROADWELL-LABEL: test_pextrw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpextrw $3, %xmm0, %eax # sched: [2:1.00]<br>
-; BROADWELL-NEXT:    vpextrw $1, %xmm0, (%rdi) # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpextrw $1, %xmm0, (%rdi) # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pextrw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1164,9 +1164,9 @@ define <8 x i16> @test_phminposuw(<8 x i<br>
;<br>
; BROADWELL-LABEL: test_phminposuw:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vphminposuw (%rdi), %xmm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    vphminposuw (%rdi), %xmm0 # sched: [10:1.00]<br>
; BROADWELL-NEXT:    vphminposuw %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phminposuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1226,8 +1226,8 @@ define <16 x i8> @test_pinsrb(<16 x i8><br>
; BROADWELL-LABEL: test_pinsrb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpinsrb $1, %edi, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpinsrb $3, (%rsi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpinsrb $3, (%rsi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pinsrb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1286,8 +1286,8 @@ define <4 x i32> @test_pinsrd(<4 x i32><br>
; BROADWELL-LABEL: test_pinsrd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpinsrd $1, %edi, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpinsrd $3, (%rsi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpinsrd $3, (%rsi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pinsrd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1350,9 +1350,9 @@ define <2 x i64> @test_pinsrq(<2 x i64><br>
; BROADWELL-LABEL: test_pinsrq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpinsrq $1, %rdi, %xmm0, %xmm0 # sched: [2:2.00]<br>
-; BROADWELL-NEXT:    vpinsrq $1, (%rsi), %xmm1, %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpinsrq $1, (%rsi), %xmm1, %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pinsrq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1416,8 +1416,8 @@ define <16 x i8> @test_pmaxsb(<16 x i8><br>
; BROADWELL-LABEL: test_pmaxsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxsb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxsb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxsb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1477,8 +1477,8 @@ define <4 x i32> @test_pmaxsd(<4 x i32><br>
; BROADWELL-LABEL: test_pmaxsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxsd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxsd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxsd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1538,8 +1538,8 @@ define <4 x i32> @test_pmaxud(<4 x i32><br>
; BROADWELL-LABEL: test_pmaxud:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxud %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxud (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxud (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxud:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1599,8 +1599,8 @@ define <8 x i16> @test_pmaxuw(<8 x i16><br>
; BROADWELL-LABEL: test_pmaxuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaxuw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpmaxuw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaxuw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaxuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1660,8 +1660,8 @@ define <16 x i8> @test_pminsb(<16 x i8><br>
; BROADWELL-LABEL: test_pminsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminsb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminsb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminsb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1721,8 +1721,8 @@ define <4 x i32> @test_pminsd(<4 x i32><br>
; BROADWELL-LABEL: test_pminsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminsd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminsd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminsd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1782,8 +1782,8 @@ define <4 x i32> @test_pminud(<4 x i32><br>
; BROADWELL-LABEL: test_pminud:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminud %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminud (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminud (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminud:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1843,8 +1843,8 @@ define <8 x i16> @test_pminuw(<8 x i16><br>
; BROADWELL-LABEL: test_pminuw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpminuw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpminuw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpminuw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pminuw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1909,9 +1909,9 @@ define <8 x i16> @test_pmovsxbw(<16 x i8<br>
; BROADWELL-LABEL: test_pmovsxbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxbw %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxbw (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxbw (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1981,9 +1981,9 @@ define <4 x i32> @test_pmovsxbd(<16 x i8<br>
; BROADWELL-LABEL: test_pmovsxbd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxbd %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxbd (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxbd (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxbd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2053,9 +2053,9 @@ define <2 x i64> @test_pmovsxbq(<16 x i8<br>
; BROADWELL-LABEL: test_pmovsxbq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxbq %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxbq (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxbq (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxbq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2125,9 +2125,9 @@ define <2 x i64> @test_pmovsxdq(<4 x i32<br>
; BROADWELL-LABEL: test_pmovsxdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxdq %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxdq (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxdq (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2197,9 +2197,9 @@ define <4 x i32> @test_pmovsxwd(<8 x i16<br>
; BROADWELL-LABEL: test_pmovsxwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxwd %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxwd (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxwd (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2269,9 +2269,9 @@ define <2 x i64> @test_pmovsxwq(<8 x i16<br>
; BROADWELL-LABEL: test_pmovsxwq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovsxwq %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovsxwq (%rdi), %xmm1 # sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovsxwq (%rdi), %xmm1 # sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovsxwq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2341,9 +2341,9 @@ define <8 x i16> @test_pmovzxbw(<16 x i8<br>
; BROADWELL-LABEL: test_pmovzxbw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxbw {{.*#+}} xmm0 = xmm0[0],zero,xmm0[1],zero,xmm0[2],zero,xmm0[3],zero,xmm0[4],zero,xmm0[5],zero,xmm0[6],zero,xmm0[7],zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxbw {{.*#+}} xmm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero,mem[4],zero,mem[5],zero,mem[6],zero,mem[7],zero sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxbw {{.*#+}} xmm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero,mem[4],zero,mem[5],zero,mem[6],zero,mem[7],zero sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxbw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2413,9 +2413,9 @@ define <4 x i32> @test_pmovzxbd(<16 x i8<br>
; BROADWELL-LABEL: test_pmovzxbd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxbd {{.*#+}} xmm0 = xmm0[0],zero,zero,zero,xmm0[1],zero,zero,zero,xmm0[2],zero,zero,zero,xmm0[3],zero,zero,zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxbd {{.*#+}} xmm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxbd {{.*#+}} xmm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero,mem[2],zero,zero,zero,mem[3],zero,zero,zero sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxbd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2485,9 +2485,9 @@ define <2 x i64> @test_pmovzxbq(<16 x i8<br>
; BROADWELL-LABEL: test_pmovzxbq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxbq {{.*#+}} xmm0 = xmm0[0],zero,zero,zero,zero,zero,zero,zero,xmm0[1],zero,zero,zero,zero,zero,zero,zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxbq {{.*#+}} xmm1 = mem[0],zero,zero,zero,zero,zero,zero,zero,mem[1],zero,zero,zero,zero,zero,zero,zero sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxbq {{.*#+}} xmm1 = mem[0],zero,zero,zero,zero,zero,zero,zero,mem[1],zero,zero,zero,zero,zero,zero,zero sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxbq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2557,9 +2557,9 @@ define <2 x i64> @test_pmovzxdq(<4 x i32<br>
; BROADWELL-LABEL: test_pmovzxdq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxdq {{.*#+}} xmm0 = xmm0[0],zero,xmm0[1],zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxdq {{.*#+}} xmm1 = mem[0],zero,mem[1],zero sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxdq {{.*#+}} xmm1 = mem[0],zero,mem[1],zero sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxdq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2629,9 +2629,9 @@ define <4 x i32> @test_pmovzxwd(<8 x i16<br>
; BROADWELL-LABEL: test_pmovzxwd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxwd {{.*#+}} xmm0 = xmm0[0],zero,xmm0[1],zero,xmm0[2],zero,xmm0[3],zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxwd {{.*#+}} xmm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxwd {{.*#+}} xmm1 = mem[0],zero,mem[1],zero,mem[2],zero,mem[3],zero sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxwd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2701,9 +2701,9 @@ define <2 x i64> @test_pmovzxwq(<8 x i16<br>
; BROADWELL-LABEL: test_pmovzxwq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmovzxwq {{.*#+}} xmm0 = xmm0[0],zero,zero,zero,xmm0[1],zero,zero,zero sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpmovzxwq {{.*#+}} xmm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero sched: [1:1.00]<br>
+; BROADWELL-NEXT:    vpmovzxwq {{.*#+}} xmm1 = mem[0],zero,zero,zero,mem[1],zero,zero,zero sched: [6:1.00]<br>
; BROADWELL-NEXT:    vpaddq %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmovzxwq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2768,8 +2768,8 @@ define <2 x i64> @test_pmuldq(<4 x i32><br>
; BROADWELL-LABEL: test_pmuldq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmuldq %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmuldq (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmuldq (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmuldq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2830,8 +2830,8 @@ define <4 x i32> @test_pmulld(<4 x i32><br>
; BROADWELL-LABEL: test_pmulld:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulld %xmm1, %xmm0, %xmm0 # sched: [10:2.00]<br>
-; BROADWELL-NEXT:    vpmulld (%rdi), %xmm0, %xmm0 # sched: [10:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulld (%rdi), %xmm0, %xmm0 # sched: [15:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulld:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2907,11 +2907,11 @@ define i32 @test_ptest(<2 x i64> %a0, <2<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vptest %xmm1, %xmm0 # sched: [2:1.00]<br>
; BROADWELL-NEXT:    setb %al # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vptest (%rdi), %xmm0 # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vptest (%rdi), %xmm0 # sched: [7:1.00]<br>
; BROADWELL-NEXT:    setb %cl # sched: [1:0.50]<br>
; BROADWELL-NEXT:    andb %al, %cl # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movzbl %cl, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_ptest:<br>
; SKYLAKE:       # BB#0:<br>
@@ -2992,10 +2992,10 @@ define <2 x double> @test_roundpd(<2 x d<br>
;<br>
; BROADWELL-LABEL: test_roundpd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vroundpd $7, %xmm0, %xmm0 # sched: [5:1.25]<br>
-; BROADWELL-NEXT:    vroundpd $7, (%rdi), %xmm1 # sched: [6:2.00]<br>
+; BROADWELL-NEXT:    vroundpd $7, %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    vroundpd $7, (%rdi), %xmm1 # sched: [11:2.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_roundpd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3064,10 +3064,10 @@ define <4 x float> @test_roundps(<4 x fl<br>
;<br>
; BROADWELL-LABEL: test_roundps:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vroundps $7, %xmm0, %xmm0 # sched: [5:1.25]<br>
-; BROADWELL-NEXT:    vroundps $7, (%rdi), %xmm1 # sched: [6:2.00]<br>
+; BROADWELL-NEXT:    vroundps $7, %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    vroundps $7, (%rdi), %xmm1 # sched: [11:2.00]<br>
; BROADWELL-NEXT:    vaddps %xmm1, %xmm0, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_roundps:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3137,10 +3137,10 @@ define <2 x double> @test_roundsd(<2 x d<br>
;<br>
; BROADWELL-LABEL: test_roundsd:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vroundsd $7, %xmm1, %xmm0, %xmm1 # sched: [5:1.25]<br>
-; BROADWELL-NEXT:    vroundsd $7, (%rdi), %xmm0, %xmm0 # sched: [6:2.00]<br>
+; BROADWELL-NEXT:    vroundsd $7, %xmm1, %xmm0, %xmm1 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    vroundsd $7, (%rdi), %xmm0, %xmm0 # sched: [11:2.00]<br>
; BROADWELL-NEXT:    vaddpd %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_roundsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -3210,10 +3210,10 @@ define <4 x float> @test_roundss(<4 x fl<br>
;<br>
; BROADWELL-LABEL: test_roundss:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vroundss $7, %xmm1, %xmm0, %xmm1 # sched: [5:1.25]<br>
-; BROADWELL-NEXT:    vroundss $7, (%rdi), %xmm0, %xmm0 # sched: [6:2.00]<br>
+; BROADWELL-NEXT:    vroundss $7, %xmm1, %xmm0, %xmm1 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    vroundss $7, (%rdi), %xmm0, %xmm0 # sched: [11:2.00]<br>
; BROADWELL-NEXT:    vaddps %xmm0, %xmm1, %xmm0 # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_roundss:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/sse42-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse42-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sse42-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/sse42-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/sse42-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -42,9 +42,9 @@ define i32 @crc32_32_8(i32 %a0, i8 %a1,<br>
; BROADWELL-LABEL: crc32_32_8:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    crc32b %sil, %edi # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    crc32b (%rdx), %edi # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    crc32b (%rdx), %edi # sched: [8:1.00]<br>
; BROADWELL-NEXT:    movl %edi, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: crc32_32_8:<br>
; SKYLAKE:       # BB#0:<br>
@@ -112,9 +112,9 @@ define i32 @crc32_32_16(i32 %a0, i16 %a1<br>
; BROADWELL-LABEL: crc32_32_16:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    crc32w %si, %edi # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    crc32w (%rdx), %edi # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    crc32w (%rdx), %edi # sched: [8:1.00]<br>
; BROADWELL-NEXT:    movl %edi, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: crc32_32_16:<br>
; SKYLAKE:       # BB#0:<br>
@@ -182,9 +182,9 @@ define i32 @crc32_32_32(i32 %a0, i32 %a1<br>
; BROADWELL-LABEL: crc32_32_32:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    crc32l %esi, %edi # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    crc32l (%rdx), %edi # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    crc32l (%rdx), %edi # sched: [8:1.00]<br>
; BROADWELL-NEXT:    movl %edi, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: crc32_32_32:<br>
; SKYLAKE:       # BB#0:<br>
@@ -252,9 +252,9 @@ define i64 @crc32_64_8(i64 %a0, i8 %a1,<br>
; BROADWELL-LABEL: crc32_64_8:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    crc32b %sil, %edi # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    crc32b (%rdx), %edi # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    crc32b (%rdx), %edi # sched: [8:1.00]<br>
; BROADWELL-NEXT:    movq %rdi, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: crc32_64_8:<br>
; SKYLAKE:       # BB#0:<br>
@@ -322,9 +322,9 @@ define i64 @crc32_64_64(i64 %a0, i64 %a1<br>
; BROADWELL-LABEL: crc32_64_64:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    crc32q %rsi, %rdi # sched: [3:1.00]<br>
-; BROADWELL-NEXT:    crc32q (%rdx), %rdi # sched: [7:1.00]<br>
+; BROADWELL-NEXT:    crc32q (%rdx), %rdi # sched: [8:1.00]<br>
; BROADWELL-NEXT:    movq %rdi, %rax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: crc32_64_64:<br>
; SKYLAKE:       # BB#0:<br>
@@ -421,10 +421,10 @@ define i32 @test_pcmpestri(<16 x i8> %a0<br>
; BROADWELL-NEXT:    movl %ecx, %esi # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movl $7, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movl $7, %edx # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vpcmpestri $7, (%rdi), %xmm0 # sched: [18:4.00]<br>
+; BROADWELL-NEXT:    vpcmpestri $7, (%rdi), %xmm0 # sched: [23:4.00]<br>
; BROADWELL-NEXT:    # kill: %ECX&lt;def&gt; %ECX&lt;kill&gt; %RCX&lt;def&gt;<br>
; BROADWELL-NEXT:    leal (%rcx,%rsi), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpestri:<br>
; SKYLAKE:       # BB#0:<br>
@@ -533,8 +533,8 @@ define <16 x i8> @test_pcmpestrm(<16 x i<br>
; BROADWELL-NEXT:    vpcmpestrm $7, %xmm1, %xmm0 # sched: [19:4.00]<br>
; BROADWELL-NEXT:    movl $7, %eax # sched: [1:0.25]<br>
; BROADWELL-NEXT:    movl $7, %edx # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vpcmpestrm $7, (%rdi), %xmm0 # sched: [19:4.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpestrm $7, (%rdi), %xmm0 # sched: [24:4.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpestrm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -623,10 +623,10 @@ define i32 @test_pcmpistri(<16 x i8> %a0<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpistri $7, %xmm1, %xmm0 # sched: [11:3.00]<br>
; BROADWELL-NEXT:    movl %ecx, %eax # sched: [1:0.25]<br>
-; BROADWELL-NEXT:    vpcmpistri $7, (%rdi), %xmm0 # sched: [11:3.00]<br>
+; BROADWELL-NEXT:    vpcmpistri $7, (%rdi), %xmm0 # sched: [16:3.00]<br>
; BROADWELL-NEXT:    # kill: %ECX&lt;def&gt; %ECX&lt;kill&gt; %RCX&lt;def&gt;<br>
; BROADWELL-NEXT:    leal (%rcx,%rax), %eax # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpistri:<br>
; SKYLAKE:       # BB#0:<br>
@@ -699,8 +699,8 @@ define <16 x i8> @test_pcmpistrm(<16 x i<br>
; BROADWELL-LABEL: test_pcmpistrm:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpistrm $7, %xmm1, %xmm0 # sched: [11:3.00]<br>
-; BROADWELL-NEXT:    vpcmpistrm $7, (%rdi), %xmm0 # sched: [11:3.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpistrm $7, (%rdi), %xmm0 # sched: [16:3.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpistrm:<br>
; SKYLAKE:       # BB#0:<br>
@@ -760,8 +760,8 @@ define <2 x i64> @test_pcmpgtq(<2 x i64><br>
; BROADWELL-LABEL: test_pcmpgtq:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpcmpgtq %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpcmpgtq (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpcmpgtq (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pcmpgtq:<br>
; SKYLAKE:       # BB#0:<br>
@@ -823,9 +823,9 @@ define <2 x i64> @test_pclmulqdq(<2 x i6<br>
;<br>
; BROADWELL-LABEL: test_pclmulqdq:<br>
; BROADWELL:       # BB#0:<br>
-; BROADWELL-NEXT:    vpclmulqdq $0, %xmm1, %xmm0, %xmm0 # sched: [11:2.00]<br>
-; BROADWELL-NEXT:    vpclmulqdq $0, (%rdi), %xmm0, %xmm0 # sched: [11:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpclmulqdq $0, %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
+; BROADWELL-NEXT:    vpclmulqdq $0, (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pclmulqdq:<br>
; SKYLAKE:       # BB#0:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/ssse3-schedule.ll<br>
URL:<span class="apple-converted-space"> </span><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/ssse3-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff"><span style="color:purple">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/ssse3-schedule.ll?rev=316492&r1=316491&r2=316492&view=diff</span></a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/ssse3-schedule.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/ssse3-schedule.ll Tue Oct 24 13:19:47 2017<br>
@@ -51,9 +51,9 @@ define <16 x i8> @test_pabsb(<16 x i8> %<br>
; BROADWELL-LABEL: test_pabsb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpabsb %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpabsb (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpabsb (%rdi), %xmm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -130,9 +130,9 @@ define <4 x i32> @test_pabsd(<4 x i32> %<br>
; BROADWELL-LABEL: test_pabsd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpabsd %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpabsd (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpabsd (%rdi), %xmm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -209,9 +209,9 @@ define <8 x i16> @test_pabsw(<8 x i16> %<br>
; BROADWELL-LABEL: test_pabsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpabsw %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpabsw (%rdi), %xmm1 # sched: [1:0.50]<br>
+; BROADWELL-NEXT:    vpabsw (%rdi), %xmm1 # sched: [6:0.50]<br>
; BROADWELL-NEXT:    vpor %xmm1, %xmm0, %xmm0 # sched: [1:0.33]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pabsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -287,8 +287,8 @@ define <8 x i16> @test_palignr(<8 x i16><br>
; BROADWELL-LABEL: test_palignr:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpalignr {{.*#+}} xmm0 = xmm0[6,7,8,9,10,11,12,13,14,15],xmm1[0,1,2,3,4,5] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpalignr {{.*#+}} xmm0 = mem[14,15],xmm0[0,1,2,3,4,5,6,7,8,9,10,11,12,13] sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpalignr {{.*#+}} xmm0 = mem[14,15],xmm0[0,1,2,3,4,5,6,7,8,9,10,11,12,13] sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_palignr:<br>
; SKYLAKE:       # BB#0:<br>
@@ -353,8 +353,8 @@ define <4 x i32> @test_phaddd(<4 x i32><br>
; BROADWELL-LABEL: test_phaddd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphaddd %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphaddd (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphaddd (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -420,8 +420,8 @@ define <8 x i16> @test_phaddsw(<8 x i16><br>
; BROADWELL-LABEL: test_phaddsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphaddsw %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphaddsw (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphaddsw (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -487,8 +487,8 @@ define <8 x i16> @test_phaddw(<8 x i16><br>
; BROADWELL-LABEL: test_phaddw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphaddw %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphaddw (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphaddw (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phaddw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -554,8 +554,8 @@ define <4 x i32> @test_phsubd(<4 x i32><br>
; BROADWELL-LABEL: test_phsubd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphsubd %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphsubd (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphsubd (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -621,8 +621,8 @@ define <8 x i16> @test_phsubsw(<8 x i16><br>
; BROADWELL-LABEL: test_phsubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphsubsw %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphsubsw (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphsubsw (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -688,8 +688,8 @@ define <8 x i16> @test_phsubw(<8 x i16><br>
; BROADWELL-LABEL: test_phsubw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vphsubw %xmm1, %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    vphsubw (%rdi), %xmm0, %xmm0 # sched: [3:2.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vphsubw (%rdi), %xmm0, %xmm0 # sched: [8:2.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_phsubw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -755,8 +755,8 @@ define <8 x i16> @test_pmaddubsw(<16 x i<br>
; BROADWELL-LABEL: test_pmaddubsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmaddubsw %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmaddubsw (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmaddubsw (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmaddubsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -823,8 +823,8 @@ define <8 x i16> @test_pmulhrsw(<8 x i16<br>
; BROADWELL-LABEL: test_pmulhrsw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpmulhrsw %xmm1, %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    vpmulhrsw (%rdi), %xmm0, %xmm0 # sched: [5:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpmulhrsw (%rdi), %xmm0, %xmm0 # sched: [10:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pmulhrsw:<br>
; SKYLAKE:       # BB#0:<br>
@@ -890,8 +890,8 @@ define <16 x i8> @test_pshufb(<16 x i8><br>
; BROADWELL-LABEL: test_pshufb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpshufb %xmm1, %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    vpshufb (%rdi), %xmm0, %xmm0 # sched: [1:1.00]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpshufb (%rdi), %xmm0, %xmm0 # sched: [6:1.00]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_pshufb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -961,8 +961,8 @@ define <16 x i8> @test_psignb(<16 x i8><br>
; BROADWELL-LABEL: test_psignb:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsignb %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsignb (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsignb (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignb:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1032,8 +1032,8 @@ define <4 x i32> @test_psignd(<4 x i32><br>
; BROADWELL-LABEL: test_psignd:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsignd %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsignd (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsignd (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignd:<br>
; SKYLAKE:       # BB#0:<br>
@@ -1103,8 +1103,8 @@ define <8 x i16> @test_psignw(<8 x i16><br>
; BROADWELL-LABEL: test_psignw:<br>
; BROADWELL:       # BB#0:<br>
; BROADWELL-NEXT:    vpsignw %xmm1, %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    vpsignw (%rdi), %xmm0, %xmm0 # sched: [1:0.50]<br>
-; BROADWELL-NEXT:    retq # sched: [2:1.00]<br>
+; BROADWELL-NEXT:    vpsignw (%rdi), %xmm0, %xmm0 # sched: [6:0.50]<br>
+; BROADWELL-NEXT:    retq # sched: [7:1.00]<br>
;<br>
; SKYLAKE-LABEL: test_psignw:<br>
; SKYLAKE:       # BB#0:<br>
<br>
<br>
_______________________________________________<br>
llvm-commits mailing list<br>
<a href="mailto:llvm-commits@lists.llvm.org"><span style="color:purple">llvm-commits@lists.llvm.org</span></a><br>
<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits"><span style="color:purple">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits</span></a><o:p></o:p></p>
</div>
</div>
</div>
</div>
</blockquote>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</blockquote>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
</div>
<div>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Helvetica",sans-serif">---------------------------------------------------------------------<br>
Intel Israel (74) Limited</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Helvetica",sans-serif">This e-mail and any attachments may contain confidential material for<br>
the sole use of the intended recipient(s). Any review or distribution<br>
by others is strictly prohibited. If you are not the intended<br>
recipient, please contact the sender and delete all copies.</span><o:p></o:p></p>
</div>
</div>
</blockquote>
</div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
</div>
</blockquote>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</div>
</body>
</html>