<div dir="ltr"><div dir="ltr"><br><div class="gmail_quote"><div dir="ltr">On Wed, Dec 5, 2018 at 5:40 PM Sanjay Patel <<a href="mailto:spatel@rotateright.com">spatel@rotateright.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><div dir="auto">Hi David,</div></div><div dir="auto"><br></div><div dir="auto">Thanks for reporting the problem. I don’t have any guesses as to how this could cause OOM. Cc’ing some people that might.</div></blockquote><div><br></div><div>+a few more, thanks! :-)</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"></div><div dir="auto">If the problem is only showing up on PPC, a quick hack would be to add a TLI hook and disable the transform for PPC. I’d prefer that to a full revert (especially since I’ve already enhanced this code to handle some vectors, and another enhancement is in progress)...unless we have a test case that shows it’s not a PPC-specific bug?</div><div dir="auto"><br></div></blockquote><div><br></div><div>Agreed on avoiding a revert... this looks like a pretty hard-fought change, based on <a href="https://bugs.llvm.org/show_bug.cgi?id=32023">https://bugs.llvm.org/show_bug.cgi?id=32023</a>.</div><div><br></div><div>I do believe this is PPC-specific (although, as is often the case, your change may simply be tickling a bug somewhere else).</div><div><br></div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"></div><div dir="auto">Another option is to add a disablement flag and default it to disable until we have a reproducer.</div><div><div dir="auto">I can add the TLI hook or cripple flag tomorrow if that sounds reasonable.</div></div></blockquote><div><br></div><div>Sure, that's fine. I might find some more useful clues by then, though, so I'll keep you in the loop.</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"><div class="gmail_quote" dir="auto"><div dir="ltr">On Wed, Dec 5, 2018 at 6:16 PM David Jones <<a href="mailto:dlj@google.com" target="_blank" class="gmail-cremed cremed">dlj@google.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr"><div dir="ltr">Hi Sanjay,<div><br></div><div>I've seen some instances of Clang going OOM, and it appears to cleanly bisect to this revision. The affected builds are while building Clang itself (FrontendAction, Sema, and a couple of others) targeting PPC with ASAN (which uses -O1).</div><div><br></div><div>Unfortunately, since the reproduction requires running to OOM, reproducing takes longer than I would like. I haven't been able to get a useful, reduced example, but I am trying to get one (manually and via creduce). I'm also trying to reproduce with a Clang built with a few different sanitizers (although these are also a bit finicky, so no luck yet there, either.)</div><div><br></div><div>I'm not familiar with this area of LLVM, so the best fix I can offer is a revert.</div><div><br></div><div>Could you advise on a fix? (If you don't have a timely fix in mind, please let me know if you would like me to revert the revision for you.)</div><div><br></div><div>Thanks,</div><div>David Jones</div></div></div></div><br><div class="gmail_quote"></div><div class="gmail_quote"><div dir="ltr">On Thu, Nov 29, 2018 at 1:01 PM Sanjay Patel via llvm-commits <<a href="mailto:llvm-commits@lists.llvm.org" target="_blank" class="gmail-cremed cremed">llvm-commits@lists.llvm.org</a>> wrote:<br></div></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"></blockquote></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Author: spatel<br>
Date: Thu Nov 29 12:58:26 2018<br>
New Revision: 347917<br>
<br>
URL: <a href="http://llvm.org/viewvc/llvm-project?rev=347917&view=rev" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project?rev=347917&view=rev</a><br>
Log:<br>
[DAGCombiner] narrow truncated binops<br>
<br>
The motivating case for this is shown in:<br>
<a href="https://bugs.llvm.org/show_bug.cgi?id=32023" rel="noreferrer" target="_blank" class="gmail-cremed cremed">https://bugs.llvm.org/show_bug.cgi?id=32023</a><br>
and the corresponding rot16.ll regression tests.<br>
<br>
Because x86 scalar shift amounts are i8 values, we can end up with trunc-binop-trunc <br>
sequences that don't get folded in IR.<br>
<br>
As the TODO comments suggest, there will be regressions if we extend this (for x86, <br>
we mostly seem to be missing LEA opportunities, but there are likely vector folds <br>
missing too). I think those should be considered existing bugs because this is the <br>
same transform that we do as an IR canonicalization in instcombine. We just need <br>
more tests to make those visible independent of this patch.<br>
<br>
Differential Revision: <a href="https://reviews.llvm.org/D54640" rel="noreferrer" target="_blank" class="gmail-cremed cremed">https://reviews.llvm.org/D54640</a><br>
<br>
Modified:<br>
llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp<br>
llvm/trunk/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll<br>
llvm/trunk/test/CodeGen/X86/2008-09-11-CoalescerBug2.ll<br>
llvm/trunk/test/CodeGen/X86/2010-08-04-MaskedSignedCompare.ll<br>
llvm/trunk/test/CodeGen/X86/add-sub-nsw-nuw.ll<br>
llvm/trunk/test/CodeGen/X86/bool-math.ll<br>
llvm/trunk/test/CodeGen/X86/clear-lowbits.ll<br>
llvm/trunk/test/CodeGen/X86/cmov.ll<br>
llvm/trunk/test/CodeGen/X86/extract-bits.ll<br>
llvm/trunk/test/CodeGen/X86/extract-lowbits.ll<br>
llvm/trunk/test/CodeGen/X86/fshl.ll<br>
llvm/trunk/test/CodeGen/X86/fshr.ll<br>
llvm/trunk/test/CodeGen/X86/funnel-shift-rot.ll<br>
llvm/trunk/test/CodeGen/X86/funnel-shift.ll<br>
llvm/trunk/test/CodeGen/X86/pr32284.ll<br>
llvm/trunk/test/CodeGen/X86/pr37879.ll<br>
llvm/trunk/test/CodeGen/X86/rot16.ll<br>
llvm/trunk/test/CodeGen/X86/rotate.ll<br>
llvm/trunk/test/CodeGen/X86/rotate4.ll<br>
llvm/trunk/test/CodeGen/X86/schedule-x86-64-shld.ll<br>
llvm/trunk/test/CodeGen/X86/scheduler-backtracking.ll<br>
llvm/trunk/test/CodeGen/X86/test-shrink.ll<br>
llvm/trunk/test/CodeGen/X86/xchg-nofold.ll<br>
<br>
Modified: llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp (original)<br>
+++ llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp Thu Nov 29 12:58:26 2018<br>
@@ -9722,6 +9722,28 @@ SDValue DAGCombiner::visitTRUNCATE(SDNod<br>
if (SDValue NewVSel = matchVSelectOpSizesWithSetCC(N))<br>
return NewVSel;<br>
<br>
+ // Narrow a suitable binary operation with a constant operand by moving it<br>
+ // ahead of the truncate. This is limited to pre-legalization because targets<br>
+ // may prefer a wider type during later combines and invert this transform.<br>
+ switch (N0.getOpcode()) {<br>
+ // TODO: Add case for ADD - that will likely require a change in logic here<br>
+ // or target-specific changes to avoid regressions.<br>
+ case ISD::SUB:<br>
+ case ISD::MUL:<br>
+ case ISD::AND:<br>
+ case ISD::OR:<br>
+ case ISD::XOR:<br>
+ // TODO: This should allow vector constants/types too.<br>
+ if (!LegalOperations && N0.hasOneUse() &&<br>
+ (isa<ConstantSDNode>(N0.getOperand(0)) ||<br>
+ isa<ConstantSDNode>(N0.getOperand(1)))) {<br>
+ SDLoc DL(N);<br>
+ SDValue NarrowL = DAG.getNode(ISD::TRUNCATE, DL, VT, N0.getOperand(0));<br>
+ SDValue NarrowR = DAG.getNode(ISD::TRUNCATE, DL, VT, N0.getOperand(1));<br>
+ return DAG.getNode(N0.getOpcode(), DL, VT, NarrowL, NarrowR);<br>
+ }<br>
+ }<br>
+<br>
return SDValue();<br>
}<br>
<br>
<br>
Modified: llvm/trunk/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll (original)<br>
+++ llvm/trunk/test/CodeGen/AMDGPU/cgp-bitfield-extract.ll Thu Nov 29 12:58:26 2018<br>
@@ -125,11 +125,11 @@ ret:<br>
; GCN: s_cbranch_scc1<br>
<br>
; SI: s_bfe_u32 s{{[0-9]+}}, s{{[0-9]+}}, 0x80004<br>
-; VI: s_and_b32 s{{[0-9]+}}, [[BFE]], 0xff<br>
+; VI: v_mov_b32_e32 v{{[0-9]+}}, 0xff<br>
<br>
; GCN: BB2_2:<br>
; SI: s_bfe_u32 s{{[0-9]+}}, s{{[0-9]+}}, 0x70004<br>
-; VI: s_and_b32 s{{[0-9]+}}, [[BFE]], 0x7f<br>
+; VI: v_mov_b32_e32 v{{[0-9]+}}, 0x7f<br>
<br>
; GCN: BB2_3:<br>
; GCN: buffer_store_short<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/2008-09-11-CoalescerBug2.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/2008-09-11-CoalescerBug2.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/2008-09-11-CoalescerBug2.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/2008-09-11-CoalescerBug2.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/2008-09-11-CoalescerBug2.ll Thu Nov 29 12:58:26 2018<br>
@@ -17,7 +17,7 @@ define i32 @func_44(i16 signext %p_46) n<br>
; SOURCE-SCHED-NEXT: setg %cl<br>
; SOURCE-SCHED-NEXT: movb g_73, %dl<br>
; SOURCE-SCHED-NEXT: xorl %eax, %eax<br>
-; SOURCE-SCHED-NEXT: subl {{[0-9]+}}(%esp), %eax<br>
+; SOURCE-SCHED-NEXT: subb {{[0-9]+}}(%esp), %al<br>
; SOURCE-SCHED-NEXT: testb %dl, %dl<br>
; SOURCE-SCHED-NEXT: jne .LBB0_2<br>
; SOURCE-SCHED-NEXT: # %bb.1: # %bb11<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/2010-08-04-MaskedSignedCompare.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/2010-08-04-MaskedSignedCompare.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/2010-08-04-MaskedSignedCompare.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/2010-08-04-MaskedSignedCompare.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/2010-08-04-MaskedSignedCompare.ll Thu Nov 29 12:58:26 2018<br>
@@ -11,7 +11,7 @@ define i32 @main() nounwind {<br>
; CHECK: # %bb.0: # %entry<br>
; CHECK-NEXT: xorl %eax, %eax<br>
; CHECK-NEXT: cmpq {{.*}}(%rip), %rax<br>
-; CHECK-NEXT: sbbl %eax, %eax<br>
+; CHECK-NEXT: sbbb %al, %al<br>
; CHECK-NEXT: testb $-106, %al<br>
; CHECK-NEXT: jle .LBB0_1<br>
; CHECK-NEXT: # %bb.2: # %if.then<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/add-sub-nsw-nuw.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/add-sub-nsw-nuw.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/add-sub-nsw-nuw.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/add-sub-nsw-nuw.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/add-sub-nsw-nuw.ll Thu Nov 29 12:58:26 2018<br>
@@ -9,7 +9,7 @@ define i8 @PR30841(i64 %argc) {<br>
; CHECK-LABEL: PR30841:<br>
; CHECK: ## %bb.0: ## %entry<br>
; CHECK-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; CHECK-NEXT: negl %eax<br>
+; CHECK-NEXT: negb %al<br>
; CHECK-NEXT: ## kill: def $al killed $al killed $eax<br>
; CHECK-NEXT: retl<br>
entry:<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/bool-math.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/bool-math.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/bool-math.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/bool-math.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/bool-math.ll Thu Nov 29 12:58:26 2018<br>
@@ -33,7 +33,7 @@ define i8 @sub_zext_cmp_mask_narrower_re<br>
; CHECK-LABEL: sub_zext_cmp_mask_narrower_result:<br>
; CHECK: # %bb.0:<br>
; CHECK-NEXT: movl %edi, %eax<br>
-; CHECK-NEXT: andl $1, %eax<br>
+; CHECK-NEXT: andb $1, %al<br>
; CHECK-NEXT: orb $46, %al<br>
; CHECK-NEXT: # kill: def $al killed $al killed $eax<br>
; CHECK-NEXT: retq<br>
@@ -77,7 +77,7 @@ define i8 @add_zext_cmp_mask_narrower_re<br>
; CHECK-LABEL: add_zext_cmp_mask_narrower_result:<br>
; CHECK: # %bb.0:<br>
; CHECK-NEXT: movl %edi, %eax<br>
-; CHECK-NEXT: andl $1, %eax<br>
+; CHECK-NEXT: andb $1, %al<br>
; CHECK-NEXT: xorb $43, %al<br>
; CHECK-NEXT: # kill: def $al killed $al killed $eax<br>
; CHECK-NEXT: retq<br>
@@ -159,7 +159,7 @@ define i8 @low_bit_select_constants_bigg<br>
; CHECK-LABEL: low_bit_select_constants_bigger_true_narrower_result:<br>
; CHECK: # %bb.0:<br>
; CHECK-NEXT: movl %edi, %eax<br>
-; CHECK-NEXT: andl $1, %eax<br>
+; CHECK-NEXT: andb $1, %al<br>
; CHECK-NEXT: xorb $41, %al<br>
; CHECK-NEXT: # kill: def $al killed $al killed $eax<br>
; CHECK-NEXT: retq<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/clear-lowbits.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/clear-lowbits.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/clear-lowbits.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/clear-lowbits.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/clear-lowbits.ll Thu Nov 29 12:58:26 2018<br>
@@ -866,10 +866,9 @@ define i16 @clear_lowbits16_ic0(i16 %val<br>
; X86-NOBMI2-LABEL: clear_lowbits16_ic0:<br>
; X86-NOBMI2: # %bb.0:<br>
; X86-NOBMI2-NEXT: movzwl {{[0-9]+}}(%esp), %eax<br>
-; X86-NOBMI2-NEXT: movw $16, %cx<br>
-; X86-NOBMI2-NEXT: subw {{[0-9]+}}(%esp), %cx<br>
+; X86-NOBMI2-NEXT: movb $16, %cl<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: shrl %cl, %eax<br>
-; X86-NOBMI2-NEXT: # kill: def $cl killed $cl killed $cx<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
; X86-NOBMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
; X86-NOBMI2-NEXT: retl<br>
@@ -877,8 +876,8 @@ define i16 @clear_lowbits16_ic0(i16 %val<br>
; X86-BMI2-LABEL: clear_lowbits16_ic0:<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: movzwl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI2-NEXT: movw $16, %cx<br>
-; X86-BMI2-NEXT: subw {{[0-9]+}}(%esp), %cx<br>
+; X86-BMI2-NEXT: movb $16, %cl<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
@@ -887,10 +886,9 @@ define i16 @clear_lowbits16_ic0(i16 %val<br>
; X64-NOBMI2-LABEL: clear_lowbits16_ic0:<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movzwl %di, %eax<br>
-; X64-NOBMI2-NEXT: movl $16, %ecx<br>
-; X64-NOBMI2-NEXT: subl %esi, %ecx<br>
+; X64-NOBMI2-NEXT: movb $16, %cl<br>
+; X64-NOBMI2-NEXT: subb %sil, %cl<br>
; X64-NOBMI2-NEXT: shrl %cl, %eax<br>
-; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI2-NEXT: shll %cl, %eax<br>
; X64-NOBMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
; X64-NOBMI2-NEXT: retq<br>
@@ -898,8 +896,8 @@ define i16 @clear_lowbits16_ic0(i16 %val<br>
; X64-BMI2-LABEL: clear_lowbits16_ic0:<br>
; X64-BMI2: # %bb.0:<br>
; X64-BMI2-NEXT: movzwl %di, %eax<br>
-; X64-BMI2-NEXT: movl $16, %ecx<br>
-; X64-BMI2-NEXT: subl %esi, %ecx<br>
+; X64-BMI2-NEXT: movb $16, %cl<br>
+; X64-BMI2-NEXT: subb %sil, %cl<br>
; X64-BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X64-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X64-BMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
@@ -962,10 +960,9 @@ define i16 @clear_lowbits16_ic2_load(i16<br>
; X86-NOBMI2: # %bb.0:<br>
; X86-NOBMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI2-NEXT: movzwl (%eax), %eax<br>
-; X86-NOBMI2-NEXT: movw $16, %cx<br>
-; X86-NOBMI2-NEXT: subw {{[0-9]+}}(%esp), %cx<br>
+; X86-NOBMI2-NEXT: movb $16, %cl<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: shrl %cl, %eax<br>
-; X86-NOBMI2-NEXT: # kill: def $cl killed $cl killed $cx<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
; X86-NOBMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
; X86-NOBMI2-NEXT: retl<br>
@@ -974,8 +971,8 @@ define i16 @clear_lowbits16_ic2_load(i16<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI2-NEXT: movzwl (%eax), %eax<br>
-; X86-BMI2-NEXT: movw $16, %cx<br>
-; X86-BMI2-NEXT: subw {{[0-9]+}}(%esp), %cx<br>
+; X86-BMI2-NEXT: movb $16, %cl<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
@@ -984,10 +981,9 @@ define i16 @clear_lowbits16_ic2_load(i16<br>
; X64-NOBMI2-LABEL: clear_lowbits16_ic2_load:<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movzwl (%rdi), %eax<br>
-; X64-NOBMI2-NEXT: movl $16, %ecx<br>
-; X64-NOBMI2-NEXT: subl %esi, %ecx<br>
+; X64-NOBMI2-NEXT: movb $16, %cl<br>
+; X64-NOBMI2-NEXT: subb %sil, %cl<br>
; X64-NOBMI2-NEXT: shrl %cl, %eax<br>
-; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI2-NEXT: shll %cl, %eax<br>
; X64-NOBMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
; X64-NOBMI2-NEXT: retq<br>
@@ -995,8 +991,8 @@ define i16 @clear_lowbits16_ic2_load(i16<br>
; X64-BMI2-LABEL: clear_lowbits16_ic2_load:<br>
; X64-BMI2: # %bb.0:<br>
; X64-BMI2-NEXT: movzwl (%rdi), %eax<br>
-; X64-BMI2-NEXT: movl $16, %ecx<br>
-; X64-BMI2-NEXT: subl %esi, %ecx<br>
+; X64-BMI2-NEXT: movb $16, %cl<br>
+; X64-BMI2-NEXT: subb %sil, %cl<br>
; X64-BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X64-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X64-BMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
@@ -1062,10 +1058,9 @@ define i16 @clear_lowbits16_ic4_commutat<br>
; X86-NOBMI2-LABEL: clear_lowbits16_ic4_commutative:<br>
; X86-NOBMI2: # %bb.0:<br>
; X86-NOBMI2-NEXT: movzwl {{[0-9]+}}(%esp), %eax<br>
-; X86-NOBMI2-NEXT: movw $16, %cx<br>
-; X86-NOBMI2-NEXT: subw {{[0-9]+}}(%esp), %cx<br>
+; X86-NOBMI2-NEXT: movb $16, %cl<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: shrl %cl, %eax<br>
-; X86-NOBMI2-NEXT: # kill: def $cl killed $cl killed $cx<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
; X86-NOBMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
; X86-NOBMI2-NEXT: retl<br>
@@ -1073,8 +1068,8 @@ define i16 @clear_lowbits16_ic4_commutat<br>
; X86-BMI2-LABEL: clear_lowbits16_ic4_commutative:<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: movzwl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI2-NEXT: movw $16, %cx<br>
-; X86-BMI2-NEXT: subw {{[0-9]+}}(%esp), %cx<br>
+; X86-BMI2-NEXT: movb $16, %cl<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
@@ -1083,10 +1078,9 @@ define i16 @clear_lowbits16_ic4_commutat<br>
; X64-NOBMI2-LABEL: clear_lowbits16_ic4_commutative:<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movzwl %di, %eax<br>
-; X64-NOBMI2-NEXT: movl $16, %ecx<br>
-; X64-NOBMI2-NEXT: subl %esi, %ecx<br>
+; X64-NOBMI2-NEXT: movb $16, %cl<br>
+; X64-NOBMI2-NEXT: subb %sil, %cl<br>
; X64-NOBMI2-NEXT: shrl %cl, %eax<br>
-; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI2-NEXT: shll %cl, %eax<br>
; X64-NOBMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
; X64-NOBMI2-NEXT: retq<br>
@@ -1094,8 +1088,8 @@ define i16 @clear_lowbits16_ic4_commutat<br>
; X64-BMI2-LABEL: clear_lowbits16_ic4_commutative:<br>
; X64-BMI2: # %bb.0:<br>
; X64-BMI2-NEXT: movzwl %di, %eax<br>
-; X64-BMI2-NEXT: movl $16, %ecx<br>
-; X64-BMI2-NEXT: subl %esi, %ecx<br>
+; X64-BMI2-NEXT: movb $16, %cl<br>
+; X64-BMI2-NEXT: subb %sil, %cl<br>
; X64-BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X64-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X64-BMI2-NEXT: # kill: def $ax killed $ax killed $eax<br>
@@ -1113,7 +1107,7 @@ define i32 @clear_lowbits32_ic0(i32 %val<br>
; X86-NOBMI2: # %bb.0:<br>
; X86-NOBMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI2-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: shrl %cl, %eax<br>
; X86-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1122,7 +1116,7 @@ define i32 @clear_lowbits32_ic0(i32 %val<br>
; X86-BMI2-LABEL: clear_lowbits32_ic0:<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: xorl %eax, %eax<br>
-; X86-BMI2-NEXT: subl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %al<br>
; X86-BMI2-NEXT: shrxl %eax, {{[0-9]+}}(%esp), %ecx<br>
; X86-BMI2-NEXT: shlxl %eax, %ecx, %eax<br>
; X86-BMI2-NEXT: retl<br>
@@ -1131,7 +1125,7 @@ define i32 @clear_lowbits32_ic0(i32 %val<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movl %esi, %ecx<br>
; X64-NOBMI2-NEXT: movl %edi, %eax<br>
-; X64-NOBMI2-NEXT: negl %ecx<br>
+; X64-NOBMI2-NEXT: negb %cl<br>
; X64-NOBMI2-NEXT: shrl %cl, %eax<br>
; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1139,7 +1133,7 @@ define i32 @clear_lowbits32_ic0(i32 %val<br>
;<br>
; X64-BMI2-LABEL: clear_lowbits32_ic0:<br>
; X64-BMI2: # %bb.0:<br>
-; X64-BMI2-NEXT: negl %esi<br>
+; X64-BMI2-NEXT: negb %sil<br>
; X64-BMI2-NEXT: shrxl %esi, %edi, %eax<br>
; X64-BMI2-NEXT: shlxl %esi, %eax, %eax<br>
; X64-BMI2-NEXT: retq<br>
@@ -1197,7 +1191,7 @@ define i32 @clear_lowbits32_ic2_load(i32<br>
; X86-NOBMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI2-NEXT: movl (%eax), %eax<br>
; X86-NOBMI2-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: shrl %cl, %eax<br>
; X86-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1207,7 +1201,7 @@ define i32 @clear_lowbits32_ic2_load(i32<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI2-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: shrxl %ecx, (%eax), %eax<br>
; X86-BMI2-NEXT: shlxl %ecx, %eax, %eax<br>
; X86-BMI2-NEXT: retl<br>
@@ -1216,7 +1210,7 @@ define i32 @clear_lowbits32_ic2_load(i32<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movl %esi, %ecx<br>
; X64-NOBMI2-NEXT: movl (%rdi), %eax<br>
-; X64-NOBMI2-NEXT: negl %ecx<br>
+; X64-NOBMI2-NEXT: negb %cl<br>
; X64-NOBMI2-NEXT: shrl %cl, %eax<br>
; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1224,7 +1218,7 @@ define i32 @clear_lowbits32_ic2_load(i32<br>
;<br>
; X64-BMI2-LABEL: clear_lowbits32_ic2_load:<br>
; X64-BMI2: # %bb.0:<br>
-; X64-BMI2-NEXT: negl %esi<br>
+; X64-BMI2-NEXT: negb %sil<br>
; X64-BMI2-NEXT: shrxl %esi, (%rdi), %eax<br>
; X64-BMI2-NEXT: shlxl %esi, %eax, %eax<br>
; X64-BMI2-NEXT: retq<br>
@@ -1285,7 +1279,7 @@ define i32 @clear_lowbits32_ic4_commutat<br>
; X86-NOBMI2: # %bb.0:<br>
; X86-NOBMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI2-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: shrl %cl, %eax<br>
; X86-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1294,7 +1288,7 @@ define i32 @clear_lowbits32_ic4_commutat<br>
; X86-BMI2-LABEL: clear_lowbits32_ic4_commutative:<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: xorl %eax, %eax<br>
-; X86-BMI2-NEXT: subl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %al<br>
; X86-BMI2-NEXT: shrxl %eax, {{[0-9]+}}(%esp), %ecx<br>
; X86-BMI2-NEXT: shlxl %eax, %ecx, %eax<br>
; X86-BMI2-NEXT: retl<br>
@@ -1303,7 +1297,7 @@ define i32 @clear_lowbits32_ic4_commutat<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movl %esi, %ecx<br>
; X64-NOBMI2-NEXT: movl %edi, %eax<br>
-; X64-NOBMI2-NEXT: negl %ecx<br>
+; X64-NOBMI2-NEXT: negb %cl<br>
; X64-NOBMI2-NEXT: shrl %cl, %eax<br>
; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1311,7 +1305,7 @@ define i32 @clear_lowbits32_ic4_commutat<br>
;<br>
; X64-BMI2-LABEL: clear_lowbits32_ic4_commutative:<br>
; X64-BMI2: # %bb.0:<br>
-; X64-BMI2-NEXT: negl %esi<br>
+; X64-BMI2-NEXT: negb %sil<br>
; X64-BMI2-NEXT: shrxl %esi, %edi, %eax<br>
; X64-BMI2-NEXT: shlxl %esi, %eax, %eax<br>
; X64-BMI2-NEXT: retq<br>
@@ -1326,8 +1320,8 @@ define i32 @clear_lowbits32_ic4_commutat<br>
define i64 @clear_lowbits64_ic0(i64 %val, i64 %numlowbits) nounwind {<br>
; X86-NOBMI2-LABEL: clear_lowbits64_ic0:<br>
; X86-NOBMI2: # %bb.0:<br>
-; X86-NOBMI2-NEXT: movl $64, %ecx<br>
-; X86-NOBMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI2-NEXT: movb $64, %cl<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: movl $-1, %edx<br>
; X86-NOBMI2-NEXT: movl $-1, %eax<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1344,8 +1338,8 @@ define i64 @clear_lowbits64_ic0(i64 %val<br>
;<br>
; X86-BMI2-LABEL: clear_lowbits64_ic0:<br>
; X86-BMI2: # %bb.0:<br>
-; X86-BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: movl $-1, %edx<br>
; X86-BMI2-NEXT: shlxl %ecx, %edx, %eax<br>
; X86-BMI2-NEXT: shldl %cl, %edx, %edx<br>
@@ -1363,7 +1357,7 @@ define i64 @clear_lowbits64_ic0(i64 %val<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI2-NEXT: movq %rdi, %rax<br>
-; X64-NOBMI2-NEXT: negl %ecx<br>
+; X64-NOBMI2-NEXT: negb %cl<br>
; X64-NOBMI2-NEXT: shrq %cl, %rax<br>
; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI2-NEXT: shlq %cl, %rax<br>
@@ -1371,7 +1365,7 @@ define i64 @clear_lowbits64_ic0(i64 %val<br>
;<br>
; X64-BMI2-LABEL: clear_lowbits64_ic0:<br>
; X64-BMI2: # %bb.0:<br>
-; X64-BMI2-NEXT: negl %esi<br>
+; X64-BMI2-NEXT: negb %sil<br>
; X64-BMI2-NEXT: shrxq %rsi, %rdi, %rax<br>
; X64-BMI2-NEXT: shlxq %rsi, %rax, %rax<br>
; X64-BMI2-NEXT: retq<br>
@@ -1446,8 +1440,8 @@ define i64 @clear_lowbits64_ic2_load(i64<br>
; X86-NOBMI2: # %bb.0:<br>
; X86-NOBMI2-NEXT: pushl %esi<br>
; X86-NOBMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
-; X86-NOBMI2-NEXT: movl $64, %ecx<br>
-; X86-NOBMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI2-NEXT: movb $64, %cl<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: movl $-1, %edx<br>
; X86-NOBMI2-NEXT: movl $-1, %eax<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1467,8 +1461,8 @@ define i64 @clear_lowbits64_ic2_load(i64<br>
; X86-BMI2: # %bb.0:<br>
; X86-BMI2-NEXT: pushl %esi<br>
; X86-BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
-; X86-BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: movl $-1, %edx<br>
; X86-BMI2-NEXT: shlxl %ecx, %edx, %eax<br>
; X86-BMI2-NEXT: shldl %cl, %edx, %edx<br>
@@ -1487,7 +1481,7 @@ define i64 @clear_lowbits64_ic2_load(i64<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI2-NEXT: movq (%rdi), %rax<br>
-; X64-NOBMI2-NEXT: negl %ecx<br>
+; X64-NOBMI2-NEXT: negb %cl<br>
; X64-NOBMI2-NEXT: shrq %cl, %rax<br>
; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI2-NEXT: shlq %cl, %rax<br>
@@ -1495,7 +1489,7 @@ define i64 @clear_lowbits64_ic2_load(i64<br>
;<br>
; X64-BMI2-LABEL: clear_lowbits64_ic2_load:<br>
; X64-BMI2: # %bb.0:<br>
-; X64-BMI2-NEXT: negl %esi<br>
+; X64-BMI2-NEXT: negb %sil<br>
; X64-BMI2-NEXT: shrxq %rsi, (%rdi), %rax<br>
; X64-BMI2-NEXT: shlxq %rsi, %rax, %rax<br>
; X64-BMI2-NEXT: retq<br>
@@ -1576,8 +1570,8 @@ define i64 @clear_lowbits64_ic3_load_ind<br>
define i64 @clear_lowbits64_ic4_commutative(i64 %val, i64 %numlowbits) nounwind {<br>
; X86-NOBMI2-LABEL: clear_lowbits64_ic4_commutative:<br>
; X86-NOBMI2: # %bb.0:<br>
-; X86-NOBMI2-NEXT: movl $64, %ecx<br>
-; X86-NOBMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI2-NEXT: movb $64, %cl<br>
+; X86-NOBMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI2-NEXT: movl $-1, %edx<br>
; X86-NOBMI2-NEXT: movl $-1, %eax<br>
; X86-NOBMI2-NEXT: shll %cl, %eax<br>
@@ -1594,8 +1588,8 @@ define i64 @clear_lowbits64_ic4_commutat<br>
;<br>
; X86-BMI2-LABEL: clear_lowbits64_ic4_commutative:<br>
; X86-BMI2: # %bb.0:<br>
-; X86-BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI2-NEXT: movl $-1, %edx<br>
; X86-BMI2-NEXT: shlxl %ecx, %edx, %eax<br>
; X86-BMI2-NEXT: shldl %cl, %edx, %edx<br>
@@ -1613,7 +1607,7 @@ define i64 @clear_lowbits64_ic4_commutat<br>
; X64-NOBMI2: # %bb.0:<br>
; X64-NOBMI2-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI2-NEXT: movq %rdi, %rax<br>
-; X64-NOBMI2-NEXT: negl %ecx<br>
+; X64-NOBMI2-NEXT: negb %cl<br>
; X64-NOBMI2-NEXT: shrq %cl, %rax<br>
; X64-NOBMI2-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI2-NEXT: shlq %cl, %rax<br>
@@ -1621,7 +1615,7 @@ define i64 @clear_lowbits64_ic4_commutat<br>
;<br>
; X64-BMI2-LABEL: clear_lowbits64_ic4_commutative:<br>
; X64-BMI2: # %bb.0:<br>
-; X64-BMI2-NEXT: negl %esi<br>
+; X64-BMI2-NEXT: negb %sil<br>
; X64-BMI2-NEXT: shrxq %rsi, %rdi, %rax<br>
; X64-BMI2-NEXT: shlxq %rsi, %rax, %rax<br>
; X64-BMI2-NEXT: retq<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/cmov.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/cmov.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/cmov.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/cmov.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/cmov.ll Thu Nov 29 12:58:26 2018<br>
@@ -81,7 +81,7 @@ define i1 @test4() nounwind {<br>
; CHECK-NEXT: movsbl {{.*}}(%rip), %edx<br>
; CHECK-NEXT: movzbl %dl, %ecx<br>
; CHECK-NEXT: shrl $7, %ecx<br>
-; CHECK-NEXT: xorl $1, %ecx<br>
+; CHECK-NEXT: xorb $1, %cl<br>
; CHECK-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; CHECK-NEXT: sarl %cl, %edx<br>
; CHECK-NEXT: movb {{.*}}(%rip), %al<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/extract-bits.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/extract-bits.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/extract-bits.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/extract-bits.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/extract-bits.ll Thu Nov 29 12:58:26 2018<br>
@@ -2983,7 +2983,7 @@ define i32 @bextr32_c0(i32 %val, i32 %nu<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %edi<br>
; X86-NOBMI-NEXT: shrl %cl, %edi<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -3005,7 +3005,7 @@ define i32 @bextr32_c0(i32 %val, i32 %nu<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %edi<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edi<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %esi<br>
@@ -3020,22 +3020,22 @@ define i32 @bextr32_c0(i32 %val, i32 %nu<br>
;<br>
; X86-BMI1BMI2-LABEL: bextr32_c0:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: pushl %edi<br>
+; X86-BMI1BMI2-NEXT: pushl %ebx<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: pushl %eax<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %bl<br>
; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %al<br>
-; X86-BMI1BMI2-NEXT: shrxl %eax, {{[0-9]+}}(%esp), %edi<br>
-; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X86-BMI1BMI2-NEXT: negl %eax<br>
+; X86-BMI1BMI2-NEXT: shrxl %eax, {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X86-BMI1BMI2-NEXT: negb %al<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %eax, %ecx, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
-; X86-BMI1BMI2-NEXT: bzhil %esi, %edi, %eax<br>
+; X86-BMI1BMI2-NEXT: bzhil %ebx, %esi, %eax<br>
; X86-BMI1BMI2-NEXT: addl $4, %esp<br>
; X86-BMI1BMI2-NEXT: popl %esi<br>
-; X86-BMI1BMI2-NEXT: popl %edi<br>
+; X86-BMI1BMI2-NEXT: popl %ebx<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
; X64-NOBMI-LABEL: bextr32_c0:<br>
@@ -3047,7 +3047,7 @@ define i32 @bextr32_c0(i32 %val, i32 %nu<br>
; X64-NOBMI-NEXT: movl %edi, %ebx<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebx<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl $-1, %ebp<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebp<br>
@@ -3069,7 +3069,7 @@ define i32 @bextr32_c0(i32 %val, i32 %nu<br>
; X64-BMI1NOTBM-NEXT: movl %edi, %ebx<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
@@ -3089,8 +3089,8 @@ define i32 @bextr32_c0(i32 %val, i32 %nu<br>
; X64-BMI1BMI2-NEXT: pushq %rax<br>
; X64-BMI1BMI2-NEXT: movl %edx, %ebx<br>
; X64-BMI1BMI2-NEXT: shrxl %esi, %edi, %ebp<br>
-; X64-BMI1BMI2-NEXT: movl %edx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X64-BMI1BMI2-NEXT: shrxl %eax, %ecx, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -3254,7 +3254,7 @@ define i32 @bextr32_c2_load(i32* %w, i32<br>
; X86-NOBMI-NEXT: movl (%eax), %edi<br>
; X86-NOBMI-NEXT: shrl %cl, %edi<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -3277,7 +3277,7 @@ define i32 @bextr32_c2_load(i32* %w, i32<br>
; X86-BMI1NOTBM-NEXT: movl (%eax), %edi<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edi<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %esi<br>
@@ -3292,23 +3292,23 @@ define i32 @bextr32_c2_load(i32* %w, i32<br>
;<br>
; X86-BMI1BMI2-LABEL: bextr32_c2_load:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: pushl %edi<br>
+; X86-BMI1BMI2-NEXT: pushl %ebx<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: pushl %eax<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %bl<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
-; X86-BMI1BMI2-NEXT: shrxl %ecx, (%eax), %edi<br>
-; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X86-BMI1BMI2-NEXT: negl %eax<br>
+; X86-BMI1BMI2-NEXT: shrxl %ecx, (%eax), %esi<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X86-BMI1BMI2-NEXT: negb %al<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %eax, %ecx, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
-; X86-BMI1BMI2-NEXT: bzhil %esi, %edi, %eax<br>
+; X86-BMI1BMI2-NEXT: bzhil %ebx, %esi, %eax<br>
; X86-BMI1BMI2-NEXT: addl $4, %esp<br>
; X86-BMI1BMI2-NEXT: popl %esi<br>
-; X86-BMI1BMI2-NEXT: popl %edi<br>
+; X86-BMI1BMI2-NEXT: popl %ebx<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
; X64-NOBMI-LABEL: bextr32_c2_load:<br>
@@ -3320,7 +3320,7 @@ define i32 @bextr32_c2_load(i32* %w, i32<br>
; X64-NOBMI-NEXT: movl (%rdi), %ebp<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebp<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl $-1, %ebx<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebx<br>
@@ -3342,7 +3342,7 @@ define i32 @bextr32_c2_load(i32* %w, i32<br>
; X64-BMI1NOTBM-NEXT: movl (%rdi), %ebp<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
@@ -3362,8 +3362,8 @@ define i32 @bextr32_c2_load(i32* %w, i32<br>
; X64-BMI1BMI2-NEXT: pushq %rax<br>
; X64-BMI1BMI2-NEXT: movl %edx, %ebx<br>
; X64-BMI1BMI2-NEXT: shrxl %esi, (%rdi), %ebp<br>
-; X64-BMI1BMI2-NEXT: movl %edx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X64-BMI1BMI2-NEXT: shrxl %eax, %ecx, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -3531,7 +3531,7 @@ define i32 @bextr32_c4_commutative(i32 %<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %edi<br>
; X86-NOBMI-NEXT: shrl %cl, %edi<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -3553,7 +3553,7 @@ define i32 @bextr32_c4_commutative(i32 %<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %edi<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edi<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %esi<br>
@@ -3568,22 +3568,22 @@ define i32 @bextr32_c4_commutative(i32 %<br>
;<br>
; X86-BMI1BMI2-LABEL: bextr32_c4_commutative:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: pushl %edi<br>
+; X86-BMI1BMI2-NEXT: pushl %ebx<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: pushl %eax<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %bl<br>
; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %al<br>
-; X86-BMI1BMI2-NEXT: shrxl %eax, {{[0-9]+}}(%esp), %edi<br>
-; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X86-BMI1BMI2-NEXT: negl %eax<br>
+; X86-BMI1BMI2-NEXT: shrxl %eax, {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X86-BMI1BMI2-NEXT: negb %al<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %eax, %ecx, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
-; X86-BMI1BMI2-NEXT: bzhil %esi, %edi, %eax<br>
+; X86-BMI1BMI2-NEXT: bzhil %ebx, %esi, %eax<br>
; X86-BMI1BMI2-NEXT: addl $4, %esp<br>
; X86-BMI1BMI2-NEXT: popl %esi<br>
-; X86-BMI1BMI2-NEXT: popl %edi<br>
+; X86-BMI1BMI2-NEXT: popl %ebx<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
; X64-NOBMI-LABEL: bextr32_c4_commutative:<br>
@@ -3595,7 +3595,7 @@ define i32 @bextr32_c4_commutative(i32 %<br>
; X64-NOBMI-NEXT: movl %edi, %ebx<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebx<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl $-1, %ebp<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebp<br>
@@ -3617,7 +3617,7 @@ define i32 @bextr32_c4_commutative(i32 %<br>
; X64-BMI1NOTBM-NEXT: movl %edi, %ebx<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
@@ -3637,8 +3637,8 @@ define i32 @bextr32_c4_commutative(i32 %<br>
; X64-BMI1BMI2-NEXT: pushq %rax<br>
; X64-BMI1BMI2-NEXT: movl %edx, %ebx<br>
; X64-BMI1BMI2-NEXT: shrxl %esi, %edi, %ebp<br>
-; X64-BMI1BMI2-NEXT: movl %edx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X64-BMI1BMI2-NEXT: shrxl %eax, %ecx, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -3667,7 +3667,7 @@ define i32 @bextr32_c5_skipextrauses(i32<br>
; X86-NOBMI-NEXT: movl %ebx, %ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %edi<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -3694,7 +3694,7 @@ define i32 @bextr32_c5_skipextrauses(i32<br>
; X86-BMI1NOTBM-NEXT: movl %ebx, %ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edi<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %esi<br>
@@ -3716,16 +3716,16 @@ define i32 @bextr32_c5_skipextrauses(i32<br>
; X86-BMI1BMI2-NEXT: pushl %edi<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: subl $16, %esp<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %bl<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %edi<br>
-; X86-BMI1BMI2-NEXT: shrxl %edi, {{[0-9]+}}(%esp), %ebx<br>
-; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X86-BMI1BMI2-NEXT: negl %eax<br>
+; X86-BMI1BMI2-NEXT: shrxl %edi, {{[0-9]+}}(%esp), %esi<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X86-BMI1BMI2-NEXT: negb %al<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %eax, %ecx, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
-; X86-BMI1BMI2-NEXT: bzhil %esi, %ebx, %esi<br>
+; X86-BMI1BMI2-NEXT: bzhil %ebx, %esi, %esi<br>
; X86-BMI1BMI2-NEXT: movl %edi, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
@@ -3744,7 +3744,7 @@ define i32 @bextr32_c5_skipextrauses(i32<br>
; X64-NOBMI-NEXT: movl %edi, %ebp<br>
; X64-NOBMI-NEXT: movl %r14d, %ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebp<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl $-1, %ebx<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebx<br>
@@ -3768,7 +3768,7 @@ define i32 @bextr32_c5_skipextrauses(i32<br>
; X64-BMI1NOTBM-NEXT: movl %edi, %ebp<br>
; X64-BMI1NOTBM-NEXT: movl %r14d, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
@@ -3791,8 +3791,8 @@ define i32 @bextr32_c5_skipextrauses(i32<br>
; X64-BMI1BMI2-NEXT: movl %edx, %ebx<br>
; X64-BMI1BMI2-NEXT: movl %esi, %ebp<br>
; X64-BMI1BMI2-NEXT: shrxl %esi, %edi, %r14d<br>
-; X64-BMI1BMI2-NEXT: movl %edx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X64-BMI1BMI2-NEXT: shrxl %eax, %ecx, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -3835,8 +3835,8 @@ define i64 @bextr64_c0(i64 %val, i64 %nu<br>
; X86-NOBMI-NEXT: movl %edi, %esi<br>
; X86-NOBMI-NEXT: xorl %edi, %edi<br>
; X86-NOBMI-NEXT: .LBB32_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %ebp<br>
; X86-NOBMI-NEXT: movl $-1, %ebx<br>
; X86-NOBMI-NEXT: shrl %cl, %ebx<br>
@@ -3882,8 +3882,8 @@ define i64 @bextr64_c0(i64 %val, i64 %nu<br>
; X86-BMI1NOTBM-NEXT: movl %edi, %esi<br>
; X86-BMI1NOTBM-NEXT: xorl %edi, %edi<br>
; X86-BMI1NOTBM-NEXT: .LBB32_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
@@ -3928,8 +3928,8 @@ define i64 @bextr64_c0(i64 %val, i64 %nu<br>
; X86-BMI1BMI2-NEXT: movl %edi, %esi<br>
; X86-BMI1BMI2-NEXT: xorl %edi, %edi<br>
; X86-BMI1BMI2-NEXT: .LBB32_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ebx<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %ebx, %ebp<br>
; X86-BMI1BMI2-NEXT: shrdl %cl, %ebx, %ebx<br>
@@ -3964,7 +3964,7 @@ define i64 @bextr64_c0(i64 %val, i64 %nu<br>
; X64-NOBMI-NEXT: movq %rdi, %r14<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %r14<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movq $-1, %rbx<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
@@ -3986,7 +3986,7 @@ define i64 @bextr64_c0(i64 %val, i64 %nu<br>
; X64-BMI1NOTBM-NEXT: movq %rdi, %r14<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %r14<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rbx<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rbx<br>
@@ -4007,7 +4007,7 @@ define i64 @bextr64_c0(i64 %val, i64 %nu<br>
; X64-BMI1BMI2-NEXT: movq %rdx, %rbx<br>
; X64-BMI1BMI2-NEXT: shrxq %rsi, %rdi, %r14<br>
; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rcx<br>
; X64-BMI1BMI2-NEXT: shrxq %rax, %rcx, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br>
@@ -4257,8 +4257,8 @@ define i64 @bextr64_c2_load(i64* %w, i64<br>
; X86-NOBMI-NEXT: movl %edi, %esi<br>
; X86-NOBMI-NEXT: xorl %edi, %edi<br>
; X86-NOBMI-NEXT: .LBB34_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %ebp<br>
; X86-NOBMI-NEXT: movl $-1, %ebx<br>
; X86-NOBMI-NEXT: shrl %cl, %ebx<br>
@@ -4305,8 +4305,8 @@ define i64 @bextr64_c2_load(i64* %w, i64<br>
; X86-BMI1NOTBM-NEXT: movl %edi, %esi<br>
; X86-BMI1NOTBM-NEXT: xorl %edi, %edi<br>
; X86-BMI1NOTBM-NEXT: .LBB34_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
@@ -4352,8 +4352,8 @@ define i64 @bextr64_c2_load(i64* %w, i64<br>
; X86-BMI1BMI2-NEXT: movl %edi, %esi<br>
; X86-BMI1BMI2-NEXT: xorl %edi, %edi<br>
; X86-BMI1BMI2-NEXT: .LBB34_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ebx<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %ebx, %ebp<br>
; X86-BMI1BMI2-NEXT: shrdl %cl, %ebx, %ebx<br>
@@ -4388,7 +4388,7 @@ define i64 @bextr64_c2_load(i64* %w, i64<br>
; X64-NOBMI-NEXT: movq (%rdi), %r14<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %r14<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movq $-1, %rbx<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
@@ -4410,7 +4410,7 @@ define i64 @bextr64_c2_load(i64* %w, i64<br>
; X64-BMI1NOTBM-NEXT: movq (%rdi), %r14<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %r14<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rbx<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rbx<br>
@@ -4431,7 +4431,7 @@ define i64 @bextr64_c2_load(i64* %w, i64<br>
; X64-BMI1BMI2-NEXT: movq %rdx, %rbx<br>
; X64-BMI1BMI2-NEXT: shrxq %rsi, (%rdi), %r14<br>
; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rcx<br>
; X64-BMI1BMI2-NEXT: shrxq %rax, %rcx, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br>
@@ -4685,8 +4685,8 @@ define i64 @bextr64_c4_commutative(i64 %<br>
; X86-NOBMI-NEXT: movl %edi, %esi<br>
; X86-NOBMI-NEXT: xorl %edi, %edi<br>
; X86-NOBMI-NEXT: .LBB36_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %ebp<br>
; X86-NOBMI-NEXT: movl $-1, %ebx<br>
; X86-NOBMI-NEXT: shrl %cl, %ebx<br>
@@ -4732,8 +4732,8 @@ define i64 @bextr64_c4_commutative(i64 %<br>
; X86-BMI1NOTBM-NEXT: movl %edi, %esi<br>
; X86-BMI1NOTBM-NEXT: xorl %edi, %edi<br>
; X86-BMI1NOTBM-NEXT: .LBB36_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
@@ -4778,8 +4778,8 @@ define i64 @bextr64_c4_commutative(i64 %<br>
; X86-BMI1BMI2-NEXT: movl %edi, %esi<br>
; X86-BMI1BMI2-NEXT: xorl %edi, %edi<br>
; X86-BMI1BMI2-NEXT: .LBB36_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ebx<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %ebx, %ebp<br>
; X86-BMI1BMI2-NEXT: shrdl %cl, %ebx, %ebx<br>
@@ -4814,7 +4814,7 @@ define i64 @bextr64_c4_commutative(i64 %<br>
; X64-NOBMI-NEXT: movq %rdi, %r14<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %r14<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movq $-1, %rbx<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
@@ -4836,7 +4836,7 @@ define i64 @bextr64_c4_commutative(i64 %<br>
; X64-BMI1NOTBM-NEXT: movq %rdi, %r14<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %r14<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rbx<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rbx<br>
@@ -4857,7 +4857,7 @@ define i64 @bextr64_c4_commutative(i64 %<br>
; X64-BMI1BMI2-NEXT: movq %rdx, %rbx<br>
; X64-BMI1BMI2-NEXT: shrxq %rsi, %rdi, %r14<br>
; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rcx<br>
; X64-BMI1BMI2-NEXT: shrxq %rax, %rcx, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br>
@@ -4894,8 +4894,8 @@ define i64 @bextr64_c5_skipextrauses(i64<br>
; X86-NOBMI-NEXT: movl %edi, %esi<br>
; X86-NOBMI-NEXT: xorl %edi, %edi<br>
; X86-NOBMI-NEXT: .LBB37_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %ebx<br>
; X86-NOBMI-NEXT: movl $-1, %ebp<br>
; X86-NOBMI-NEXT: shrl %cl, %ebp<br>
@@ -4946,8 +4946,8 @@ define i64 @bextr64_c5_skipextrauses(i64<br>
; X86-BMI1NOTBM-NEXT: movl %edi, %esi<br>
; X86-BMI1NOTBM-NEXT: xorl %edi, %edi<br>
; X86-BMI1NOTBM-NEXT: .LBB37_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
@@ -4997,8 +4997,8 @@ define i64 @bextr64_c5_skipextrauses(i64<br>
; X86-BMI1BMI2-NEXT: movl %edi, %esi<br>
; X86-BMI1BMI2-NEXT: xorl %edi, %edi<br>
; X86-BMI1BMI2-NEXT: .LBB37_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ebp<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %ebp, %ebx<br>
; X86-BMI1BMI2-NEXT: shrdl %cl, %ebp, %ebp<br>
@@ -5038,7 +5038,7 @@ define i64 @bextr64_c5_skipextrauses(i64<br>
; X64-NOBMI-NEXT: movq %rdi, %r15<br>
; X64-NOBMI-NEXT: movl %r14d, %ecx<br>
; X64-NOBMI-NEXT: shrq %cl, %r15<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movq $-1, %rbx<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
@@ -5062,7 +5062,7 @@ define i64 @bextr64_c5_skipextrauses(i64<br>
; X64-BMI1NOTBM-NEXT: movq %rdi, %r15<br>
; X64-BMI1NOTBM-NEXT: movl %r14d, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %r15<br>
-; X64-BMI1NOTBM-NEXT: negl %edx<br>
+; X64-BMI1NOTBM-NEXT: negb %dl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rbx<br>
; X64-BMI1NOTBM-NEXT: movl %edx, %ecx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rbx<br>
@@ -5086,7 +5086,7 @@ define i64 @bextr64_c5_skipextrauses(i64<br>
; X64-BMI1BMI2-NEXT: movq %rsi, %r14<br>
; X64-BMI1BMI2-NEXT: shrxq %rsi, %rdi, %r15<br>
; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rcx<br>
; X64-BMI1BMI2-NEXT: shrxq %rax, %rcx, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br>
@@ -5118,7 +5118,7 @@ define i32 @bextr32_d0(i32 %val, i32 %nu<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI-NEXT: shrl %cl, %eax<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shll %cl, %eax<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -5126,16 +5126,16 @@ define i32 @bextr32_d0(i32 %val, i32 %nu<br>
;<br>
; X86-BMI1NOTBM-LABEL: bextr32_d0:<br>
; X86-BMI1NOTBM: # %bb.0:<br>
-; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1NOTBM-NEXT: movzbl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb {{[0-9]+}}(%esp), %al<br>
; X86-BMI1NOTBM-NEXT: shll $8, %eax<br>
-; X86-BMI1NOTBM-NEXT: orl %ecx, %eax<br>
-; X86-BMI1NOTBM-NEXT: bextrl %eax, {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1NOTBM-NEXT: movzbl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: orl %eax, %ecx<br>
+; X86-BMI1NOTBM-NEXT: bextrl %ecx, {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1NOTBM-NEXT: retl<br>
;<br>
; X86-BMI1BMI2-LABEL: bextr32_d0:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %al<br>
; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, {{[0-9]+}}(%esp), %ecx<br>
; X86-BMI1BMI2-NEXT: bzhil %eax, %ecx, %eax<br>
@@ -5147,7 +5147,7 @@ define i32 @bextr32_d0(i32 %val, i32 %nu<br>
; X64-NOBMI-NEXT: movl %edi, %eax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shll %cl, %eax<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -5245,7 +5245,7 @@ define i32 @bextr32_d2_load(i32* %w, i32<br>
; X86-NOBMI-NEXT: movl (%eax), %eax<br>
; X86-NOBMI-NEXT: shrl %cl, %eax<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shll %cl, %eax<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -5254,16 +5254,16 @@ define i32 @bextr32_d2_load(i32* %w, i32<br>
; X86-BMI1NOTBM-LABEL: bextr32_d2_load:<br>
; X86-BMI1NOTBM: # %bb.0:<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
-; X86-BMI1NOTBM-NEXT: movzbl {{[0-9]+}}(%esp), %edx<br>
+; X86-BMI1NOTBM-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: shll $8, %ecx<br>
-; X86-BMI1NOTBM-NEXT: orl %edx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: bextrl %ecx, (%eax), %eax<br>
+; X86-BMI1NOTBM-NEXT: movzbl {{[0-9]+}}(%esp), %edx<br>
+; X86-BMI1NOTBM-NEXT: orl %ecx, %edx<br>
+; X86-BMI1NOTBM-NEXT: bextrl %edx, (%eax), %eax<br>
; X86-BMI1NOTBM-NEXT: retl<br>
;<br>
; X86-BMI1BMI2-LABEL: bextr32_d2_load:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %al<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %dl<br>
; X86-BMI1BMI2-NEXT: shrxl %edx, (%ecx), %ecx<br>
@@ -5276,7 +5276,7 @@ define i32 @bextr32_d2_load(i32* %w, i32<br>
; X64-NOBMI-NEXT: movl (%rdi), %eax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shll %cl, %eax<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -5381,7 +5381,7 @@ define i32 @bextr32_d5_skipextrauses(i32<br>
; X86-NOBMI-NEXT: movl %eax, %ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shll %cl, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -5396,7 +5396,7 @@ define i32 @bextr32_d5_skipextrauses(i32<br>
; X86-BMI1NOTBM: # %bb.0:<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
; X86-BMI1NOTBM-NEXT: subl $8, %esp<br>
-; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1NOTBM-NEXT: shll $8, %ecx<br>
; X86-BMI1NOTBM-NEXT: movzbl %al, %edx<br>
@@ -5413,7 +5413,7 @@ define i32 @bextr32_d5_skipextrauses(i32<br>
; X86-BMI1BMI2: # %bb.0:<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: subl $8, %esp<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %al<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, {{[0-9]+}}(%esp), %edx<br>
; X86-BMI1BMI2-NEXT: bzhil %eax, %edx, %esi<br>
@@ -5430,7 +5430,7 @@ define i32 @bextr32_d5_skipextrauses(i32<br>
; X64-NOBMI-NEXT: movl %edi, %ebx<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebx<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shll %cl, %ebx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebx<br>
@@ -5492,8 +5492,8 @@ define i64 @bextr64_d0(i64 %val, i64 %nu<br>
; X86-NOBMI-NEXT: movl %eax, %edi<br>
; X86-NOBMI-NEXT: xorl %eax, %eax<br>
; X86-NOBMI-NEXT: .LBB43_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shldl %cl, %edi, %eax<br>
; X86-NOBMI-NEXT: shll %cl, %edi<br>
; X86-NOBMI-NEXT: testb $32, %cl<br>
@@ -5540,8 +5540,8 @@ define i64 @bextr64_d0(i64 %val, i64 %nu<br>
; X86-BMI1NOTBM-NEXT: movl %eax, %edi<br>
; X86-BMI1NOTBM-NEXT: xorl %eax, %eax<br>
; X86-BMI1NOTBM-NEXT: .LBB43_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: shldl %cl, %edi, %eax<br>
; X86-BMI1NOTBM-NEXT: shll %cl, %edi<br>
; X86-BMI1NOTBM-NEXT: testb $32, %cl<br>
@@ -5586,8 +5586,8 @@ define i64 @bextr64_d0(i64 %val, i64 %nu<br>
; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
; X86-BMI1BMI2-NEXT: xorl %esi, %esi<br>
; X86-BMI1BMI2-NEXT: .LBB43_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: shldl %cl, %eax, %esi<br>
; X86-BMI1BMI2-NEXT: shlxl %ecx, %eax, %edi<br>
; X86-BMI1BMI2-NEXT: testb $32, %cl<br>
@@ -5617,7 +5617,7 @@ define i64 @bextr64_d0(i64 %val, i64 %nu<br>
; X64-NOBMI-NEXT: movq %rdi, %rax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %rax<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shlq %cl, %rax<br>
; X64-NOBMI-NEXT: shrq %cl, %rax<br>
@@ -5838,8 +5838,8 @@ define i64 @bextr64_d2_load(i64* %w, i64<br>
; X86-NOBMI-NEXT: movl %eax, %edi<br>
; X86-NOBMI-NEXT: xorl %eax, %eax<br>
; X86-NOBMI-NEXT: .LBB45_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shldl %cl, %edi, %eax<br>
; X86-NOBMI-NEXT: shll %cl, %edi<br>
; X86-NOBMI-NEXT: testb $32, %cl<br>
@@ -5887,8 +5887,8 @@ define i64 @bextr64_d2_load(i64* %w, i64<br>
; X86-BMI1NOTBM-NEXT: movl %eax, %edi<br>
; X86-BMI1NOTBM-NEXT: xorl %eax, %eax<br>
; X86-BMI1NOTBM-NEXT: .LBB45_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: shldl %cl, %edi, %eax<br>
; X86-BMI1NOTBM-NEXT: shll %cl, %edi<br>
; X86-BMI1NOTBM-NEXT: testb $32, %cl<br>
@@ -5934,8 +5934,8 @@ define i64 @bextr64_d2_load(i64* %w, i64<br>
; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
; X86-BMI1BMI2-NEXT: xorl %esi, %esi<br>
; X86-BMI1BMI2-NEXT: .LBB45_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: shldl %cl, %eax, %esi<br>
; X86-BMI1BMI2-NEXT: shlxl %ecx, %eax, %edi<br>
; X86-BMI1BMI2-NEXT: testb $32, %cl<br>
@@ -5965,7 +5965,7 @@ define i64 @bextr64_d2_load(i64* %w, i64<br>
; X64-NOBMI-NEXT: movq (%rdi), %rax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %rax<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shlq %cl, %rax<br>
; X64-NOBMI-NEXT: shrq %cl, %rax<br>
@@ -6193,8 +6193,8 @@ define i64 @bextr64_d5_skipextrauses(i64<br>
; X86-NOBMI-NEXT: movl %esi, %ebx<br>
; X86-NOBMI-NEXT: xorl %esi, %esi<br>
; X86-NOBMI-NEXT: .LBB47_2:<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shldl %cl, %ebx, %esi<br>
; X86-NOBMI-NEXT: shll %cl, %ebx<br>
; X86-NOBMI-NEXT: testb $32, %cl<br>
@@ -6254,8 +6254,8 @@ define i64 @bextr64_d5_skipextrauses(i64<br>
; X86-BMI1NOTBM-NEXT: movl %esi, %ebx<br>
; X86-BMI1NOTBM-NEXT: xorl %esi, %esi<br>
; X86-BMI1NOTBM-NEXT: .LBB47_2:<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: shldl %cl, %ebx, %esi<br>
; X86-BMI1NOTBM-NEXT: shll %cl, %ebx<br>
; X86-BMI1NOTBM-NEXT: testb $32, %cl<br>
@@ -6312,8 +6312,8 @@ define i64 @bextr64_d5_skipextrauses(i64<br>
; X86-BMI1BMI2-NEXT: movl %edx, %edi<br>
; X86-BMI1BMI2-NEXT: xorl %edx, %edx<br>
; X86-BMI1BMI2-NEXT: .LBB47_2:<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: shldl %cl, %edi, %edx<br>
; X86-BMI1BMI2-NEXT: shlxl %ecx, %edi, %ebx<br>
; X86-BMI1BMI2-NEXT: testb $32, %cl<br>
@@ -6352,7 +6352,7 @@ define i64 @bextr64_d5_skipextrauses(i64<br>
; X64-NOBMI-NEXT: movq %rdi, %rbx<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
-; X64-NOBMI-NEXT: negl %edx<br>
+; X64-NOBMI-NEXT: negb %dl<br>
; X64-NOBMI-NEXT: movl %edx, %ecx<br>
; X64-NOBMI-NEXT: shlq %cl, %rbx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
<br>
Modified: llvm/trunk/test/CodeGen/X86/extract-lowbits.ll<br>
URL: <a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/extract-lowbits.ll?rev=347917&r1=347916&r2=347917&view=diff" rel="noreferrer" target="_blank" class="gmail-cremed cremed">http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/extract-lowbits.ll?rev=347917&r1=347916&r2=347917&view=diff</a><br>
==============================================================================<br>
--- llvm/trunk/test/CodeGen/X86/extract-lowbits.ll (original)<br>
+++ llvm/trunk/test/CodeGen/X86/extract-lowbits.ll Thu Nov 29 12:58:26 2018<br>
@@ -1436,7 +1436,7 @@ define i32 @bzhi32_c0(i32 %val, i32 %num<br>
; X86-NOBMI-NEXT: pushl %esi<br>
; X86-NOBMI-NEXT: subl $8, %esp<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -1453,7 +1453,7 @@ define i32 @bzhi32_c0(i32 %val, i32 %num<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
; X86-BMI1NOTBM-NEXT: subl $8, %esp<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %esi<br>
@@ -1467,18 +1467,18 @@ define i32 @bzhi32_c0(i32 %val, i32 %num<br>
;<br>
; X86-BMI1BMI2-LABEL: bzhi32_c0:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: pushl %esi<br>
+; X86-BMI1BMI2-NEXT: pushl %ebx<br>
; X86-BMI1BMI2-NEXT: subl $8, %esp<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
-; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X86-BMI1BMI2-NEXT: negl %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %bl<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X86-BMI1BMI2-NEXT: negb %al<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %eax, %ecx, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
-; X86-BMI1BMI2-NEXT: bzhil %esi, {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1BMI2-NEXT: bzhil %ebx, {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1BMI2-NEXT: addl $8, %esp<br>
-; X86-BMI1BMI2-NEXT: popl %esi<br>
+; X86-BMI1BMI2-NEXT: popl %ebx<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
; X64-NOBMI-LABEL: bzhi32_c0:<br>
@@ -1488,7 +1488,7 @@ define i32 @bzhi32_c0(i32 %val, i32 %num<br>
; X64-NOBMI-NEXT: pushq %rax<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
; X64-NOBMI-NEXT: movl %edi, %ebx<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: movl $-1, %ebp<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebp<br>
@@ -1508,7 +1508,7 @@ define i32 @bzhi32_c0(i32 %val, i32 %num<br>
; X64-BMI1NOTBM-NEXT: pushq %rax<br>
; X64-BMI1NOTBM-NEXT: movl %esi, %ecx<br>
; X64-BMI1NOTBM-NEXT: movl %edi, %ebx<br>
-; X64-BMI1NOTBM-NEXT: negl %ecx<br>
+; X64-BMI1NOTBM-NEXT: negb %cl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
@@ -1528,8 +1528,8 @@ define i32 @bzhi32_c0(i32 %val, i32 %num<br>
; X64-BMI1BMI2-NEXT: pushq %rax<br>
; X64-BMI1BMI2-NEXT: movl %esi, %ebx<br>
; X64-BMI1BMI2-NEXT: movl %edi, %ebp<br>
-; X64-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X64-BMI1BMI2-NEXT: shrxl %eax, %ecx, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -1668,7 +1668,7 @@ define i32 @bzhi32_c2_load(i32* %w, i32<br>
; X86-NOBMI-NEXT: subl $8, %esp<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %edx<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %edx<br>
@@ -1687,7 +1687,7 @@ define i32 @bzhi32_c2_load(i32* %w, i32<br>
; X86-BMI1NOTBM-NEXT: subl $8, %esp<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %edx<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edx<br>
@@ -1705,9 +1705,10 @@ define i32 @bzhi32_c2_load(i32* %w, i32<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: subl $8, %esp<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: bzhil %ecx, (%eax), %esi<br>
-; X86-BMI1BMI2-NEXT: negl %ecx<br>
+; X86-BMI1BMI2-NEXT: # kill: def $cl killed $cl killed $ecx def $ecx<br>
+; X86-BMI1BMI2-NEXT: negb %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %eax<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %eax, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
@@ -1721,7 +1722,7 @@ define i32 @bzhi32_c2_load(i32* %w, i32<br>
; X64-NOBMI: # %bb.0:<br>
; X64-NOBMI-NEXT: pushq %rbx<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: movl $-1, %eax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -1737,7 +1738,7 @@ define i32 @bzhi32_c2_load(i32* %w, i32<br>
; X64-BMI1NOTBM: # %bb.0:<br>
; X64-BMI1NOTBM-NEXT: pushq %rbx<br>
; X64-BMI1NOTBM-NEXT: movl %esi, %ecx<br>
-; X64-BMI1NOTBM-NEXT: negl %ecx<br>
+; X64-BMI1NOTBM-NEXT: negb %cl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %eax<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %eax<br>
@@ -1753,7 +1754,8 @@ define i32 @bzhi32_c2_load(i32* %w, i32<br>
; X64-BMI1BMI2: # %bb.0:<br>
; X64-BMI1BMI2-NEXT: pushq %rbx<br>
; X64-BMI1BMI2-NEXT: bzhil %esi, (%rdi), %ebx<br>
-; X64-BMI1BMI2-NEXT: negl %esi<br>
+; X64-BMI1BMI2-NEXT: # kill: def $sil killed $sil killed $esi def $esi<br>
+; X64-BMI1BMI2-NEXT: negb %sil<br>
; X64-BMI1BMI2-NEXT: movl $-1, %eax<br>
; X64-BMI1BMI2-NEXT: shrxl %esi, %eax, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -1884,7 +1886,7 @@ define i32 @bzhi32_c4_commutative(i32 %v<br>
; X86-NOBMI-NEXT: pushl %esi<br>
; X86-NOBMI-NEXT: subl $8, %esp<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %esi<br>
@@ -1901,7 +1903,7 @@ define i32 @bzhi32_c4_commutative(i32 %v<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
; X86-BMI1NOTBM-NEXT: subl $8, %esp<br>
; X86-BMI1NOTBM-NEXT: xorl %ecx, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %esi<br>
@@ -1915,18 +1917,18 @@ define i32 @bzhi32_c4_commutative(i32 %v<br>
;<br>
; X86-BMI1BMI2-LABEL: bzhi32_c4_commutative:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: pushl %esi<br>
+; X86-BMI1BMI2-NEXT: pushl %ebx<br>
; X86-BMI1BMI2-NEXT: subl $8, %esp<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
-; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X86-BMI1BMI2-NEXT: negl %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %bl<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X86-BMI1BMI2-NEXT: negb %al<br>
; X86-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X86-BMI1BMI2-NEXT: shrxl %eax, %ecx, %eax<br>
; X86-BMI1BMI2-NEXT: movl %eax, (%esp)<br>
; X86-BMI1BMI2-NEXT: calll use32<br>
-; X86-BMI1BMI2-NEXT: bzhil %esi, {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1BMI2-NEXT: bzhil %ebx, {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1BMI2-NEXT: addl $8, %esp<br>
-; X86-BMI1BMI2-NEXT: popl %esi<br>
+; X86-BMI1BMI2-NEXT: popl %ebx<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
; X64-NOBMI-LABEL: bzhi32_c4_commutative:<br>
@@ -1936,7 +1938,7 @@ define i32 @bzhi32_c4_commutative(i32 %v<br>
; X64-NOBMI-NEXT: pushq %rax<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
; X64-NOBMI-NEXT: movl %edi, %ebx<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: movl $-1, %ebp<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %ebp<br>
@@ -1956,7 +1958,7 @@ define i32 @bzhi32_c4_commutative(i32 %v<br>
; X64-BMI1NOTBM-NEXT: pushq %rax<br>
; X64-BMI1NOTBM-NEXT: movl %esi, %ecx<br>
; X64-BMI1NOTBM-NEXT: movl %edi, %ebx<br>
-; X64-BMI1NOTBM-NEXT: negl %ecx<br>
+; X64-BMI1NOTBM-NEXT: negb %cl<br>
; X64-BMI1NOTBM-NEXT: movl $-1, %ebp<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-BMI1NOTBM-NEXT: shrl %cl, %ebp<br>
@@ -1976,8 +1978,8 @@ define i32 @bzhi32_c4_commutative(i32 %v<br>
; X64-BMI1BMI2-NEXT: pushq %rax<br>
; X64-BMI1BMI2-NEXT: movl %esi, %ebx<br>
; X64-BMI1BMI2-NEXT: movl %edi, %ebp<br>
-; X64-BMI1BMI2-NEXT: movl %esi, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movl $-1, %ecx<br>
; X64-BMI1BMI2-NEXT: shrxl %eax, %ecx, %edi<br>
; X64-BMI1BMI2-NEXT: callq use32<br>
@@ -2003,8 +2005,8 @@ define i64 @bzhi64_c0(i64 %val, i64 %num<br>
; X86-NOBMI-NEXT: pushl %edi<br>
; X86-NOBMI-NEXT: pushl %esi<br>
; X86-NOBMI-NEXT: pushl %eax<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: movl $-1, %edi<br>
; X86-NOBMI-NEXT: shrl %cl, %edi<br>
@@ -2034,8 +2036,8 @@ define i64 @bzhi64_c0(i64 %val, i64 %num<br>
; X86-BMI1NOTBM-NEXT: pushl %edi<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
; X86-BMI1NOTBM-NEXT: pushl %eax<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %edi<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edi<br>
@@ -2065,8 +2067,8 @@ define i64 @bzhi64_c0(i64 %val, i64 %num<br>
; X86-BMI1BMI2-NEXT: pushl %edi<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: pushl %eax<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %esi<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %esi, %edi<br>
; X86-BMI1BMI2-NEXT: shrdl %cl, %esi, %esi<br>
@@ -2097,7 +2099,7 @@ define i64 @bzhi64_c0(i64 %val, i64 %num<br>
; X64-NOBMI-NEXT: pushq %rax<br>
; X64-NOBMI-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI-NEXT: movq %rdi, %r14<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: movq $-1, %rbx<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
@@ -2117,7 +2119,7 @@ define i64 @bzhi64_c0(i64 %val, i64 %num<br>
; X64-BMI1NOTBM-NEXT: pushq %rax<br>
; X64-BMI1NOTBM-NEXT: movq %rsi, %rcx<br>
; X64-BMI1NOTBM-NEXT: movq %rdi, %r14<br>
-; X64-BMI1NOTBM-NEXT: negl %ecx<br>
+; X64-BMI1NOTBM-NEXT: negb %cl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rbx<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rbx<br>
@@ -2138,7 +2140,7 @@ define i64 @bzhi64_c0(i64 %val, i64 %num<br>
; X64-BMI1BMI2-NEXT: movq %rsi, %rbx<br>
; X64-BMI1BMI2-NEXT: movq %rdi, %r14<br>
; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rcx<br>
; X64-BMI1BMI2-NEXT: shrxq %rax, %rcx, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br></blockquote></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
@@ -2318,26 +2320,26 @@ define i64 @bzhi64_c2_load(i64* %w, i64<br>
; X86-NOBMI-NEXT: pushl %ebx<br>
; X86-NOBMI-NEXT: pushl %edi<br>
; X86-NOBMI-NEXT: pushl %esi<br>
-; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
-; X86-NOBMI-NEXT: movl $-1, %edx<br>
+; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %edx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
+; X86-NOBMI-NEXT: movl $-1, %eax<br>
; X86-NOBMI-NEXT: movl $-1, %ebx<br>
; X86-NOBMI-NEXT: shrl %cl, %ebx<br>
-; X86-NOBMI-NEXT: shrdl %cl, %edx, %edx<br>
+; X86-NOBMI-NEXT: shrdl %cl, %eax, %eax<br>
; X86-NOBMI-NEXT: testb $32, %cl<br>
; X86-NOBMI-NEXT: je .LBB27_2<br>
; X86-NOBMI-NEXT: # %bb.1:<br>
-; X86-NOBMI-NEXT: movl %ebx, %edx<br>
+; X86-NOBMI-NEXT: movl %ebx, %eax<br>
; X86-NOBMI-NEXT: xorl %ebx, %ebx<br>
; X86-NOBMI-NEXT: .LBB27_2:<br>
-; X86-NOBMI-NEXT: movl (%eax), %esi<br>
-; X86-NOBMI-NEXT: andl %edx, %esi<br>
-; X86-NOBMI-NEXT: movl 4(%eax), %edi<br>
+; X86-NOBMI-NEXT: movl (%edx), %esi<br>
+; X86-NOBMI-NEXT: andl %eax, %esi<br>
+; X86-NOBMI-NEXT: movl 4(%edx), %edi<br>
; X86-NOBMI-NEXT: andl %ebx, %edi<br>
; X86-NOBMI-NEXT: subl $8, %esp<br>
; X86-NOBMI-NEXT: pushl %ebx<br>
-; X86-NOBMI-NEXT: pushl %edx<br>
+; X86-NOBMI-NEXT: pushl %eax<br>
; X86-NOBMI-NEXT: calll use64<br>
; X86-NOBMI-NEXT: addl $16, %esp<br>
; X86-NOBMI-NEXT: movl %esi, %eax<br>
@@ -2352,26 +2354,26 @@ define i64 @bzhi64_c2_load(i64* %w, i64<br>
; X86-BMI1NOTBM-NEXT: pushl %ebx<br>
; X86-BMI1NOTBM-NEXT: pushl %edi<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
-; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
-; X86-BMI1NOTBM-NEXT: movl $-1, %edx<br>
+; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %edx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
+; X86-BMI1NOTBM-NEXT: movl $-1, %eax<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %ebx<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %ebx<br>
-; X86-BMI1NOTBM-NEXT: shrdl %cl, %edx, %edx<br>
+; X86-BMI1NOTBM-NEXT: shrdl %cl, %eax, %eax<br>
; X86-BMI1NOTBM-NEXT: testb $32, %cl<br>
; X86-BMI1NOTBM-NEXT: je .LBB27_2<br>
; X86-BMI1NOTBM-NEXT: # %bb.1:<br>
-; X86-BMI1NOTBM-NEXT: movl %ebx, %edx<br>
+; X86-BMI1NOTBM-NEXT: movl %ebx, %eax<br>
; X86-BMI1NOTBM-NEXT: xorl %ebx, %ebx<br>
; X86-BMI1NOTBM-NEXT: .LBB27_2:<br>
-; X86-BMI1NOTBM-NEXT: movl (%eax), %esi<br>
-; X86-BMI1NOTBM-NEXT: andl %edx, %esi<br>
-; X86-BMI1NOTBM-NEXT: movl 4(%eax), %edi<br>
+; X86-BMI1NOTBM-NEXT: movl (%edx), %esi<br>
+; X86-BMI1NOTBM-NEXT: andl %eax, %esi<br>
+; X86-BMI1NOTBM-NEXT: movl 4(%edx), %edi<br>
; X86-BMI1NOTBM-NEXT: andl %ebx, %edi<br>
; X86-BMI1NOTBM-NEXT: subl $8, %esp<br>
; X86-BMI1NOTBM-NEXT: pushl %ebx<br>
-; X86-BMI1NOTBM-NEXT: pushl %edx<br>
+; X86-BMI1NOTBM-NEXT: pushl %eax<br>
; X86-BMI1NOTBM-NEXT: calll use64<br>
; X86-BMI1NOTBM-NEXT: addl $16, %esp<br>
; X86-BMI1NOTBM-NEXT: movl %esi, %eax<br>
@@ -2386,25 +2388,25 @@ define i64 @bzhi64_c2_load(i64* %w, i64<br>
; X86-BMI1BMI2-NEXT: pushl %ebx<br>
; X86-BMI1BMI2-NEXT: pushl %edi<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
-; X86-BMI1BMI2-NEXT: movl $-1, %edx<br>
-; X86-BMI1BMI2-NEXT: shrxl %ecx, %edx, %ebx<br>
-; X86-BMI1BMI2-NEXT: shrdl %cl, %edx, %edx<br>
+; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %edx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
+; X86-BMI1BMI2-NEXT: movl $-1, %eax<br>
+; X86-BMI1BMI2-NEXT: shrxl %ecx, %eax, %ebx<br>
+; X86-BMI1BMI2-NEXT: shrdl %cl, %eax, %eax<br>
; X86-BMI1BMI2-NEXT: testb $32, %cl<br>
; X86-BMI1BMI2-NEXT: je .LBB27_2<br>
; X86-BMI1BMI2-NEXT: # %bb.1:<br>
-; X86-BMI1BMI2-NEXT: movl %ebx, %edx<br>
+; X86-BMI1BMI2-NEXT: movl %ebx, %eax<br>
; X86-BMI1BMI2-NEXT: xorl %ebx, %ebx<br>
; X86-BMI1BMI2-NEXT: .LBB27_2:<br>
-; X86-BMI1BMI2-NEXT: movl (%eax), %esi<br>
-; X86-BMI1BMI2-NEXT: andl %edx, %esi<br>
-; X86-BMI1BMI2-NEXT: movl 4(%eax), %edi<br>
+; X86-BMI1BMI2-NEXT: movl (%edx), %esi<br>
+; X86-BMI1BMI2-NEXT: andl %eax, %esi<br>
+; X86-BMI1BMI2-NEXT: movl 4(%edx), %edi<br>
; X86-BMI1BMI2-NEXT: andl %ebx, %edi<br>
; X86-BMI1BMI2-NEXT: subl $8, %esp<br>
; X86-BMI1BMI2-NEXT: pushl %ebx<br>
-; X86-BMI1BMI2-NEXT: pushl %edx<br>
+; X86-BMI1BMI2-NEXT: pushl %eax<br>
; X86-BMI1BMI2-NEXT: calll use64<br>
; X86-BMI1BMI2-NEXT: addl $16, %esp<br>
; X86-BMI1BMI2-NEXT: movl %esi, %eax<br>
@@ -2418,7 +2420,7 @@ define i64 @bzhi64_c2_load(i64* %w, i64<br>
; X64-NOBMI: # %bb.0:<br>
; X64-NOBMI-NEXT: pushq %rbx<br>
; X64-NOBMI-NEXT: movq %rsi, %rcx<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: movq $-1, %rax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %rax<br>
@@ -2434,7 +2436,7 @@ define i64 @bzhi64_c2_load(i64* %w, i64<br>
; X64-BMI1NOTBM: # %bb.0:<br>
; X64-BMI1NOTBM-NEXT: pushq %rbx<br>
; X64-BMI1NOTBM-NEXT: movq %rsi, %rcx<br>
-; X64-BMI1NOTBM-NEXT: negl %ecx<br>
+; X64-BMI1NOTBM-NEXT: negb %cl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rax<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rax<br>
@@ -2450,8 +2452,8 @@ define i64 @bzhi64_c2_load(i64* %w, i64<br>
; X64-BMI1BMI2: # %bb.0:<br>
; X64-BMI1BMI2-NEXT: pushq %rbx<br>
; X64-BMI1BMI2-NEXT: bzhiq %rsi, (%rdi), %rbx<br>
-; X64-BMI1BMI2-NEXT: # kill: def $esi killed $esi killed $rsi def $rsi<br>
-; X64-BMI1BMI2-NEXT: negl %esi<br>
+; X64-BMI1BMI2-NEXT: # kill: def $sil killed $sil killed $rsi def $rsi<br>
+; X64-BMI1BMI2-NEXT: negb %sil<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rax<br>
; X64-BMI1BMI2-NEXT: shrxq %rsi, %rax, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br>
@@ -2628,8 +2630,8 @@ define i64 @bzhi64_c4_commutative(i64 %v<br>
; X86-NOBMI-NEXT: pushl %edi<br>
; X86-NOBMI-NEXT: pushl %esi<br>
; X86-NOBMI-NEXT: pushl %eax<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl $-1, %esi<br>
; X86-NOBMI-NEXT: movl $-1, %edi<br>
; X86-NOBMI-NEXT: shrl %cl, %edi<br>
@@ -2659,8 +2661,8 @@ define i64 @bzhi64_c4_commutative(i64 %v<br>
; X86-BMI1NOTBM-NEXT: pushl %edi<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
; X86-BMI1NOTBM-NEXT: pushl %eax<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %esi<br>
; X86-BMI1NOTBM-NEXT: movl $-1, %edi<br>
; X86-BMI1NOTBM-NEXT: shrl %cl, %edi<br>
@@ -2690,8 +2692,8 @@ define i64 @bzhi64_c4_commutative(i64 %v<br>
; X86-BMI1BMI2-NEXT: pushl %edi<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: pushl %eax<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: movl $-1, %esi<br>
; X86-BMI1BMI2-NEXT: shrxl %ecx, %esi, %edi<br>
; X86-BMI1BMI2-NEXT: shrdl %cl, %esi, %esi<br>
@@ -2722,7 +2724,7 @@ define i64 @bzhi64_c4_commutative(i64 %v<br>
; X64-NOBMI-NEXT: pushq %rax<br>
; X64-NOBMI-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI-NEXT: movq %rdi, %r14<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: movq $-1, %rbx<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %rbx<br>
@@ -2742,7 +2744,7 @@ define i64 @bzhi64_c4_commutative(i64 %v<br>
; X64-BMI1NOTBM-NEXT: pushq %rax<br>
; X64-BMI1NOTBM-NEXT: movq %rsi, %rcx<br>
; X64-BMI1NOTBM-NEXT: movq %rdi, %r14<br>
-; X64-BMI1NOTBM-NEXT: negl %ecx<br>
+; X64-BMI1NOTBM-NEXT: negb %cl<br>
; X64-BMI1NOTBM-NEXT: movq $-1, %rbx<br>
; X64-BMI1NOTBM-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-BMI1NOTBM-NEXT: shrq %cl, %rbx<br>
@@ -2763,7 +2765,7 @@ define i64 @bzhi64_c4_commutative(i64 %v<br>
; X64-BMI1BMI2-NEXT: movq %rsi, %rbx<br>
; X64-BMI1BMI2-NEXT: movq %rdi, %r14<br>
; X64-BMI1BMI2-NEXT: movl %ebx, %eax<br>
-; X64-BMI1BMI2-NEXT: negl %eax<br>
+; X64-BMI1BMI2-NEXT: negb %al<br>
; X64-BMI1BMI2-NEXT: movq $-1, %rcx<br>
; X64-BMI1BMI2-NEXT: shrxq %rax, %rcx, %rdi<br>
; X64-BMI1BMI2-NEXT: callq use64<br>
@@ -2788,7 +2790,7 @@ define i32 @bzhi32_d0(i32 %val, i32 %num<br>
; X86-NOBMI: # %bb.0:<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shll %cl, %eax<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -2796,14 +2798,14 @@ define i32 @bzhi32_d0(i32 %val, i32 %num<br>
;<br>
; X86-BMI1NOTBM-LABEL: bzhi32_d0:<br>
; X86-BMI1NOTBM: # %bb.0:<br>
-; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1NOTBM-NEXT: movb {{[0-9]+}}(%esp), %al<br>
; X86-BMI1NOTBM-NEXT: shll $8, %eax<br>
; X86-BMI1NOTBM-NEXT: bextrl %eax, {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1NOTBM-NEXT: retl<br>
;<br>
; X86-BMI1BMI2-LABEL: bzhi32_d0:<br>
; X86-BMI1BMI2: # %bb.0:<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %al<br>
; X86-BMI1BMI2-NEXT: bzhil %eax, {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
@@ -2811,7 +2813,7 @@ define i32 @bzhi32_d0(i32 %val, i32 %num<br>
; X64-NOBMI: # %bb.0:<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
; X64-NOBMI-NEXT: movl %edi, %eax<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: shll %cl, %eax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -2890,7 +2892,7 @@ define i32 @bzhi32_d2_load(i32* %w, i32<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI-NEXT: movl (%eax), %eax<br>
; X86-NOBMI-NEXT: xorl %ecx, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: shll %cl, %eax<br>
; X86-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X86-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -2899,7 +2901,7 @@ define i32 @bzhi32_d2_load(i32* %w, i32<br>
; X86-BMI1NOTBM-LABEL: bzhi32_d2_load:<br>
; X86-BMI1NOTBM: # %bb.0:<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: shll $8, %ecx<br>
; X86-BMI1NOTBM-NEXT: bextrl %ecx, (%eax), %eax<br>
; X86-BMI1NOTBM-NEXT: retl<br>
@@ -2907,15 +2909,15 @@ define i32 @bzhi32_d2_load(i32* %w, i32<br>
; X86-BMI1BMI2-LABEL: bzhi32_d2_load:<br>
; X86-BMI1BMI2: # %bb.0:<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %ecx<br>
-; X86-BMI1BMI2-NEXT: bzhil %eax, (%ecx), %eax<br>
+; X86-BMI1BMI2-NEXT: movb {{[0-9]+}}(%esp), %cl<br>
+; X86-BMI1BMI2-NEXT: bzhil %ecx, (%eax), %eax<br>
; X86-BMI1BMI2-NEXT: retl<br>
;<br>
; X64-NOBMI-LABEL: bzhi32_d2_load:<br>
; X64-NOBMI: # %bb.0:<br>
; X64-NOBMI-NEXT: movl %esi, %ecx<br>
; X64-NOBMI-NEXT: movl (%rdi), %eax<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: shll %cl, %eax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $ecx<br>
; X64-NOBMI-NEXT: shrl %cl, %eax<br>
@@ -3003,8 +3005,8 @@ define i64 @bzhi64_d0(i64 %val, i64 %num<br>
; X86-NOBMI-NEXT: pushl %esi<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %edx<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl %edx, %esi<br>
; X86-NOBMI-NEXT: shll %cl, %esi<br>
; X86-NOBMI-NEXT: shldl %cl, %edx, %eax<br>
@@ -3042,8 +3044,8 @@ define i64 @bzhi64_d0(i64 %val, i64 %num<br>
; X86-BMI1NOTBM-NEXT: pushl %esi<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %edx<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl %edx, %esi<br>
; X86-BMI1NOTBM-NEXT: shll %cl, %esi<br>
; X86-BMI1NOTBM-NEXT: shldl %cl, %edx, %eax<br>
@@ -3080,8 +3082,8 @@ define i64 @bzhi64_d0(i64 %val, i64 %num<br>
; X86-BMI1BMI2-NEXT: pushl %esi<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %esi<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: shldl %cl, %eax, %esi<br>
; X86-BMI1BMI2-NEXT: shlxl %ecx, %eax, %edi<br>
; X86-BMI1BMI2-NEXT: xorl %edx, %edx<br>
@@ -3110,7 +3112,7 @@ define i64 @bzhi64_d0(i64 %val, i64 %num<br>
; X64-NOBMI: # %bb.0:<br>
; X64-NOBMI-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI-NEXT: movq %rdi, %rax<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: shlq %cl, %rax<br>
; X64-NOBMI-NEXT: # kill: def $cl killed $cl killed $rcx<br>
; X64-NOBMI-NEXT: shrq %cl, %rax<br>
@@ -3281,8 +3283,8 @@ define i64 @bzhi64_d2_load(i64* %w, i64<br>
; X86-NOBMI-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-NOBMI-NEXT: movl (%eax), %edx<br>
; X86-NOBMI-NEXT: movl 4(%eax), %eax<br>
-; X86-NOBMI-NEXT: movl $64, %ecx<br>
-; X86-NOBMI-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-NOBMI-NEXT: movb $64, %cl<br>
+; X86-NOBMI-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-NOBMI-NEXT: movl %edx, %esi<br>
; X86-NOBMI-NEXT: shll %cl, %esi<br>
; X86-NOBMI-NEXT: shldl %cl, %edx, %eax<br>
@@ -3321,8 +3323,8 @@ define i64 @bzhi64_d2_load(i64* %w, i64<br>
; X86-BMI1NOTBM-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1NOTBM-NEXT: movl (%eax), %edx<br>
; X86-BMI1NOTBM-NEXT: movl 4(%eax), %eax<br>
-; X86-BMI1NOTBM-NEXT: movl $64, %ecx<br>
-; X86-BMI1NOTBM-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1NOTBM-NEXT: movb $64, %cl<br>
+; X86-BMI1NOTBM-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1NOTBM-NEXT: movl %edx, %esi<br>
; X86-BMI1NOTBM-NEXT: shll %cl, %esi<br>
; X86-BMI1NOTBM-NEXT: shldl %cl, %edx, %eax<br>
@@ -3360,8 +3362,8 @@ define i64 @bzhi64_d2_load(i64* %w, i64<br>
; X86-BMI1BMI2-NEXT: movl {{[0-9]+}}(%esp), %eax<br>
; X86-BMI1BMI2-NEXT: movl (%eax), %edx<br>
; X86-BMI1BMI2-NEXT: movl 4(%eax), %esi<br>
-; X86-BMI1BMI2-NEXT: movl $64, %ecx<br>
-; X86-BMI1BMI2-NEXT: subl {{[0-9]+}}(%esp), %ecx<br>
+; X86-BMI1BMI2-NEXT: movb $64, %cl<br>
+; X86-BMI1BMI2-NEXT: subb {{[0-9]+}}(%esp), %cl<br>
; X86-BMI1BMI2-NEXT: shldl %cl, %edx, %esi<br>
; X86-BMI1BMI2-NEXT: shlxl %ecx, %edx, %edi<br>
; X86-BMI1BMI2-NEXT: xorl %edx, %edx<br>
@@ -3390,7 +3392,7 @@ define i64 @bzhi64_d2_load(i64* %w, i64<br>
; X64-NOBMI: # %bb.0:<br>
; X64-NOBMI-NEXT: movq %rsi, %rcx<br>
; X64-NOBMI-NEXT: movq (%rdi), %rax<br>
-; X64-NOBMI-NEXT: negl %ecx<br>
+; X64-NOBMI-NEXT: negb %cl<br>
; X64-NOBMI-NEXT: shlq %cl, %rax<b</blockquote></div></blockquote></div></div>
</blockquote></div></div></div>