<div>I'm running into a problem with subregs during trivial coalescing in the linear scan allocator.</div><div><br></div><div>Should RALinScan::attemptTrivialCoalescing be allowed to coalesce a COPY that uses a subreg as a destination?</div>
<div><br></div><div>I've got the following sequence of code (unfortunately for an out of tree target) that is moving 32 and 64 bit sub-registers around within a 128 bit register. By the time the register allocator runs the code looks like:</div>
<div><br></div><div><font class="Apple-style-span" face="'courier new', monospace">92L<span style="white-space:pre-wrap"> </span>%reg16402:dsub0<def> = DEF64.. %reg16402<imp-def>, QPR:%reg16402.... </font></div>
<div><font class="Apple-style-span" face="'courier new', monospace">116L<span style="white-space:pre-wrap"> </span>%reg16405:sub0<def> = COPY %reg16402:sub1, %reg16405<imp-def>; QPR:%reg16405,16402</font></div>
<div><font class="Apple-style-span" face="'courier new', monospace">124L<span style="white-space:pre-wrap"> </span>%reg16413:sub0<def> = COPY %reg16405:sub0<kill>; QPR:%reg16413,16405</font></div><div>
<font class="Apple-style-span" face="'courier new', monospace">.... stuff ....</font></div><div><font class="Apple-style-span" face="'courier new', monospace">468L<span style="white-space:pre-wrap"> </span>%reg16460:sub3<def> = COPY %reg16402:sub0<kill>; QPR:%reg16460,16402</font></div>
<div><br></div><div>dsub0/dsub1 are 64-bit subregs and sub0/1/2/3 are 32 bit subregs. DEF64 is just representative of a 64 bit ALU operation.</div><div><br></div><div>The code is correct at this point.</div><div><br></div>
<div>Q1 (a 128-bit physical register) is assigned to %reg16402 which is ok but then RALinScan::attemptTrivialCoalescing thinks it can coalesce r16405 with r16402 giving:</div>
<div><br></div><div><font class="Apple-style-span" face="'courier new', monospace">92L<span style="white-space:pre-wrap"> </span>Q1:dsub0<def> = DEF64 %reg16402<imp-def>, QPR:%reg16402</font></div>
<div><font class="Apple-style-span" face="'courier new', monospace">116L<span style="white-space:pre-wrap"> </span>Q1:sub0<def> = COPY Q1:sub1, %reg16405<imp-def>; QPR:%reg16405,16402</font></div>
<div><font class="Apple-style-span" face="'courier new', monospace">124L<span style="white-space:pre-wrap"> </span>%reg16413:sub0<def> = COPY Q1:sub0; QPR:%reg16413,16405</font></div><div><font class="Apple-style-span" face="'courier new', monospace">.... stuff ....</font></div>
<div><font class="Apple-style-span" face="'courier new', monospace">468L<span style="white-space:pre-wrap"> </span>%reg16460:sub3<def> = COPY Q1:sub0<kill>; QPR:%reg16460,16402</font></div>
<div><br></div><div>which is wrong: Q1:sub0 (in reg16402) is overwitten at 116 before it's last use at 468.</div><div><br></div><div>Trivial coalescing doesn't have any check for subregs so I'm assuming that something is broke. I've patched things up by rejecting any trivial coalescing attempt where the COPY has a subregister as it's target (none of the tests for arm and x86 hit this case).</div>
<div><br></div><div>Does that seem right? Not sure if I've misused subregs or if the input to the RA is incorrect?</div><div><br></div><div>Krister</div><div><br></div>