<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
@font-face
{font-family:"Intel Clear";}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#0563C1;
text-decoration:underline;}
span.EmailStyle22
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="#0563C1" vlink="#954F72" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal">Hi Scott,<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Hmm, you’re right. I guess the original reasons for this may now be lost in antiquity… One reason I can think of why we’d want to do this would be to ensure that we don’t have overlapping input registers. Back before subregister liveness
tracking was implemented in codegen, many instructions that use/def a subregister would have implicit uses/defs of the super-registers. I’m not sure if that ever had any impact on how function live-ins were represented, so it’s just a speculation.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Feel free to create a patch, I’ll run some tests on it in our downstream repo. If it works, it should be ok to commit it.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:Consolas">-- </span>
<span style="font-size:9.0pt;font-family:Consolas"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:8.0pt;font-family:Consolas">Krzysztof Parzyszek
<a href="mailto:kparzysz@quicinc.com">kparzysz@quicinc.com</a> AI tools development<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Constable, Scott D <scott.d.constable@intel.com>
<br>
<b>Sent:</b> Wednesday, January 26, 2022 9:58 AM<br>
<b>To:</b> Krzysztof Parzyszek <kparzysz@quicinc.com><br>
<b>Cc:</b> via llvm-dev <llvm-dev@lists.llvm.org><br>
<b>Subject:</b> RE: [RDF] Question about function argument registers on X86<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p align="center" style="text-align:center"><strong><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black;background:yellow">WARNING:</span></strong><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black;background:yellow">
This email originated from outside of Qualcomm. Please be wary of any links or attachments, and do not enable macros.</span><o:p></o:p></p>
<div>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Hi Krzysztof,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">I have been looking at more examples of RDF graphs, and it seems that this register lane splitting is only being applied to function arguments. Anywhere else in the function where where a 64-bit GPR
is being def’ed, I only see a single def for that GPR. As an example, in simpleFunction() below there is one def for RAX instead of three def’s for HAX, AXH, and AXL.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Could you clarify this behavior?<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">The motivation for my inquiry is that I am working on an analysis that benefits from an SSA property, namely, that each use has a unique def (possibly a phi). But in RDF this property does not hold
for function arguments, so the analysis becomes more tricky. As an experiment, I changed the behavior of RDF to generate only a single phi for each argument, and now my analysis works as expected (and I’m not seeing any crashes, etc.). The code I changed is
in DataFlowGraph::build():<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""> MachineRegisterInfo &MRI = MF.getRegInfo();<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""> MachineBasicBlock &EntryB = *EA.Addr->getCode();<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""> assert(EntryB.pred_empty() && "Function entry block has predecessors");<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- for (std::pair<unsigned,unsigned> P : MRI.liveins())<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- LiveIns.insert(RegisterRef(P.first));<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""> if (MRI.tracksLiveness()) {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- for (auto I : EntryB.liveins())<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- LiveIns.insert(RegisterRef(I.PhysReg, I.LaneMask));<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- }<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">-<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- // Add function-entry phi nodes for the live-in registers.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- //for (std::pair<RegisterId,LaneBitmask> P : LiveIns) {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- for (auto I = LiveIns.rr_begin(), E = LiveIns.rr_end(); I != E; ++I) {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- RegisterRef RR = *I;<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- NodeAddr<PhiNode*> PA = newPhi(EA);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- uint16_t PhiFlags = NodeAttrs::PhiRef | NodeAttrs::Preserving;<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- NodeAddr<DefNode*> DA = newDef(PA, RR, PhiFlags);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">- PA.Addr->addMember(DA, *this);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ for (auto I = MRI.livein_begin(), E = MRI.livein_end(); I != E; ++I) {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ RegisterRef RR(I->first);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ NodeAddr<PhiNode*> PA = newPhi(EA);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ uint16_t PhiFlags = NodeAttrs::PhiRef | NodeAttrs::Preserving;<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ NodeAddr<DefNode*> DA = newDef(PA, RR, PhiFlags);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ PA.Addr->addMember(DA, *this);<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">+ }<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""> }<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Thanks,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Scott Constable<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Constable, Scott D <br>
<b>Sent:</b> Monday, January 24, 2022 1:23 PM<br>
<b>To:</b> Krzysztof Parzyszek <<a href="mailto:kparzysz@quicinc.com">kparzysz@quicinc.com</a>><br>
<b>Cc:</b> via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>><br>
<b>Subject:</b> RE: [RDF] Question about function argument registers on X86<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Right! That really should have occurred to me ;). Thanks again.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Scott<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Krzysztof Parzyszek <<a href="mailto:kparzysz@quicinc.com">kparzysz@quicinc.com</a>>
<br>
<b>Sent:</b> Monday, January 24, 2022 1:19 PM<br>
<b>To:</b> Constable, Scott D <<a href="mailto:scott.d.constable@intel.com">scott.d.constable@intel.com</a>><br>
<b>Cc:</b> via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>><br>
<b>Subject:</b> RE: [RDF] Question about function argument registers on X86<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">The reason is that storing to EDI actually overwrites the upper 32 bits of RDI, so the EDI and RDI are considered identical in terms of register units.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:Consolas">-- <o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:8.0pt;font-family:Consolas">Krzysztof Parzyszek
</span><a href="mailto:kparzysz@quicinc.com"><span style="font-size:8.0pt;font-family:Consolas">kparzysz@quicinc.com</span></a><span style="font-size:8.0pt;font-family:Consolas"> AI tools development<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Constable, Scott D <<a href="mailto:scott.d.constable@intel.com">scott.d.constable@intel.com</a>>
<br>
<b>Sent:</b> Monday, January 24, 2022 3:11 PM<br>
<b>To:</b> Krzysztof Parzyszek <<a href="mailto:kparzysz@quicinc.com">kparzysz@quicinc.com</a>><br>
<b>Cc:</b> via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>><br>
<b>Subject:</b> RE: [RDF] Question about function argument registers on X86<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p align="center" style="text-align:center"><strong><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black;background:yellow">WARNING:</span></strong><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black;background:yellow">
This email originated from outside of Qualcomm. Please be wary of any links or attachments, and do not enable macros.</span><o:p></o:p></p>
<div>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Thank you very much for the explanation, Krzysztof. There is still one part that I am missing. In the example below, RDF generates defs for DIL (bits 0:7 of RDI), DIH (bits 8:15 of RDI), and HDI (bits
16:31 of RDI). Hence bits 32:63 are unaccounted. Could this cause any issues? For example, suppose the program writes to RDI, and then clobbers EAX, but then the upper half of RDI is still defined.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Scott<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> Krzysztof Parzyszek <<a href="mailto:kparzysz@quicinc.com">kparzysz@quicinc.com</a>>
<br>
<b>Sent:</b> Monday, January 24, 2022 11:48 AM<br>
<b>To:</b> Constable, Scott D <<a href="mailto:scott.d.constable@intel.com">scott.d.constable@intel.com</a>><br>
<b>Cc:</b> via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>><br>
<b>Subject:</b> RE: [RDF] Question about function argument registers on X86<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Hi Scott,<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">It used to be the case early on that we’d create an entry for the register as it was specified, but I’ve been running into all sorts of complications when calculating liveness. Specifically, when defs/uses of “large” registers were intermingled
with defs/uses of their sub-registers, various “interesting” scenarios showed up. They had to do with representing unions/intersections of registers in terms of actual registers[1]---that was the reason for switching to tracking register units instead (see
RegisterAggr class).<o:p></o:p></p>
<p class="MsoNormal">The consequence of that is that the graph looks more cluttered, but things are easier to express accurately.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">[1] This was before lane masks became popular. Say we have<o:p></o:p></p>
<p class="MsoNormal">AX = def<o:p></o:p></p>
<p class="MsoNormal">AH = clobber<o:p></o:p></p>
<p class="MsoNormal">Here AX is still “defined” since some part of it is defined. Now, if we know that AX is “defined” then what can we say about it after executing “AL = clobber”? Turns out that this is difficult to answer, because we’d have to know the
history of how AX was modified, which isn’t always analyzed in full: (RDF stop def traversal at a PHI node).<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:Consolas">-- <o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:8.0pt;font-family:Consolas">Krzysztof Parzyszek
</span><a href="mailto:kparzysz@quicinc.com"><span style="font-size:8.0pt;font-family:Consolas">kparzysz@quicinc.com</span></a><span style="font-size:8.0pt;font-family:Consolas"> AI tools development<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b>From:</b> llvm-dev <<a href="mailto:llvm-dev-bounces@lists.llvm.org">llvm-dev-bounces@lists.llvm.org</a>>
<b>On Behalf Of </b>Constable, Scott D via llvm-dev<br>
<b>Sent:</b> Saturday, January 22, 2022 3:24 PM<br>
<b>To:</b> <a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a><br>
<b>Subject:</b> [llvm-dev] [RDF] Question about function argument registers on X86<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p align="center" style="text-align:center"><strong><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black;background:yellow">WARNING:</span></strong><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black;background:yellow">
This email originated from outside of Qualcomm. Please be wary of any links or attachments, and do not enable macros.</span><o:p></o:p></p>
<div>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Hi,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Given the function:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">long simpleExample(long x) {<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""> return x;<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">}<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">RDF produces the following data-flow graph:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">DFG dump:[<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">f1: Function: simpleExample<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">b2: --- %bb.0 --- preds(0): succs(0):<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">p8: phi [+d9<DIH:0000000000000001>(,,u15"):]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">p10: phi [+d11<DIL:0000000000000001>(,,u14"):]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">p12: phi [+d13<HDI:0000000000000001>(,,u5"):]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">s3: COPY [d4<RAX>(,,u7):, u5"<RDI>(+d13):, u14"<RDI>(+d11):, u15"<RDI>(+d9):]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">s6: RET [u7<RAX>!(d4):]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Why is the argument in RDI split into three lanes that do not fully span RDI? It would seem more natural to just have a single phi for RDI.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">I think the RDF code responsible for this behavior is located here:
</span><a href="https://github.com/llvm/llvm-project/blob/55d887b833646baeea0e3371fd2cbbd7550a8d4d/llvm/lib/CodeGen/RDFGraph.cpp#L903"><span style="font-family:"Intel Clear"">https://github.com/llvm/llvm-project/blob/55d887b833646baeea0e3371fd2cbbd7550a8d4d/llvm/lib/CodeGen/RDFGraph.cpp#L903</span></a><span style="font-family:"Intel Clear"">,
but I admit I do not fully understand what is going on. <o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Thanks in advance,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear"">Scott Constable<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:"Intel Clear""><o:p> </o:p></span></p>
</div>
</div>
</div>
</div>
</body>
</html>