<html><head><meta http-equiv="Content-Type" content="text/html charset=windows-1252"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;"><br><div><div>On Dec 3, 2013, at 12:19 PM, Hal Finkel <<a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a>> wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div style="font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;">----- Original Message -----<br><blockquote type="cite">From: "Andrew Trick" <<a href="mailto:atrick@apple.com">atrick@apple.com</a>><br>To: "Hal Finkel" <<a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a>><br>Cc: "llvm commits" <<a href="mailto:llvm-commits@cs.uiuc.edu">llvm-commits@cs.uiuc.edu</a>><br>Sent: Tuesday, December 3, 2013 1:46:30 PM<br>Subject: Re: [llvm-commits] [llvm] r157977 - in /llvm/trunk: include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>lib/CodeGen/ScoreboardHazardRecognizer.cpp<br><br><br><br><br>On Dec 3, 2013, at 11:31 AM, Hal Finkel < <a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a> > wrote:<br><br><br><br>----- Original Message -----<br><br><br>From: "Andrew Trick" < <a href="mailto:atrick@apple.com">atrick@apple.com</a> ><br>To: "Hal Finkel" < <a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a> ><br>Cc: "llvm commits" < <a href="mailto:llvm-commits@cs.uiuc.edu">llvm-commits@cs.uiuc.edu</a> ><br>Sent: Tuesday, December 3, 2013 1:15:38 PM<br>Subject: Re: [llvm-commits] [llvm] r157977 - in /llvm/trunk:<br>include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>lib/CodeGen/ScoreboardHazardRecognizer.cpp<br><br><br><br><br>On Dec 3, 2013, at 11:05 AM, Hal Finkel < <a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a> > wrote:<br><br><br><br>----- Original Message -----<br><br><br>From: "Andrew Trick" < <a href="mailto:atrick@apple.com">atrick@apple.com</a> ><br>To: "Hal Finkel" < <a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a> ><br>Cc: "llvm commits" < <a href="mailto:llvm-commits@cs.uiuc.edu">llvm-commits@cs.uiuc.edu</a> ><br>Sent: Tuesday, December 3, 2013 12:15:08 PM<br>Subject: Re: [llvm-commits] [llvm] r157977 - in /llvm/trunk:<br>include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>lib/CodeGen/ScoreboardHazardRecognizer.cpp<br><br><br>On Dec 3, 2013, at 9:19 AM, Hal Finkel < <a href="mailto:hfinkel@anl.gov">hfinkel@anl.gov</a> > wrote:<br><br><br><br>----- Original Message -----<br><br><br>From: "Andrew Trick" < <a href="mailto:atrick@apple.com">atrick@apple.com</a> ><br>To: <a href="mailto:llvm-commits@cs.uiuc.edu">llvm-commits@cs.uiuc.edu</a><br>Sent: Monday, June 4, 2012 10:44:32 PM<br>Subject: [llvm-commits] [llvm] r157977 - in /llvm/trunk:<br>include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>lib/CodeGen/ScoreboardHazardRecognizer.cpp<br><br>Author: atrick<br>Date: Mon Jun 4 22:44:32 2012<br>New Revision: 157977<br><br>URL: <a href="http://llvm.org/viewvc/llvm-project?rev=157977&view=rev">http://llvm.org/viewvc/llvm-project?rev=157977&view=rev</a><br>Log:<br>misched: Allow disabling scoreboard hazard checking for subtargets<br>with a<br>valid itinerary but no pipeline stages.<br><br>An itinerary can contain useful scheduling information without<br>specifying pipeline stages for each instruction.<br><br>Modified:<br>llvm/trunk/include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>llvm/trunk/lib/CodeGen/ScoreboardHazardRecognizer.cpp<br><br>Modified:<br>llvm/trunk/include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>URL:<br><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/include/llvm/CodeGen/ScheduleHazardRecognizer.h?rev=157977&r1=157976&r2=157977&view=diff">http://llvm.org/viewvc/llvm-project/llvm/trunk/include/llvm/CodeGen/ScheduleHazardRecognizer.h?rev=157977&r1=157976&r2=157977&view=diff</a><br>==============================================================================<br>--- llvm/trunk/include/llvm/CodeGen/ScheduleHazardRecognizer.h<br>(original)<br>+++ llvm/trunk/include/llvm/CodeGen/ScheduleHazardRecognizer.h Mon<br>Jun 4 22:44:32 2012<br>@@ -46,6 +46,8 @@<br><br>/// atIssueLimit - Return true if no more instructions may be<br>issued in this<br>/// cycle.<br>+ ///<br>+ /// FIXME: remove this once MachineScheduler is the only<br>client.<br><br>Does this mean that the MachineScheduler does, or will, do PostRA<br>scheduling as well?<br><br>When I implemented MachineScheduler, the plan was that both SD<br>scheduler and PostRA scheduler would be eliminated. Then the<br>scheduling data structure could be further streamlined for<br>simplicity and compile time. I don’t see a fundamental reason the<br>machine scheduler can’t run as a PostRA pass, it just needs to be<br>setup to do that.<br><br>Do you mean calling substitutePass(&PostRASchedulerID,<br>&MachineSchedulerID)? Or some kind of 'setup' within the machine<br>scheduler itself?<br><br><br><br>I’m pretty sure something will break when you do that. But it should<br>just be a matter of disabling certain features when running in<br>PostRA mode. e.g. LiveIntervals and RegPressure tracking should be<br>disabled.<br><br>Fair enough. We'd also want to run anti-dependency breaking.<br><br><br><br>I’m not sure if you’ll want AliasAnalysis.<br><br>I'm also not sure; I'm experimenting with this now for avoiding<br>load-after-store hazards.<br><br><br><br><br><br>It would be nice if we could conditionally enable postRA scheduling<br>just for blocks with spill/prolog/epilog code. Maybe we want an<br>Analysis pass to remember these blocks.<br><br>Agreed (unless I need a second top-down pass anyway, for dispatch<br>group formation on the POWER cores, for example).<br><br><br>Your commit mentioned that only postRA handled dispatch groups. Do<br>you simply mean that it is a top-down scheduling pass?<br></blockquote><br>This is one of the things that I'm working on fixing. It had been the case for some time that we were only doing dispatch-group formation PostRA. I don't remember exactly why now, but it had something the way that the existing hazard recognizer was written.<br><br><blockquote type="cite"><br><br>The bottom-up pass should also be trying to form dispatch groups. The<br>actual groups will differ at execution time because of the partial<br>group at the beginning of the block, so the processor may<br>opportunistically move some instructions into earlier groups, but<br>final throughput should be the same as in the scheduler’s model.<br></blockquote><br>I agree. However, there are two parts to this: First is modeling the dispatch slots as resources (which can certainly be done in either direction). The second is inserting noops where necessary to force a new dispatch groups. I'm not sure that doing that bottom-up makes sense. On the other hand, I think that doing this bottom-up may only be problematic if you need to insert multiple noops (because you don't know how many to insert until you schedule the preceding group), and for all recent POWER cores, there is a special 'group ending' noop, so this may not be an issue.<br></div></blockquote><div><br></div><div>I didn’t realize you were inserting nops to avoid hazards. That’s odd for an OOO core. If that’s important, I could see why you want a second postRA pass.</div><div>I recently took a look at adding in-order resources to the out-of-order machine scheduler (new scheduling model/no itinieraries). I have limited ability/time to test it though beyond some quick evaluation on cortex-a9. I can checkin the feature in prototype form if you want to experiment and potentially enable it for your target.</div><div><br></div>-Andy</div><div><br><blockquote type="cite"><div style="font-size: 12px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px;"><blockquote type="cite">In<br>fact, I would guess that the processor can dispatch across branches,<br>so the scheduler’s groups won’t perfectly reflect reality anyway.<br></blockquote><br>I could be wrong, but I think that, for the purpose of avoiding LS hazards, this is not true (but certainly could effect non-epoch-based hazards [my terminology] like register rename slots).<br><br>Thanks again,<br>Hal<br><br><blockquote type="cite"><br><br>-Andy<br><br><br><br><br><br>-Hal<br><br><br><br><br>-Andy<br><br><br><br><br>-Hal<br><br><br><br><br>-Andy<br><br><br><br>-Hal<br><br><br><br>virtual bool atIssueLimit() const { return false; }<br><br>/// getHazardType - Return the hazard type of emitting this<br>node.<br>There are<br><br>Modified: llvm/trunk/lib/CodeGen/ScoreboardHazardRecognizer.cpp<br>URL:<br><a href="http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/CodeGen/ScoreboardHazardRecognizer.cpp?rev=157977&r1=157976&r2=157977&view=diff">http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/CodeGen/ScoreboardHazardRecognizer.cpp?rev=157977&r1=157976&r2=157977&view=diff</a><br>==============================================================================<br>--- llvm/trunk/lib/CodeGen/ScoreboardHazardRecognizer.cpp<br>(original)<br>+++ llvm/trunk/lib/CodeGen/ScoreboardHazardRecognizer.cpp Mon Jun<br>4<br>22:44:32 2012<br>@@ -39,9 +39,9 @@<br>DebugType = ParentDebugType;<br>#endif<br><br>- // Determine the maximum depth of any itinerary. This<br>determines<br>the<br>- // depth of the scoreboard. We always make the scoreboard at<br>least<br>1<br>- // cycle deep to avoid dealing with the boundary condition.<br>+ // Determine the maximum depth of any itinerary. This<br>determines<br>the depth of<br>+ // the scoreboard. We always make the scoreboard at least 1<br>cycle<br>deep to<br>+ // avoid dealing with the boundary condition.<br>unsigned ScoreboardDepth = 1;<br>if (ItinData && !ItinData->isEmpty()) {<br>IssueWidth = ItinData->IssueWidth;<br>@@ -63,16 +63,22 @@<br>// Find the next power-of-2 >= ItinDepth<br>while (ItinDepth > ScoreboardDepth) {<br>ScoreboardDepth *= 2;<br>+ // Don't set MaxLookAhead until we find at least one<br>nonzero<br>stage.<br>+ // This way, an itinerary with no stages has<br>MaxLookAhead==0, which<br>+ // completely bypasses the scoreboard hazard logic.<br>+ MaxLookAhead = ScoreboardDepth;<br>}<br>}<br>- MaxLookAhead = ScoreboardDepth;<br>}<br><br>ReservedScoreboard.reset(ScoreboardDepth);<br>RequiredScoreboard.reset(ScoreboardDepth);<br><br>- DEBUG(dbgs() << "Using scoreboard hazard recognizer: Depth = "<br>- << ScoreboardDepth << '\n');<br>+ if (!MaxLookAhead)<br>+ DEBUG(dbgs() << "Disabled scoreboard hazard recognizer\n");<br>+ else<br>+ DEBUG(dbgs() << "Using scoreboard hazard recognizer: Depth =<br>"<br>+ << ScoreboardDepth << '\n');<br>}<br><br>void ScoreboardHazardRecognizer::Reset() {<br><br><br>_______________________________________________<br>llvm-commits mailing list<br>llvm-commits@cs.uiuc.edu<br>http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits<br><br><br>--<br>Hal Finkel<br>Assistant Computational Scientist<br>Leadership Computing Facility<br>Argonne National Laboratory<br><br><br><br>--<br>Hal Finkel<br>Assistant Computational Scientist<br>Leadership Computing Facility<br>Argonne National Laboratory<br><br><br>--<br>Hal Finkel<br>Assistant Computational Scientist<br>Leadership Computing Facility<br>Argonne National Laboratory<br><br></blockquote><br>--<span class="Apple-converted-space"> </span><br>Hal Finkel<br>Assistant Computational Scientist<br>Leadership Computing Facility<br>Argonne National Laboratory</div></blockquote></div><br></body></html>