Tue Aug 8 22:56:53 PDT 2006

Changes in directory llvm-www/releases/1.8/docs:

AliasAnalysis.html added (r1.1)
Bugpoint.html added (r1.1)
BytecodeFormat.html added (r1.1)
CFEBuildInstrs.html added (r1.1)
CodeGenerator.html added (r1.1)
CodingStandards.html added (r1.1)
CommandLine.html added (r1.1)
CompilerDriver.html added (r1.1)
CompilerWriterInfo.html added (r1.1)
ExtendingLLVM.html added (r1.1)
FAQ.html added (r1.1)
GarbageCollection.html added (r1.1)
GettingStarted.html added (r1.1)
GettingStartedVS.html added (r1.1)
HowToReleaseLLVM.html added (r1.1)
HowToSubmitABug.html added (r1.1)
LangRef.html added (r1.1)
Lexicon.html added (r1.1)
Makefile added (r1.1)
MakefileGuide.html added (r1.1)
ProgrammersManual.html added (r1.1)
Projects.html added (r1.1)
ReleaseNotes.html added (r1.1)
SourceLevelDebugging.html added (r1.1)
Stacker.html added (r1.1)
SystemLibrary.html added (r1.1)
TableGenFundamentals.html added (r1.1)
TestingGuide.html added (r1.1)
UsingLibraries.html added (r1.1)
WritingAnLLVMBackend.html added (r1.1)
WritingAnLLVMPass.html added (r1.1)
doxygen.cfg.in added (r1.1)
doxygen.css added (r1.1)
doxygen.footer added (r1.1)
doxygen.header added (r1.1)
doxygen.intro added (r1.1)
index.html added (r1.1)
llvm.css added (r1.1)
---
Log message:

Adding  1.8 docs

---
Diffs of the changes:  (+30599 -0)

 AliasAnalysis.html        |  959 +++++++++++
 Bugpoint.html             |  238 ++
 BytecodeFormat.html       | 2154 +++++++++++++++++++++++++
 CFEBuildInstrs.html       |  364 ++++
 CodeGenerator.html        | 1293 +++++++++++++++
 CodingStandards.html      |  679 ++++++++
 CommandLine.html          | 1930 +++++++++++++++++++++++
 CompilerDriver.html       |  823 +++++++++
 CompilerWriterInfo.html   |  260 +++
 ExtendingLLVM.html        |  389 ++++
 FAQ.html                  |  678 ++++++++
 GarbageCollection.html    |  533 ++++++
 GettingStarted.html       | 1601 +++++++++++++++++++
 GettingStartedVS.html     |  353 ++++
 HowToReleaseLLVM.html     |  474 +++++
 HowToSubmitABug.html      |  359 ++++
 LangRef.html              | 3846 ++++++++++++++++++++++++++++++++++++++++++++++
 Lexicon.html              |  178 ++
 Makefile                  |   83 
 MakefileGuide.html        | 1010 ++++++++++++
 ProgrammersManual.html    | 2288 +++++++++++++++++++++++++++
 Projects.html             |  460 +++++
 ReleaseNotes.html         |  691 ++++++++
 SourceLevelDebugging.html | 1762 +++++++++++++++++++++
 Stacker.html              | 1412 ++++++++++++++++
 SystemLibrary.html        |  344 ++++
 TableGenFundamentals.html |  567 ++++++
 TestingGuide.html         |  620 +++++++
 UsingLibraries.html       |  398 ++++
 WritingAnLLVMBackend.html |  260 +++
 WritingAnLLVMPass.html    | 1600 +++++++++++++++++++
 doxygen.cfg.in            | 1230 ++++++++++++++
 doxygen.css               |  378 ++++
 doxygen.footer            |    9 
 doxygen.header            |    9 
 doxygen.intro             |   18 
 index.html                |  265 +++
 llvm.css                  |   84 +
 38 files changed, 30599 insertions(+)

Index: llvm-www/releases/1.8/docs/AliasAnalysis.html
diff -c /dev/null llvm-www/releases/1.8/docs/AliasAnalysis.html:1.1
*** /dev/null	Wed Aug  9 00:56:50 2006
--- llvm-www/releases/1.8/docs/AliasAnalysis.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,959 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>LLVM Alias Analysis Infrastructure</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   LLVM Alias Analysis Infrastructure
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a></li>
+ 
+   <li><a href="#overview"><tt>AliasAnalysis</tt> Class Overview</a>
+     <ul>
+     <li><a href="#pointers">Representation of Pointers</a></li>
+     <li><a href="#alias">The <tt>alias</tt> method</a></li>
+     <li><a href="#ModRefInfo">The <tt>getModRefInfo</tt> methods</a></li>
+     <li><a href="#OtherItfs">Other useful <tt>AliasAnalysis</tt> methods</a></li>
+     </ul>
+   </li>
+ 
+   <li><a href="#writingnew">Writing a new <tt>AliasAnalysis</tt> Implementation</a>
+     <ul>
+     <li><a href="#passsubclasses">Different Pass styles</a></li>
+     <li><a href="#requiredcalls">Required initialization calls</a></li>
+     <li><a href="#interfaces">Interfaces which may be specified</a></li>
+     <li><a href="#chaining"><tt>AliasAnalysis</tt> chaining behavior</a></li>
+     <li><a href="#updating">Updating analysis results for transformations</a></li>
+     <li><a href="#implefficiency">Efficiency Issues</a></li>
+     </ul>
+   </li>
+ 
+   <li><a href="#using">Using alias analysis results</a>
+     <ul>
+     <li><a href="#loadvn">Using the <tt>-load-vn</tt> Pass</a></li>
+     <li><a href="#ast">Using the <tt>AliasSetTracker</tt> class</a></li>
+     <li><a href="#direct">Using the <tt>AliasAnalysis</tt> interface directly</a></li>
+     </ul>
+   </li>
+ 
+   <li><a href="#exist">Existing alias analysis implementations and clients</a>
+     <ul>
+     <li><a href="#impls">Available <tt>AliasAnalysis</tt> implementations</a></li>
+     <li><a href="#aliasanalysis-xforms">Alias analysis driven transformations</a></li>
+     <li><a href="#aliasanalysis-debug">Clients for debugging and evaluation of
+     implementations</a></li>
+     </ul>
+   </li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Alias Analysis (aka Pointer Analysis) is a class of techniques which attempt
+ to determine whether or not two pointers can ever point to the same object in
+ memory.  There are many different algorithms for alias analysis and many
+ different ways of classifying them: flow-sensitive vs flow-insensitive,
+ context-sensitive vs context-insensitive, field-sensitive vs field-insensitive,
+ unification-based vs subset-based, etc.  Traditionally, alias analyses respond
+ to a query with a <a href="#MustMayNo">Must, May, or No</a> alias response,
+ indicating that two pointers always point to the same object, might point to the
+ same object, or are known to never point to the same object.</p>
+ 
+ <p>The LLVM <a
+ href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html"><tt>AliasAnalysis</tt></a>
+ class is the primary interface used by clients and implementations of alias
+ analyses in the LLVM system.  This class is the common interface between clients
+ of alias analysis information and the implementations providing it, and is
+ designed to support a wide range of implementations and clients (but currently
+ all clients are assumed to be flow-insensitive).  In addition to simple alias
+ analysis information, this class exposes Mod/Ref information from those
+ implementations which can provide it, allowing for powerful analyses and
+ transformations to work well together.</p>
+ 
+ <p>This document contains information necessary to successfully implement this
+ interface, use it, and to test both sides.  It also explains some of the finer
+ points about what exactly results mean.  If you feel that something is unclear
+ or should be added, please <a href="mailto:sabre at nondot.org">let me
+ know</a>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="overview"><tt>AliasAnalysis</tt> Class Overview</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The <a
+ href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html"><tt>AliasAnalysis</tt></a>
+ class defines the interface that the various alias analysis implementations
+ should support.  This class exports two important enums: <tt>AliasResult</tt>
+ and <tt>ModRefResult</tt> which represent the result of an alias query or a
+ mod/ref query, respectively.</p>
+ 
+ <p>The <tt>AliasAnalysis</tt> interface exposes information about memory,
+ represented in several different ways.  In particular, memory objects are
+ represented as a starting address and size, and function calls are represented
+ as the actual <tt>call</tt> or <tt>invoke</tt> instruction that performs the
+ call.  The <tt>AliasAnalysis</tt> interface also exposes some helper methods
+ which allow you to get mod/ref information for arbitrary instructions.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="pointers">Representation of Pointers</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Most importantly, the <tt>AliasAnalysis</tt> class provides several methods
+ which are used to query whether or not two memory objects alias, whether
+ function calls can modify or read a memory object, etc.  For all of these
+ queries, memory objects are represented as a pair of their starting address (a
+ symbolic LLVM <tt>Value*</tt>) and a static size.</p>
+ 
+ <p>Representing memory objects as a starting address and a size is critically
+ important for correct Alias Analyses.  For example, consider this (silly, but
+ possible) C code:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ int i;
+ char C[2];
+ char A[10]; 
+ /* ... */
+ for (i = 0; i != 10; ++i) {
+   C[0] = A[i];          /* One byte store */
+   C[1] = A[9-i];        /* One byte store */
+ }
+ </pre>
+ </div>
+ 
+ <p>In this case, the <tt>basicaa</tt> pass will disambiguate the stores to
+ <tt>C[0]</tt> and <tt>C[1]</tt> because they are accesses to two distinct
+ locations one byte apart, and the accesses are each one byte.  The LICM pass
+ can therefore use store motion to remove the stores from the loop.  In
+ contrast, consider the following code:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ int i;
+ char C[2];
+ char A[10]; 
+ /* ... */
+ for (i = 0; i != 10; ++i) {
+   ((short*)C)[0] = A[i];  /* Two byte store! */
+   C[1] = A[9-i];          /* One byte store */
+ }
+ </pre>
+ </div>
+ 
+ <p>In this case, the two stores to C do alias each other, because the access to
+ the <tt>&C[0]</tt> element is a two byte access.  If size information wasn't
+ available in the query, even the first case would have to conservatively assume
+ that the accesses alias.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="alias">The <tt>alias</tt> method</a>
+ </div>
+   
+ <div class="doc_text">
+ The <tt>alias</tt> method is the primary interface used to determine whether or
+ not two memory objects alias each other.  It takes two memory objects as input
+ and returns MustAlias, MayAlias, or NoAlias as appropriate.
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="MustMayNo">Must, May, and No Alias Responses</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>An Alias Analysis implementation can return one of three responses:
+ MustAlias, MayAlias, and NoAlias.  The No and May alias results are obvious: if
+ the two pointers can never equal each other, return NoAlias; if they might,
+ return MayAlias.</p>
+ 
+ <p>The MustAlias response is trickier though.  In LLVM, the Must Alias response
+ may only be returned if the two memory objects are guaranteed to always start at
+ exactly the same location.  If two memory objects overlap, but do not start at
+ the same location, return MayAlias.</p>
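+ 
+ <p>As a rough illustration of these rules, the sketch below classifies two
+ accesses into the same base object from their byte offsets and static sizes.
+ It is a standalone toy, not LLVM code: the <tt>Access</tt> struct, the
+ <tt>classify</tt> function, and the local <tt>AliasResult</tt> enum are
+ hypothetical names introduced only for this example.</p>

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the alias result enum; not LLVM's declaration.
enum AliasResult { NoAlias, MayAlias, MustAlias };

// A memory object inside one known base object: a constant byte offset from
// the start of the base plus a static size in bytes.
struct Access {
  std::uint64_t Offset;
  std::uint64_t Size;
};

// Classify two accesses into the same base object.
AliasResult classify(Access A, Access B) {
  assert(A.Size != 0 && B.Size != 0 && "sizes must be known and non-zero");
  if (A.Offset == B.Offset)
    return MustAlias;  // both objects start at exactly the same location
  bool Disjoint =
      A.Offset + A.Size <= B.Offset || B.Offset + B.Size <= A.Offset;
  // Overlapping objects with different starting points are only "May".
  return Disjoint ? NoAlias : MayAlias;
}
```

+ <p>With this toy, the one byte stores to <tt>C[0]</tt> and <tt>C[1]</tt> from
+ the earlier example come out as NoAlias, while a two byte store at offset 0
+ against a one byte store at offset 1 comes out as MayAlias.</p>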
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ModRefInfo">The <tt>getModRefInfo</tt> methods</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>getModRefInfo</tt> methods return information about whether the
+ execution of an instruction can read or modify a memory location.  Mod/Ref
+ information is always conservative: if an instruction <b>might</b> read or write
+ a location, ModRef is returned.</p>
+ 
+ <p>The <tt>AliasAnalysis</tt> class also provides a <tt>getModRefInfo</tt>
+ method for testing dependencies between function calls.  This method takes two
+ call sites (CS1 and CS2) and returns NoModRef if the two calls refer to disjoint
+ memory locations, Ref if CS1 reads memory written by CS2, Mod if CS1 writes to
+ memory read or written by CS2, or ModRef if CS1 might read or write memory
+ accessed by CS2.  Note that this relation is not commutative.  Clients that use
+ this method should first check the <tt>hasNoModRefInfoForCalls()</tt>
+ method, which indicates whether the analysis lacks mod/ref information for
+ function call pairs (most analyses do).  If it returns true, the analysis has
+ nothing to offer for such pairs, so the client shouldn't waste analysis time
+ querying the <tt>getModRefInfo</tt> method many times.</p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="OtherItfs">Other useful <tt>AliasAnalysis</tt> methods</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ Several other tidbits of information are often collected by various alias
+ analysis implementations and can be put to good use by various clients.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   The <tt>getMustAliases</tt> method
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>getMustAliases</tt> method returns all values that are known to
+ must-alias a pointer.  This information can be provided in some cases for
+ important objects like the null pointer and global values.  Knowing that a
+ pointer always points to a particular function allows indirect calls to be
+ turned into direct calls, for example.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   The <tt>pointsToConstantMemory</tt> method
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>pointsToConstantMemory</tt> method returns true if and only if the
+ analysis can prove that the pointer only points to unchanging memory locations
+ (functions, constant global variables, and the null pointer).  This information
+ can be used to refine mod/ref information: it is impossible for an unchanging
+ memory location to be modified.</p>
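+ 
+ <p>A minimal sketch of that refinement follows.  It assumes a bitmask
+ encoding in which Ref and Mod are independent bits; the enum values and the
+ <tt>refineForConstantMemory</tt> helper are assumptions made for this
+ example, not declarations taken from the LLVM headers.</p>

```cpp
#include <cassert>

// Assumed bit encoding for illustration: Ref = "may read", Mod = "may write".
enum ModRefResult { NoModRef = 0, Ref = 1, Mod = 2, ModRef = 3 };

// Hypothetical helper: given a conservative mod/ref answer for a location,
// drop the Mod bit when the location is known to be constant memory, since
// unchanging memory cannot be modified.
ModRefResult refineForConstantMemory(ModRefResult Conservative,
                                     bool PointsToConstantMemory) {
  if (!PointsToConstantMemory)
    return Conservative;  // nothing is known, keep the conservative answer
  return static_cast<ModRefResult>(Conservative & ~Mod);
}
```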
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="simplemodref">The <tt>doesNotAccessMemory</tt> and
+   <tt>onlyReadsMemory</tt> methods</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>These methods are used to provide very simple mod/ref information for
+ function calls.  The <tt>doesNotAccessMemory</tt> method returns true for a
+ function if the analysis can prove that the function never reads or writes to
+ memory, or if the function only reads from constant memory.  Functions with this
+ property are side-effect free and only depend on their input arguments, allowing
+ them to be eliminated if they form common subexpressions or be hoisted out of
+ loops.  Many common functions behave this way (e.g., <tt>sin</tt> and
+ <tt>cos</tt>) but many others do not (e.g., <tt>acos</tt>, which modifies the
+ <tt>errno</tt> variable).</p>
+ 
+ <p>The <tt>onlyReadsMemory</tt> method returns true for a function if analysis
+ can prove that (at most) the function only reads from non-volatile memory.
+ Functions with this property are side-effect free, only depending on their input
+ arguments and the state of memory when they are called.  This property allows
+ calls to these functions to be eliminated and moved around, as long as there is
+ no store instruction that changes the contents of memory.  Note that all
+ functions that satisfy the <tt>doesNotAccessMemory</tt> method also satisfy
+ <tt>onlyReadsMemory</tt>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="writingnew">Writing a new <tt>AliasAnalysis</tt> Implementation</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Writing a new alias analysis implementation for LLVM is quite
+ straight-forward.  There are already several implementations that you can use
+ for examples, and the following information should help fill in any details.
+ For examples, take a look at the <a href="#impls">various alias analysis
+ implementations</a> included with LLVM.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="passsubclasses">Different Pass styles</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The first step is to determine what type of <a
+ href="WritingAnLLVMPass.html">LLVM pass</a> you need to use for your Alias
+ Analysis.  As is the case with most other analyses and transformations, the
+ answer should be fairly obvious from what type of problem you are trying to
+ solve:</p>
+ 
+ <ol>
+   <li>If you require interprocedural analysis, it should be a
+       <tt>Pass</tt>.</li>
+   <li>If your analysis is function-local, subclass <tt>FunctionPass</tt>.</li>
+   <li>If you don't need to look at the program at all, subclass 
+       <tt>ImmutablePass</tt>.</li>
+ </ol>
+ 
+ <p>In addition to the pass that you subclass, you should also inherit from the
+ <tt>AliasAnalysis</tt> interface, of course, and use the
+ <tt>RegisterAnalysisGroup</tt> template to register as an implementation of
+ <tt>AliasAnalysis</tt>.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="requiredcalls">Required initialization calls</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Your subclass of <tt>AliasAnalysis</tt> is required to invoke two methods on
+ the <tt>AliasAnalysis</tt> base class: <tt>getAnalysisUsage</tt> and
+ <tt>InitializeAliasAnalysis</tt>.  In particular, your implementation of
+ <tt>getAnalysisUsage</tt> should explicitly call into the
+ <tt>AliasAnalysis::getAnalysisUsage</tt> method in addition to
+ declaring any pass dependencies your pass has.  Thus you should have something
+ like this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ void getAnalysisUsage(AnalysisUsage &AU) const {
+   AliasAnalysis::getAnalysisUsage(AU);
+   <i>// declare your dependencies here.</i>
+ }
+ </pre>
+ </div>
+ 
+ <p>Additionally, you must invoke the <tt>InitializeAliasAnalysis</tt> method
+ from your analysis run method (<tt>run</tt> for a <tt>Pass</tt>,
+ <tt>runOnFunction</tt> for a <tt>FunctionPass</tt>, or <tt>initializePass</tt>
+ for an <tt>ImmutablePass</tt>).  For example (as part of a <tt>Pass</tt>):</p>
+ 
+ <div class="doc_code">
+ <pre>
+ bool run(Module &M) {
+   InitializeAliasAnalysis(this);
+   <i>// Perform analysis here...</i>
+   return false;
+ }
+ </pre>
+ </div>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="interfaces">Interfaces which may be specified</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>All of the <a
+ href="/doxygen/classllvm_1_1AliasAnalysis.html"><tt>AliasAnalysis</tt></a>
+ virtual methods default to providing <a href="#chaining">chaining</a> to another
+ alias analysis implementation, which ends up returning conservatively correct
+ information (returning "May" Alias and "Mod/Ref" for alias and mod/ref queries
+ respectively).  Depending on the capabilities of the analysis you are
+ implementing, you just override the interfaces you can improve.</p>
+ 
+ </div>
+ 
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="chaining"><tt>AliasAnalysis</tt> chaining behavior</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>With only two special exceptions (the <tt><a
+ href="#basic-aa">basicaa</a></tt> and <a href="#no-aa"><tt>no-aa</tt></a>
+ passes) every alias analysis pass chains to another alias analysis
+ implementation (for example, the user can specify "<tt>-basicaa -ds-aa
+ -anders-aa -licm</tt>" to get the maximum benefit from the three alias
+ analyses).  The alias analysis class automatically takes care of most of this
+ for methods that you don't override.  For methods that you do override, in code
+ paths that return a conservative MayAlias or Mod/Ref result, simply return
+ whatever the superclass computes.  For example:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ AliasAnalysis::AliasResult alias(const Value *V1, unsigned V1Size,
+                                  const Value *V2, unsigned V2Size) {
+   if (...)
+     return NoAlias;
+   ...
+ 
+   <i>// Couldn't determine a must or no-alias result.</i>
+   return AliasAnalysis::alias(V1, V1Size, V2, V2Size);
+ }
+ </pre>
+ </div>
+ 
+ <p>In addition to analysis queries, you must make sure to unconditionally pass
+ LLVM <a href="#updating">update notification</a> methods to the superclass as
+ well if you override them, which allows all alias analyses in a chain to be
+ updated.</p>
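+ 
+ <p>To make the chaining idea concrete outside of LLVM, here is a small
+ standalone model.  The <tt>ToyAliasAnalysis</tt>, <tt>SamePointerAA</tt>, and
+ <tt>NullPointerAA</tt> classes are hypothetical and use raw addresses instead
+ of LLVM <tt>Value*</tt>s; they only illustrate the pattern of answering
+ precisely when possible and otherwise deferring to the next analysis.</p>

```cpp
#include <cassert>

enum AliasResult { NoAlias, MayAlias, MustAlias };

// Hypothetical, simplified model of a chained analysis.
class ToyAliasAnalysis {
  ToyAliasAnalysis *Next;  // next analysis in the chain, or null at the end
public:
  explicit ToyAliasAnalysis(ToyAliasAnalysis *N = nullptr) : Next(N) {}
  virtual ~ToyAliasAnalysis() {}

  // Default behavior: defer to the next analysis, or answer conservatively
  // when the end of the chain is reached.
  virtual AliasResult alias(const void *P1, const void *P2) {
    return Next ? Next->alias(P1, P2) : MayAlias;
  }
};

// Knows one fact: a pointer must-alias itself.
class SamePointerAA : public ToyAliasAnalysis {
public:
  explicit SamePointerAA(ToyAliasAnalysis *N = nullptr)
      : ToyAliasAnalysis(N) {}
  AliasResult alias(const void *P1, const void *P2) override {
    if (P1 == P2)
      return MustAlias;
    return ToyAliasAnalysis::alias(P1, P2);  // no precise answer: chain
  }
};

// Knows one fact: a non-null pointer never aliases the null pointer.
class NullPointerAA : public ToyAliasAnalysis {
public:
  explicit NullPointerAA(ToyAliasAnalysis *N = nullptr)
      : ToyAliasAnalysis(N) {}
  AliasResult alias(const void *P1, const void *P2) override {
    if ((P1 == nullptr) != (P2 == nullptr))
      return NoAlias;
    return ToyAliasAnalysis::alias(P1, P2);  // no precise answer: chain
  }
};
```

+ <p>Building a chain such as <tt>SamePointerAA Head(&amp;Tail)</tt> on top of a
+ <tt>NullPointerAA Tail</tt> lets a single query benefit from both facts, and
+ queries that neither can answer fall through to MayAlias.</p>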
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="updating">Updating analysis results for transformations</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ Alias analysis information is initially computed for a static snapshot of the
+ program, but clients will use this information to make transformations to the
+ code.  All but the most trivial forms of alias analysis will need to have their
+ analysis results updated to reflect the changes made by these transformations.
+ </p>
+ 
+ <p>
+ The <tt>AliasAnalysis</tt> interface exposes two methods which are used to
+ communicate program changes from the clients to the analysis implementations.
+ Various alias analysis implementations should use these methods to ensure that
+ their internal data structures are kept up-to-date as the program changes (for
+ example, when an instruction is deleted), and clients of alias analysis must be
+ sure to call these interfaces appropriately.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">The <tt>deleteValue</tt> method</div>
+ 
+ <div class="doc_text">
+ The <tt>deleteValue</tt> method is called by transformations when they remove an
+ instruction or any other value from the program (including values that do not
+ use pointers).  Typically alias analyses keep data structures that have entries
+ for each value in the program.  When this method is called, they should remove
+ any entries for the specified value, if they exist.
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">The <tt>copyValue</tt> method</div>
+ 
+ <div class="doc_text">
+ The <tt>copyValue</tt> method is used when a new value is introduced into the
+ program.  There is no way to introduce a value into the program that did not
+ exist before (this doesn't make sense for a safe compiler transformation), so
+ this is the only way to introduce a new value.  This method indicates that the
+ new value has exactly the same properties as the value being copied.
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">The <tt>replaceWithNewValue</tt> method</div>
+ 
+ <div class="doc_text">
+ This method is a simple helper that is provided to make clients easier to
+ write.  It is implemented by copying the old analysis information to the new
+ value, then deleting the old value.  This method cannot be overridden by alias
+ analysis implementations.
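+ 
+ <p>The relationship between the three update methods can be sketched with a
+ toy per-value table.  The <tt>ToyAnalysisState</tt> class below is
+ hypothetical: its keys stand in for LLVM <tt>Value*</tt>s and its integer
+ "facts" stand in for whatever an implementation records per value.</p>

```cpp
#include <cassert>
#include <map>

// Hypothetical per-value fact store; not an LLVM class.
class ToyAnalysisState {
  std::map<const void *, int> Facts;  // value -> analysis-specific fact
public:
  void setFact(const void *V, int Fact) { Facts[V] = Fact; }
  bool hasFact(const void *V) const { return Facts.count(V) != 0; }
  int getFact(const void *V) const { return Facts.at(V); }

  // Remove any entry for a value that is leaving the program.
  void deleteValue(const void *V) { Facts.erase(V); }

  // The new value has exactly the same properties as the one being copied.
  void copyValue(const void *From, const void *To) {
    auto I = Facts.find(From);
    if (I != Facts.end())
      Facts[To] = I->second;
  }

  // Helper built from the two primitives: copy, then delete the old value.
  void replaceWithNewValue(const void *Old, const void *New) {
    copyValue(Old, New);
    deleteValue(Old);
  }
};
```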
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="implefficiency">Efficiency Issues</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>From the LLVM perspective, the only thing you need to do to provide an
+ efficient alias analysis is to make sure that alias analysis <b>queries</b> are
+ serviced quickly.  The actual calculation of the alias analysis results (the
+ "run" method) is only performed once, but many (perhaps duplicate) queries may
+ be performed.  Because of this, try to move as much computation to the run
+ method as possible (within reason).</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="using">Using alias analysis results</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>There are several different ways to use alias analysis results.  In order of
+ preference, these are...</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="loadvn">Using the <tt>-load-vn</tt> Pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>load-vn</tt> pass uses alias analysis to provide value numbering
+ information for <tt>load</tt> instructions and pointer values.  If your analysis
+ or transformation can be modeled in a form that uses value numbering
+ information, you don't have to do anything special to handle load instructions:
+ just use the <tt>load-vn</tt> pass, which uses alias analysis.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ast">Using the <tt>AliasSetTracker</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Many transformations need information about alias <b>sets</b> that are active
+ in some scope, rather than information about pairwise aliasing.  The <tt><a
+ href="/doxygen/classllvm_1_1AliasSetTracker.html">AliasSetTracker</a></tt> class
+ is used to efficiently build these Alias Sets from the pairwise alias analysis
+ information provided by the <tt>AliasAnalysis</tt> interface.</p>
+ 
+ <p>First you initialize the AliasSetTracker by using the "<tt>add</tt>" methods
+ to add information about various potentially aliasing instructions in the scope
+ you are interested in.  Once all of the alias sets are completed, your pass
+ should simply iterate through the constructed alias sets, using the
+ <tt>AliasSetTracker</tt> <tt>begin()</tt>/<tt>end()</tt> methods.</p>
+ 
+ <p>The <tt>AliasSet</tt>s formed by the <tt>AliasSetTracker</tt> are guaranteed
+ to be disjoint, calculate mod/ref information and volatility for the set, and
+ keep track of whether or not all of the pointers in the set are Must aliases.
+ The AliasSetTracker also makes sure that sets are properly folded due to call
+ instructions, and can provide a list of pointers in each set.</p>
+ 
+ <p>As an example user of this, the <a href="/doxygen/structLICM.html">Loop
+ Invariant Code Motion</a> pass uses <tt>AliasSetTracker</tt>s to calculate alias
+ sets for each loop nest.  If an <tt>AliasSet</tt> in a loop is not modified,
+ then all load instructions from that set may be hoisted out of the loop.  If any
+ alias sets are stored to <b>and</b> are must alias sets, then the stores may be
+ sunk to outside of the loop, promoting the memory location to a register for the
+ duration of the loop nest.  Both of these transformations only apply if the
+ pointer argument is loop-invariant.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   The AliasSetTracker implementation
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The AliasSetTracker class is implemented to be as efficient as possible.  It
+ uses the union-find algorithm to efficiently merge AliasSets when a pointer is
+ inserted into the AliasSetTracker that aliases multiple sets.  The primary data
+ structure is a hash table mapping pointers to the AliasSet they are in.</p>
+ 
+ <p>The AliasSetTracker class must maintain a list of all of the LLVM Value*'s
+ that are in each AliasSet.  Since the hash table already has entries for each
+ LLVM Value* of interest, the AliasSets thread the linked list through these
+ hash-table nodes to avoid having to allocate memory unnecessarily, and to make
+ merging alias sets extremely efficient (the linked list merge is constant time).
+ </p>
+ 
+ <p>You shouldn't need to understand these details if you are just a client of
+ the AliasSetTracker, but if you look at the code, hopefully this brief
+ description will help make sense of why things are designed the way they
+ are.</p>
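+ 
+ <p>For readers unfamiliar with union-find, the following standalone sketch
+ shows the general idea of merging sets through a hash table of parent links.
+ It is not the <tt>AliasSetTracker</tt> data structure: the
+ <tt>ToyAliasSets</tt> class and its method names are hypothetical, and it
+ omits the threaded linked list and the mod/ref bookkeeping described
+ above.</p>

```cpp
#include <cassert>
#include <unordered_map>

// Hypothetical union-find over raw pointers; not LLVM code.
class ToyAliasSets {
  // Maps each pointer to its parent; a set's representative maps to itself.
  std::unordered_map<const void *, const void *> Parent;

public:
  // Return the representative of P's set, creating a singleton set if P has
  // not been seen before.
  const void *find(const void *P) {
    auto It = Parent.find(P);
    if (It == Parent.end()) {
      Parent[P] = P;
      return P;
    }
    if (It->second == P)
      return P;
    const void *Root = find(It->second);
    Parent[P] = Root;  // path compression keeps later lookups short
    return Root;
  }

  // Merge the sets containing A and B, e.g. when a newly inserted pointer
  // turns out to alias members of both.
  void merge(const void *A, const void *B) {
    const void *RA = find(A);
    const void *RB = find(B);
    if (RA != RB)
      Parent[RA] = RB;
  }

  bool sameSet(const void *A, const void *B) { return find(A) == find(B); }
};
```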
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="direct">Using the <tt>AliasAnalysis</tt> interface directly</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If neither of these utilities is what your pass needs, you should use
+ the interfaces exposed by the <tt>AliasAnalysis</tt> class directly.  Try to use
+ the higher-level methods when possible (e.g., use mod/ref information instead of
+ the <a href="#alias"><tt>alias</tt></a> method directly if possible) to get the
+ best precision and efficiency.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="exist">Existing alias analysis implementations and clients</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>If you're going to be working with the LLVM alias analysis infrastructure,
+ you should know what clients and implementations of alias analysis are
+ available.  In particular, if you are implementing an alias analysis, you should
+ be aware of <a href="#aliasanalysis-debug">the clients</a> that are useful
+ for monitoring and evaluating different implementations.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="impls">Available <tt>AliasAnalysis</tt> implementations</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>This section lists the various implementations of the <tt>AliasAnalysis</tt>
+ interface.  With the exception of the <a href="#no-aa"><tt>-no-aa</tt></a> and
+ <a href="#basic-aa"><tt>-basicaa</tt></a> implementations, all of these <a
+ href="#chaining">chain</a> to other alias analysis implementations.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="no-aa">The <tt>-no-aa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-no-aa</tt> pass is just what it sounds like: an alias analysis that
+ never returns any useful information.  This pass can be useful if you think that
+ alias analysis is doing something wrong and are trying to narrow down a
+ problem.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="basic-aa">The <tt>-basicaa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-basicaa</tt> pass is the default LLVM alias analysis.  It is an
+ aggressive local analysis that "knows" many important facts:</p>
+ 
+ <ul>
+ <li>Distinct globals, stack allocations, and heap allocations can never
+     alias.</li>
+ <li>Globals, stack allocations, and heap allocations never alias the null
+     pointer.</li>
+ <li>Different fields of a structure do not alias.</li>
+ <li>Indexes into arrays with statically differing subscripts cannot alias.</li>
+ <li>Many common standard C library functions <a
+     href="#simplemodref">never access memory or only read memory</a>.</li>
+ <li>Pointers that obviously point to constant globals
+     "<tt>pointToConstantMemory</tt>".</li>
+ <li>Function calls cannot modify or reference stack allocations if they never
+     escape from the function that allocates them (a common case for automatic
+     arrays).</li>
+ </ul>
+ 
+ </div>
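+ 
+ <div class="doc_text">
+ 
+ <p>The rules about distinct objects and statically differing subscripts can be
+ pictured with a small toy model.  The sketch below is not the
+ <tt>-basicaa</tt> implementation: it assumes that every pointer is described
+ by a hypothetical <tt>(base, offset)</tt> pair, that each base names a distinct
+ global, stack, or heap allocation, and that all accesses have the same size.
+ The function name <tt>toy_alias</tt> is made up for this illustration.</p>
+ 
+ <div class="doc_code">
+ <pre>
```python
def toy_alias(p, q):
    """Answer 'no', 'may' or 'must' for two (base, offset) pointer pairs.

    An offset of None stands for a subscript that is not known statically.
    """
    base_p, offset_p = p
    base_q, offset_q = q
    if base_p != base_q:
        return "no"    # distinct allocations never overlap
    if offset_p is None or offset_q is None:
        return "may"   # same object, but a subscript is unknown
    if offset_p == offset_q:
        return "must"  # same object and same constant subscript
    return "no"        # same object, statically different subscripts


if __name__ == "__main__":
    print(toy_alias(("A", 0), ("B", 0)))     # different objects
    print(toy_alias(("A", 1), ("A", 2)))     # different constant subscripts
    print(toy_alias(("A", None), ("A", 2)))  # unknown subscript
```
+ </pre>
+ </div>
+ 
+ <p>A conservative answer of "may" is always safe here; the more precise answers
+ are only given when the model can prove them.</p>
+ 
+ </div>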
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="globalsmodref">The <tt>-globalsmodref-aa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>This pass implements a simple context-sensitive mod/ref and alias analysis
+ for internal global variables that don't "have their address taken".  If a
+ global does not have its address taken, the pass knows that no pointers alias
+ the global.  This pass also keeps track of functions that it knows never access
+ memory or never read memory.  This allows certain optimizations (e.g. GCSE) to
+ eliminate call instructions entirely.
+ </p>
+ 
+ <p>The real power of this pass is that it provides context-sensitive mod/ref 
+ information for call instructions.  This allows the optimizer to know that 
+ calls to a function do not clobber or read the value of the global, allowing 
+ loads and stores to be eliminated.</p>
+ 
+ <p>Note that this pass is somewhat limited in its scope (it only supports 
+ non-address-taken globals), but it is a very quick analysis.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="anders-aa">The <tt>-anders-aa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-anders-aa</tt> pass implements the well-known "Andersen's algorithm"
+ for interprocedural alias analysis.  This algorithm is a subset-based,
+ flow-insensitive, context-insensitive, and field-insensitive alias analysis that
+ is widely believed to be fairly precise.  Unfortunately, this algorithm is also
+ O(N<sup>3</sup>).  The LLVM implementation currently does not implement any of
+ the refinements (such as "online cycle elimination" or "offline variable
+ substitution") to improve its efficiency, so it can be quite slow in common
+ cases.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="steens-aa">The <tt>-steens-aa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-steens-aa</tt> pass implements a variation on the well-known
+ "Steensgaard's algorithm" for interprocedural alias analysis.  Steensgaard's
+ algorithm is a unification-based, flow-insensitive, context-insensitive, and
+ field-insensitive alias analysis that is also very scalable (effectively linear
+ time).</p>
+ 
+ <p>The LLVM <tt>-steens-aa</tt> pass implements a "speculatively
+ field-<b>sensitive</b>" version of Steensgaard's algorithm using the Data
+ Structure Analysis framework.  This gives it substantially more precision than
+ the standard algorithm while maintaining excellent analysis scalability.</p>
+ 
+ </div>
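+ 
+ <div class="doc_text">
+ 
+ <p>The difference between a subset-based and a unification-based analysis can
+ be seen on a tiny program.  The sketch below is a toy, not the code of the
+ <tt>-anders-aa</tt> or <tt>-steens-aa</tt> passes: it assumes a hypothetical
+ input of only two statement kinds, "take the address of an object"
+ (<tt>p = &amp;a</tt>) and "copy a pointer" (<tt>r = p</tt>), and it ignores
+ loads, stores, fields, and calls.  All function names are made up for the
+ illustration.</p>
+ 
+ <div class="doc_code">
+ <pre>
```python
def andersen(address_of, copies):
    """Subset-based: a copy 'dst = src' only adds pts(src) into pts(dst)."""
    pts = {}
    for ptr, obj in address_of:
        pts.setdefault(ptr, set()).add(obj)
    changed = True
    while changed:  # iterate to a fixed point
        changed = False
        for dst, src in copies:
            before = len(pts.setdefault(dst, set()))
            pts[dst] |= pts.get(src, set())
            changed = changed or len(pts[dst]) != before
    return pts


def steensgaard(address_of, copies):
    """Unification-based: a copy 'dst = src' merges what dst and src point to."""
    parent = {}   # union-find forest over variables and placeholder nodes
    pointee = {}  # class representative -> some node in the class it points to

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    def target(var):
        rep = find(var)
        if rep not in pointee:
            fresh = ("placeholder", len(parent))  # unique: parent only grows
            parent[fresh] = fresh
            pointee[rep] = fresh
        return find(pointee[rep])

    def unify(a, b):
        a, b = find(a), find(b)
        if a == b:
            return
        parent[b] = a
        mine, theirs = pointee.get(a), pointee.pop(b, None)
        if mine is None and theirs is not None:
            pointee[a] = theirs
        elif mine is not None and theirs is not None:
            unify(mine, theirs)  # merged classes must agree on their pointee

    for ptr, obj in address_of:
        unify(target(ptr), obj)
    for dst, src in copies:
        unify(target(dst), target(src))

    names = {n for pair in list(address_of) + list(copies) for n in pair}

    def points_to(var):
        rep = find(var)
        if rep not in pointee:
            return set()
        cls = find(pointee[rep])
        return {n for n in names if find(n) == cls}

    return {n: points_to(n) for n in names}


if __name__ == "__main__":
    # p takes the address of a, q takes the address of b, r copies p, then q
    facts = [("p", "a"), ("q", "b")]
    moves = [("r", "p"), ("r", "q")]
    print(andersen(facts, moves))
    print(steensgaard(facts, moves))
```
+ </pre>
+ </div>
+ 
+ <p>On this input the subset-based version keeps <tt>p</tt> and <tt>q</tt>
+ apart, while the unification-based version merges <tt>a</tt> and <tt>b</tt>
+ into one class once <tt>r</tt> has been assigned from both pointers.  That
+ loss of precision is the price paid for the near-linear running time.</p>
+ 
+ </div>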
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ds-aa">The <tt>-ds-aa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-ds-aa</tt> pass implements the full Data Structure Analysis
+ algorithm.  Data Structure Analysis is a modular unification-based,
+ flow-insensitive, context-<b>sensitive</b>, and speculatively
+ field-<b>sensitive</b> alias analysis that is also quite scalable, usually at
+ O(n*log(n)).</p>
+ 
+ <p>This algorithm is capable of responding to a full variety of alias analysis
+ queries, and can provide context-sensitive mod/ref information as well.  The
+ only major facility not implemented so far is support for must-alias
+ information.</p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="aliasanalysis-xforms">Alias analysis driven transformations</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>LLVM includes several alias-analysis driven transformations, which can be
+ used with any of the implementations above.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="adce">The <tt>-adce</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-adce</tt> pass, which implements Aggressive Dead Code Elimination,
+ uses the <tt>AliasAnalysis</tt> interface to delete calls to functions that
+ have no side-effects and whose results are not used.</p>
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="licm">The <tt>-licm</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-licm</tt> pass implements various Loop Invariant Code Motion related
+ transformations.  It uses the <tt>AliasAnalysis</tt> interface for several
+ different transformations:</p>
+ 
+ <ul>
+ <li>It uses mod/ref information to hoist or sink load instructions out of loops
+ if there are no instructions in the loop that modify the memory loaded.</li>
+ 
+ <li>It uses mod/ref information to hoist loop-invariant calls to functions that
+ do not write to memory out of loops.</li>
+ 
+ <li>It uses alias information to promote memory objects that are loaded and
+ stored to in loops to live in a register instead.  It can do this if there are
+ no may aliases to the loaded/stored memory location.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="argpromotion">The <tt>-argpromotion</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ The <tt>-argpromotion</tt> pass promotes by-reference arguments to be passed
+ by value instead.  In particular, if a pointer argument is only loaded from, the
+ pass passes the loaded value to the function instead of the address.  This pass
+ uses alias information to make sure that the value loaded from the argument
+ pointer is not modified between the entry of the function and any load of the
+ pointer.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="gcseloadvn">The <tt>-load-vn</tt> &amp; <tt>-gcse</tt> passes</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-load-vn</tt> pass uses alias analysis to "<a href="#loadvn">value
+ number</a>" loads and pointer values, which is used by the GCSE pass to
+ eliminate instructions.  The <tt>-load-vn</tt> pass relies on alias information
+ and must-alias information.  This combination of passes can make the following
+ transformations:</p>
+ 
+ <ul>
+ <li>Redundant load instructions are eliminated.</li>
+ <li>Load instructions that follow a store to the same location are replaced with
+ the stored value ("store forwarding").</li>
+ <li>Pointer values (e.g. formal arguments) that must-alias simpler expressions
+ (e.g. global variables or the null pointer) are replaced.  Note that this
+ implements transformations like "virtual method resolution", turning indirect
+ calls into direct calls.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="aliasanalysis-debug">Clients for debugging and evaluation of
+   implementations</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>These passes are useful for evaluating the various alias analysis
+ implementations.  You can use them with commands like '<tt>opt -anders-aa -ds-aa
+ -aa-eval foo.bc -disable-output -stats</tt>'.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="print-alias-sets">The <tt>-print-alias-sets</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-print-alias-sets</tt> pass is exposed as part of the
+ <tt>opt</tt> tool to print out the Alias Sets formed by the <a
+ href="#ast"><tt>AliasSetTracker</tt></a> class.  This is useful if you're using
+ the <tt>AliasSetTracker</tt> class.  To use it, use something like:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ % opt -ds-aa -print-alias-sets -disable-output
+ </pre>
+ </div>
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="count-aa">The <tt>-count-aa</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-count-aa</tt> pass is useful to see how many queries a particular
+ pass is making and what responses are returned by the alias analysis.  As an
+ example,</p>
+ 
+ <div class="doc_code">
+ <pre>
+ % opt -basicaa -count-aa -ds-aa -count-aa -licm
+ </pre>
+ </div>
+ 
+ <p>will print out how many queries are made by the <tt>-licm</tt> pass (of the
+ <tt>-ds-aa</tt> pass), along with the responses returned, and how many queries
+ are made of the <tt>-basicaa</tt> pass by the <tt>-ds-aa</tt> pass.  This can be useful
+ when debugging a transformation or an alias analysis implementation.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="aa-eval">The <tt>-aa-eval</tt> pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>-aa-eval</tt> pass simply iterates through all pairs of pointers in a
+ function and asks an alias analysis whether or not the pointers alias.  This
+ gives an indication of the precision of the alias analysis.  Statistics are
+ printed indicating the percent of no/may/must aliases found (a more precise
+ algorithm will have a lower number of may aliases).</p>
+ 
+ </div>
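+ 
+ <div class="doc_text">
+ 
+ <p>The pairwise evaluation described above can be sketched in a few lines.
+ This is only an illustration of the idea, not the <tt>-aa-eval</tt> source: the
+ function <tt>evaluate_alias_analysis</tt>, the string results "no", "may", and
+ "must", and the <tt>toy_oracle</tt> analysis are all assumptions made for the
+ example.</p>
+ 
+ <div class="doc_code">
+ <pre>
```python
from collections import Counter
from itertools import combinations


def evaluate_alias_analysis(pointers, alias):
    """Query every unordered pair of pointers; return result percentages."""
    counts = Counter(alias(p, q) for p, q in combinations(pointers, 2))
    total = sum(counts.values())
    return {kind: (100.0 * counts[kind] / total if total else 0.0)
            for kind in ("no", "may", "must")}


def toy_oracle(p, q):
    """Hypothetical analysis: only names starting with 'g' are known globals."""
    if p.startswith("g") and q.startswith("g"):
        return "must" if p == q else "no"
    return "may"


if __name__ == "__main__":
    pointers = ["g1", "g2", "x", "y"]
    print(evaluate_alias_analysis(pointers, toy_oracle))
    # An analysis that never gives a useful answer reports only "may"
    print(evaluate_alias_analysis(pointers, lambda p, q: "may"))
```
+ </pre>
+ </div>
+ 
+ <p>Comparing the "may" percentage of two analyses on the same set of pointers
+ gives a quick, if coarse, measure of their relative precision.</p>
+ 
+ </div>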
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/Bugpoint.html
diff -c /dev/null llvm-www/releases/1.8/docs/Bugpoint.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/Bugpoint.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,238 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" 
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>LLVM bugpoint tool: design and usage</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   LLVM bugpoint tool: design and usage
+ </div>
+ 
+ <ul>
+   <li><a href="#desc">Description</a></li>
+   <li><a href="#design">Design Philosophy</a>
+   <ul>
+     <li><a href="#autoselect">Automatic Debugger Selection</a></li>
+     <li><a href="#crashdebug">Crash debugger</a></li>
+     <li><a href="#codegendebug">Code generator debugger</a></li>
+     <li><a href="#miscompilationdebug">Miscompilation debugger</a></li>
+   </ul></li>
+   <li><a href="#advice">Advice for using <tt>bugpoint</tt></a></li>
+ </ul>
+ 
+ <div class="doc_author">
+ <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+ <a name="desc">Description</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p><tt>bugpoint</tt> narrows down the source of problems in LLVM tools and
+ passes.  It can be used to debug three types of failures: optimizer crashes,
+ miscompilations by optimizers, or bad native code generation (including problems
+ in the static and JIT compilers).  It aims to reduce large test cases to small,
+ useful ones.  For example, if <tt>gccas</tt> crashes while optimizing a
+ file, it will identify the optimization (or combination of optimizations) that
+ causes the crash, and reduce the file down to a small example which triggers the
+ crash.</p>
+ 
+ <p>For detailed case scenarios, such as debugging <tt>gccas</tt>,
+ <tt>gccld</tt>, or one of the LLVM code generators, see the <a
+ href="HowToSubmitABug.html">How To Submit a Bug Report</a> document.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+ <a name="design">Design Philosophy</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p><tt>bugpoint</tt> is designed to be a useful tool without requiring any
+ hooks into the LLVM infrastructure at all.  It works with any and all LLVM
+ passes and code generators, and does not need to "know" how they work.  Because
+ of this, it may appear to do stupid things or miss obvious
+ simplifications.  <tt>bugpoint</tt> is also designed to trade off programmer
+ time for computer time in the compiler-debugging process; consequently, it may
+ take a long period of (unattended) time to reduce a test case, but we feel it
+ is still worth it. Note that <tt>bugpoint</tt> is generally very quick unless
+ debugging a miscompilation where each test of the program (which requires 
+ executing it) takes a long time.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="autoselect">Automatic Debugger Selection</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>bugpoint</tt> reads each <tt>.bc</tt> or <tt>.ll</tt> file specified on
+ the command line and links them together into a single module, called the test
+ program.  If any LLVM passes are specified on the command line, it runs these
+ passes on the test program.  If any of the passes crash, or if they produce
+ malformed output (which causes the verifier to abort), <tt>bugpoint</tt> starts
+ the <a href="#crashdebug">crash debugger</a>.</p>
+ 
+ <p>Otherwise, if the <tt>-output</tt> option was not specified,
+ <tt>bugpoint</tt> runs the test program with the C backend (which is assumed to
+ generate good code) to generate a reference output.  Once <tt>bugpoint</tt> has
+ a reference output for the test program, it tries executing it with the
+ selected code generator.  If the selected code generator crashes,
+ <tt>bugpoint</tt> starts the <a href="#crashdebug">crash debugger</a> on the
+ code generator.  Otherwise, if the resulting output differs from the reference
+ output, it assumes the difference resulted from a code generator failure, and
+ starts the <a href="#codegendebug">code generator debugger</a>.</p>
+ 
+ <p>Finally, if the output of the selected code generator matches the reference
+ output, <tt>bugpoint</tt> runs the test program after all of the LLVM passes
+ have been applied to it.  If its output differs from the reference output, it
+ assumes the difference resulted from a failure in one of the LLVM passes, and
+ enters the <a href="#miscompilationdebug">miscompilation debugger</a>.
+ Otherwise, there is no problem <tt>bugpoint</tt> can debug.</p>
+ 
+ </div>
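+ 
+ <div class="doc_text">
+ 
+ <p>The order of the checks above can be summarized as a small decision
+ function.  This is a sketch of the selection logic as described in this
+ section, not code taken from <tt>bugpoint</tt>; the function name and all
+ parameter names are hypothetical, and the outputs are assumed to be plain
+ strings that can be compared for equality.</p>
+ 
+ <div class="doc_code">
+ <pre>
```python
def select_debugger(passes_failed, codegen_crashed, codegen_output,
                    optimized_output, reference_output):
    """Return the name of the debugger to start, or None if nothing is wrong."""
    if passes_failed:  # a pass crashed or produced malformed output
        return "crash debugger"
    if codegen_crashed:
        return "crash debugger (code generator)"
    if codegen_output != reference_output:
        return "code generator debugger"
    if optimized_output != reference_output:
        return "miscompilation debugger"
    return None  # nothing that can be debugged


if __name__ == "__main__":
    # The optimized program prints something different from the reference
    print(select_debugger(False, False, "42\n", "41\n", "42\n"))
```
+ </pre>
+ </div>
+ 
+ </div>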
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="crashdebug">Crash debugger</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If an optimizer or code generator crashes, <tt>bugpoint</tt> will try as hard
+ as it can to reduce the list of passes (for optimizer crashes) and the size of
+ the test program.  First, <tt>bugpoint</tt> figures out which combination of
+ optimizer passes triggers the bug. This is useful when debugging a problem
+ exposed by <tt>gccas</tt>, for example, because it runs over 38 passes.</p>
+ 
+ <p>Next, <tt>bugpoint</tt> tries removing functions from the test program, to
+ reduce its size.  Usually it is able to reduce a test program to a single
+ function, when debugging intraprocedural optimizations.  Once the number of
+ functions has been reduced, it attempts to delete various edges in the control
+ flow graph, to reduce the size of the function as much as possible.  Finally,
+ <tt>bugpoint</tt> deletes any individual LLVM instructions whose absence does
+ not eliminate the failure.  At the end, <tt>bugpoint</tt> should tell you what
+ passes crash, give you a bytecode file, and give you instructions on how to
+ reproduce the failure with <tt>opt</tt>, <tt>analyze</tt>, or <tt>llc</tt>.</p>
+ 
+ </div>
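+ 
+ <div class="doc_text">
+ 
+ <p>Reducing a list of passes (or functions, or instructions) while a failure
+ keeps reproducing can be sketched as follows.  This is a generic chunked
+ reduction written for illustration; it is not <tt>bugpoint</tt>'s own
+ algorithm, and the names <tt>reduce_list</tt> and <tt>still_fails</tt> are
+ hypothetical.  The predicate is assumed to be deterministic, and the result is
+ small but not guaranteed to be minimal.</p>
+ 
+ <div class="doc_code">
+ <pre>
```python
def reduce_list(items, still_fails):
    """Drop chunks of items for as long as 'still_fails' reports the failure."""
    items = list(items)
    chunk = len(items) // 2
    while chunk >= 1:
        start = 0
        while len(items) > start:
            candidate = items[:start] + items[start + chunk:]
            if candidate and still_fails(candidate):
                items = candidate  # the chunk was irrelevant; leave it out
            else:
                start += chunk     # the chunk is needed; move past it
        chunk //= 2
    return items


if __name__ == "__main__":
    passes = ["a", "b", "c", "d", "e", "f", "g", "h"]

    def crash(subset):
        # Hypothetical failure that needs both pass "c" and pass "f"
        return "c" in subset and "f" in subset

    print(reduce_list(passes, crash))
```
+ </pre>
+ </div>
+ 
+ <p>Trying large chunks first keeps the number of test runs low when most of the
+ list is irrelevant, which is the common case for long pass lists.</p>
+ 
+ </div>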
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="codegendebug">Code generator debugger</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The code generator debugger attempts to narrow down the amount of code that
+ is being miscompiled by the selected code generator.  To do this, it takes the
+ test program and partitions it into two pieces: one piece which it compiles
+ with the C backend (into a shared object), and one piece which it runs with
+ either the JIT or the static LLC compiler.  It uses several techniques to
+ reduce the amount of code pushed through the LLVM code generator, to reduce the
+ potential scope of the problem.  After it is finished, it emits two bytecode
+ files (called "test" [to be compiled with the code generator] and "safe" [to be
+ compiled with the C backend], respectively), and instructions for reproducing
+ the problem.  The code generator debugger assumes that the C backend produces
+ good code.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="miscompilationdebug">Miscompilation debugger</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The miscompilation debugger works similarly to the code generator debugger.
+ It works by splitting the test program into two pieces, running the
+ optimizations specified on one piece, linking the two pieces back together, and
+ then executing the result.  It attempts to narrow down the list of passes to
+ the one (or few) which are causing the miscompilation, then reduce the portion
+ of the test program which is being miscompiled.  The miscompilation debugger
+ assumes that the selected code generator is working properly.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="advice">Advice for using bugpoint</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p><tt>bugpoint</tt> can be a remarkably useful tool, but it sometimes works in
+ non-obvious ways.  Here are some hints and tips:</p>
+ 
+ <ol>
+ <li>In the code generator and miscompilation debuggers, <tt>bugpoint</tt> only
+     works with programs that have deterministic output.  Thus, if the program
+     outputs <tt>argv[0]</tt>, the date, time, or any other "random" data,
+     <tt>bugpoint</tt> may misinterpret differences in these data, when output,
+     as the result of a miscompilation.  Programs should be temporarily modified
+     to disable outputs that are likely to vary from run to run.
+ 
+ <li>In the code generator and miscompilation debuggers, debugging will go
+     faster if you manually modify the program or its inputs to reduce the
+     runtime while still exhibiting the problem.
+ 
+ <li><tt>bugpoint</tt> is extremely useful when working on a new optimization:
+     it helps track down regressions quickly.  To avoid having to relink
+     <tt>bugpoint</tt> every time you change your optimization, however, have
+     <tt>bugpoint</tt> dynamically load your optimization with the
+     <tt>-load</tt> option.
+ 
+ <li><p><tt>bugpoint</tt> can generate a lot of output and run for a long period
+     of time.  It is often useful to capture the output of the program to a file.
+     For example, in the C shell, you can run:</p>
+ 
+ <div class="doc_code">
+ <p><tt>bugpoint  ... |&amp; tee bugpoint.log</tt></p>
+ </div>
+ 
+     <p>to get a copy of <tt>bugpoint</tt>'s output in the file
+     <tt>bugpoint.log</tt>, as well as on your terminal.</p>
+ 
+ <li><tt>bugpoint</tt> cannot debug problems with the LLVM linker. If
+     <tt>bugpoint</tt> crashes before you see its "All input ok" message,
+     you might try <tt>llvm-link -v</tt> on the same set of input files. If
+     that also crashes, you may be experiencing a linker bug.
+ 
+ <li>If your program is <b>supposed</b> to crash, <tt>bugpoint</tt> will be
+     confused. One way to deal with this is to cause bugpoint to ignore the exit
+     code from your program, by giving it the <tt>-check-exit-code=false</tt>
+     option.
+     
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/BytecodeFormat.html
diff -c /dev/null llvm-www/releases/1.8/docs/BytecodeFormat.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/BytecodeFormat.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,2154 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>LLVM Bytecode File Format</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+   <style type="text/css">
+     TR, TD { border: 2px solid gray; padding-left: 4pt; padding-right: 4pt; 
+              padding-top: 2pt; padding-bottom: 2pt; }
+     TH { border: 2px solid gray; font-weight: bold; font-size: 105%; }
+     TABLE { text-align: center; border: 2px solid black; 
+       border-collapse: collapse; margin-top: 1em; margin-left: 1em; 
+       margin-right: 1em; margin-bottom: 1em; }
+     .td_left { border: 2px solid gray; text-align: left; }
+   </style>
+ </head>
+ <body>
+ <div class="doc_title"> LLVM Bytecode File Format </div>
+ <ol>
+   <li><a href="#abstract">Abstract</a></li>
+   <li><a href="#concepts">Concepts</a>
+     <ol>
+       <li><a href="#blocks">Blocks</a></li>
+       <li><a href="#lists">Lists</a></li>
+       <li><a href="#fields">Fields</a></li>
+       <li><a href="#align">Alignment</a></li>
+       <li><a href="#vbr">Variable Bit-Rate Encoding</a></li>
+       <li><a href="#encoding">Encoding Primitives</a></li>
+       <li><a href="#slots">Slots</a></li>
+     </ol>
+   </li>
+   <li><a href="#general">General Structure</a> </li>
+   <li><a href="#blockdefs">Block Definitions</a>
+     <ol>
+       <li><a href="#signature">Signature Block</a></li>
+       <li><a href="#module">Module Block</a></li>
+       <li><a href="#globaltypes">Global Type Pool</a></li>
+       <li><a href="#globalinfo">Module Info Block</a></li>
+       <li><a href="#constantpool">Global Constant Pool</a></li>
+       <li><a href="#functiondefs">Function Definition</a></li>
+       <li><a href="#compactiontable">Compaction Table</a></li>
+       <li><a href="#instructionlist">Instructions List</a></li>
+       <li><a href="#instructions">Instructions</a></li>
+       <li><a href="#symtab">Symbol Table</a></li>
+     </ol>
+   </li>
+   <li><a href="#versiondiffs">Version Differences</a>
+     <ol>
+       <li><a href="#vers13">Version 1.3 Differences From 1.4</a></li>
+       <li><a href="#vers12">Version 1.2 Differences From 1.3</a></li>
+       <li><a href="#vers11">Version 1.1 Differences From 1.2</a></li>
+       <li><a href="#vers10">Version 1.0 Differences From 1.1</a></li>
+     </ol>
+   </li>
+ </ol>
+ <div class="doc_author">
+ <p>Written by <a href="mailto:rspencer at x10sys.com">Reid Spencer</a>
+ </p>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="abstract">Abstract </a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+ <p>This document describes the LLVM bytecode file format. It specifies
+ the binary encoding rules of the bytecode file format so that
+ equivalent systems can encode bytecode files correctly. The LLVM
+ bytecode representation is used to store the intermediate
+ representation on disk in compacted form.</p>
+ <p>The LLVM bytecode format may change in the future, but LLVM will
+ always be backwards compatible with older formats. This document will
+ only describe the most current version of the bytecode format. See <a
+  href="#versiondiffs">Version Differences</a> for the details on how
+ the current version is different from previous versions.</p>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="concepts">Concepts</a> </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+ <p>This section describes the general concepts of the bytecode file
+ format without getting into specific layout details. It is recommended
+ that you read this section thoroughly before interpreting the detailed
+ descriptions.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="blocks">Blocks</a> </div>
+ <div class="doc_text">
+ <p>LLVM bytecode files consist simply of a sequence of blocks of bytes
+ using a binary encoding. Each block begins with a header of two
+ unsigned integers. The first value identifies the type of block and the
+ second value provides the size of the block in bytes. The block
+ identifier is used because it is possible for entire blocks to be
+ omitted from the file if they are empty. The block identifier helps the
+ reader determine which kind of block is next in the file. Note that
+ blocks can be nested within other blocks.</p>
+ <p> All blocks are variable length, and the block header specifies the
+ size of the block. All blocks begin on a byte index that is aligned to
+ an even 32-bit boundary. That is, the first block is 32-bit aligned
+ because it starts at offset 0. Each block is padded with zero fill
+ bytes to ensure that the next block also starts on a 32-bit boundary.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="lists">Lists</a> </div>
+ <div class="doc_text">
+ <p>LLVM Bytecode blocks often contain lists of things of a similar
+ type. For example, a function contains a list of instructions and a
+ function type contains a list of argument types. There are two basic
+ types of lists: length lists (<a href="#llist">llist</a>), and null
+ terminated lists (<a href="#zlist">zlist</a>), as described below in
+ the <a href="#encoding">Encoding Primitives</a>.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="fields">Fields</a> </div>
+ <div class="doc_text">
+ <p>Fields are units of information that LLVM knows how to write atomically. Most 
+ fields have a uniform length or some kind of length indication built into their 
+ encoding. For example, a constant string (array of bytes) is written simply as 
+ the length followed by the characters. Although this is similar to a list, 
+ constant strings are treated atomically and are thus fields.</p>
+ <p>Fields use a condensed bit format specific to the type of information
+ they must contain. As few bits as possible are written for each field. The
+ sections that follow will provide the details on how these fields are
+ written and how the bits are to be interpreted.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="align">Alignment</a> </div>
+ <div class="doc_text">
+   <p>To support cross-platform differences, the bytecode file is aligned on 
+   certain boundaries. This means that a small amount of padding (at most 3 
+   bytes) will be added to ensure that the next entry is aligned to a 32-bit 
+   boundary.</p>
+ </div>
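+ <div class="doc_text">
+ <p>The amount of zero fill needed to reach the next 32-bit boundary follows
+ directly from the current byte offset. The sketch below shows the arithmetic
+ only; the helper names <tt>padding_bytes</tt> and <tt>pad_block</tt> are
+ hypothetical, and the payload is assumed to start on an aligned offset.</p>
+ <div class="doc_code">
+ <pre>
```python
def padding_bytes(offset, alignment=4):
    """Number of zero fill bytes needed to reach the next aligned offset."""
    return (alignment - offset % alignment) % alignment


def pad_block(payload, alignment=4):
    """Return the payload followed by zero fill up to the alignment boundary."""
    return payload + bytes(padding_bytes(len(payload), alignment))


if __name__ == "__main__":
    for offset in range(9):
        print(offset, padding_bytes(offset))
    print(pad_block(b"abcde"))
```
+ </pre>
+ </div>
+ <p>With a four-byte alignment the result is never larger than three, which
+ matches the bound stated above.</p>
+ </div>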
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="vbr">Variable Bit-Rate Encoding</a>
+ </div>
+ <div class="doc_text">
+ <p>Most of the values written to LLVM bytecode files are small integers. To 
+ minimize the number of bytes written for these quantities, an encoding scheme 
+ similar to UTF-8 is used to write integer data. The scheme is known as
+ variable bit rate (vbr) encoding. In this encoding, the high bit of
+ each byte is used to indicate if more bytes follow. If (byte &amp;
+ 0x80) is non-zero in any given byte, it means there is another byte
+ immediately following that also contributes to the value. For the final
+ byte, (byte &amp; 0x80) is zero (the high bit is not set). In each byte
+ only the low seven bits contribute to the value. Consequently 32-bit
+ quantities can take from one to <em>five</em> bytes to encode. In
+ general, smaller quantities will encode in fewer bytes, as follows:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th>Byte #</th>
+       <th>Significant Bits</th>
+       <th>Maximum Value</th>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>0-6</td>
+       <td>127</td>
+     </tr>
+     <tr>
+       <td>2</td>
+       <td>7-13</td>
+       <td>16,383</td>
+     </tr>
+     <tr>
+       <td>3</td>
+       <td>14-20</td>
+       <td>2,097,151</td>
+     </tr>
+     <tr>
+       <td>4</td>
+       <td>21-27</td>
+       <td>268,435,455</td>
+     </tr>
+     <tr>
+       <td>5</td>
+       <td>28-34</td>
+       <td>34,359,738,367</td>
+     </tr>
+     <tr>
+       <td>6</td>
+       <td>35-41</td>
+       <td>4,398,046,511,103</td>
+     </tr>
+     <tr>
+       <td>7</td>
+       <td>42-48</td>
+       <td>562,949,953,421,311</td>
+     </tr>
+     <tr>
+       <td>8</td>
+       <td>49-55</td>
+       <td>72,057,594,037,927,935</td>
+     </tr>
+     <tr>
+       <td>9</td>
+       <td>56-62</td>
+       <td>9,223,372,036,854,775,807</td>
+     </tr>
+     <tr>
+       <td>10</td>
+       <td>63-69</td>
+       <td>1,180,591,620,717,411,303,423</td>
+     </tr>
+   </tbody>
+ </table>
+ <p>Note that in practice, the tenth byte could only encode bit 63 since
+ the maximum quantity to use this encoding is a 64-bit integer.</p>
+ <p><em>Signed</em> VBR values are encoded with the standard vbr
+ encoding, but with the sign bit as the low order bit instead of the
+ high order bit. This allows small negative quantities to be encoded
+ efficiently. For example, -3
+ is encoded as "((3 &lt;&lt; 1) | 1)" and 3 is encoded as "((3 &lt;&lt;
+ 1) | 0)", emitted with the standard vbr encoding above.</p>
+ </div>
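+ <div class="doc_text">
+ <p>The encoding rules above can be sketched as a pair of small routines. This
+ is an illustration written from the description in this section, not the LLVM
+ reader or writer: the function names are hypothetical, values are arbitrary
+ Python integers rather than fixed-width words, and the code uses division and
+ remainder by 128 where an implementation would normally shift by seven bits
+ and mask with 0x7F.</p>
+ <div class="doc_code">
+ <pre>
```python
def encode_vbr(value):
    """Encode a non-negative integer as a list of vbr byte values."""
    if 0 > value:
        raise ValueError("unsigned vbr needs a non-negative value")
    out = []
    while value >= 128:
        out.append(value % 128 + 128)  # low seven bits, high bit set
        value //= 128
    out.append(value)                  # final byte, high bit clear
    return out


def decode_vbr(data):
    """Decode one value from the front of data; return (value, bytes_used)."""
    value, scale = 0, 1
    for used, byte in enumerate(data, start=1):
        value += (byte % 128) * scale
        scale *= 128
        if 128 > byte:                 # high bit clear: this was the last byte
            return value, used
    raise ValueError("vbr value is truncated")


def encode_signed_vbr(value):
    """Move the sign into the low order bit, then use the unsigned encoding."""
    if 0 > value:
        return encode_vbr(-value * 2 + 1)
    return encode_vbr(value * 2)


def decode_signed_vbr(data):
    """Inverse of encode_signed_vbr; return (value, bytes_used)."""
    raw, used = decode_vbr(data)
    magnitude = raw // 2
    return (-magnitude if raw % 2 else magnitude), used


if __name__ == "__main__":
    for number in (0, 127, 128, 16383, 16384):
        print(number, encode_vbr(number))
    print(encode_signed_vbr(-3), encode_signed_vbr(3))
```
+ </pre>
+ </div>
+ <p>The least significant seven bits are written first, so a reader can
+ accumulate the value byte by byte without knowing the length in advance.</p>
+ </div>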
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="encoding">Encoding Primitives</a> </div>
+ <div class="doc_text">
+ <p>Each field in the bytecode format is encoded into the file using a
+ small set of primitive formats. The table below defines the encoding
+ rules for the various primitives used and gives them each a type name.
+ These type names are used in the descriptions of blocks and fields in the <a
+  href="#details">Detailed Layout</a> section. Any type name with
+ the suffix <em>_vbr</em> indicates a quantity that is encoded using
+ variable bit rate encoding as described above.</p>
+ <table class="doc_table">
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Rule</b></th>
+     </tr>
+     <tr>
+       <td><a name="unsigned"><b>unsigned</b></a></td>
+       <td class="td_left">A 32-bit unsigned integer that always occupies four 
+       consecutive bytes. The unsigned integer is encoded using LSB first 
+       ordering. That is, bits 2<sup>0</sup> through 2<sup>7</sup> are in the 
+       byte with the lowest file offset (little endian).</td>
+     </tr>
+     <tr>
+       <td style="vertical-align: top;"><a name="uint24_vbr">
+         <b>uint24_vbr</b></a></td>
+       <td style="vertical-align: top; text-align: left;">A 24-bit unsigned 
+       integer that occupies from one to four bytes using variable bit rate 
+       encoding.</td>
+     </tr>
+     <tr>
+       <td><a name="uint32_vbr"><b>uint32_vbr</b></a></td>
+       <td class="td_left">A 32-bit unsigned integer that occupies from one to 
+         five bytes using variable bit rate encoding.</td>
+     </tr>
+     <tr>
+       <td><a name="uint64_vbr"><b>uint64_vbr</b></a></td>
+       <td class="td_left">A 64-bit unsigned integer that occupies from one to ten 
+         bytes using variable bit rate encoding.</td>
+     </tr>
+     <tr>
+       <td><a name="int64_vbr"><b>int64_vbr</b></a></td>
+       <td class="td_left">A 64-bit signed integer that occupies from one to ten 
+         bytes using the signed variable bit rate encoding.</td>
+     </tr>
+     <tr>
+       <td><a name="char"><b>char</b></a></td>
+       <td class="td_left">A single unsigned character encoded into one byte</td>
+     </tr>
+     <tr>
+       <td><a name="bit"><b>bit(n-m)</b></a></td>
+       <td class="td_left">A set of bits within some larger integer field. The values 
+         of <code>n</code> and <code>m</code> specify the inclusive range of bits 
+         that define the subfield. The value for <code>m</code> may be omitted if 
+         it is the same as <code>n</code>.</td>
+     </tr>
+     <tr>
+       <td style="vertical-align: top;"><b><a name="float"><b>float</b></a></b></td>
+       <td style="vertical-align: top; text-align: left;">A floating point value encoded 
+         as a 32-bit IEEE value written in little-endian form.<br>
+       </td>
+     </tr>
+     <tr>
+       <td style="vertical-align: top;"><b><b><a name="double"><b>double</b></a></b></b></td>
+       <td style="vertical-align: top; text-align: left;">A floating point value encoded 
+         as a 64-bit IEEE value written in little-endian form.</td>
+     </tr>
+     <tr>
+       <td><a name="string"><b>string</b></a></td>
+       <td class="td_left">A uint32_vbr indicating the type of the
+ constant string which also includes its length, immediately followed by
+ the characters of the string. There is no terminating null byte in the
+ string.</td>
+     </tr>
+     <tr>
+       <td><a name="data"><b>data</b></a></td>
+       <td class="td_left">An arbitrarily long segment of data to which
+ no interpretation is implied. This is used for constant initializers.<br>
+       </td>
+     </tr>
+     <tr>
+       <td><a name="llist"><b>llist(x)</b></a></td>
+       <td class="td_left">A length list of x. This means the list is
+ encoded as an <a href="#uint32_vbr">uint32_vbr</a> providing the
+ length of the list, followed by a sequence of that many "x" items. This
+ implies that the reader should iterate the number of times provided by
+ the length.</td>
+     </tr>
+     <tr>
+       <td><a name="zlist"><b>zlist(x)</b></a></td>
+       <td class="td_left">A zero-terminated list of x. This means the
+ list is encoded as a sequence of an indeterminate number of "x" items,
+ followed by an <a href="#uint32_vbr">uint32_vbr</a> terminating value.
+ This implies that none of the "x" items can have a zero value (or else
+ the list terminates).</td>
+     </tr>
+     <tr>
+       <td><a name="block"><b>block</b></a></td>
+       <td class="td_left">A block of data that is logically related. A
+ block begins with an unsigned 32-bit integer that encodes the type of the block
+ in the low 5 bits and the size of the block in the high 27 bits. The
+ size does not include the block header or any alignment bytes at the
+ end of the block. Blocks may contain other blocks. </td>
+     </tr>
+   </tbody>
+ </table>
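+ <p>As a rough illustration of the <code>unsigned</code> and
+ <code>block</code> rules in the table above, the following Python sketch reads
+ a four-byte little-endian integer and splits a block header into its type and
+ size. The helper names are hypothetical and alignment bytes are not
+ handled.</p>

```python
import struct


def read_unsigned(data, pos=0):
    """Read the fixed four-byte, little-endian 'unsigned' primitive."""
    (value,) = struct.unpack_from("<I", data, pos)
    return value, pos + 4


def read_block_header(data, pos=0):
    """Split a block header word into (block_type, size, next_pos)."""
    word, pos = read_unsigned(data, pos)
    block_type = word & 0x1F   # low 5 bits hold the block type
    size = word >> 5           # high 27 bits hold the block size
    return block_type, size, pos
```

+ <p>For example, a header word of <code>(100 &lt;&lt; 5) | 0x06</code>
+ decodes to block type 0x06 with a size of 100 bytes.</p>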
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="notation">Field Notation</a> </div>
+ <div class="doc_text">
+ <p>In the detailed block and field descriptions that follow, a
+ regex-like notation is used to describe optional and repeated fields. A very
+ limited subset of regex is used to describe these, as given in the
+ following table: </p>
+ <table class="doc_table">
+   <tbody>
+     <tr>
+       <th><b>Character</b></th>
+       <th class="td_left"><b>Meaning</b></th>
+     </tr>
+     <tr>
+       <td><b><code>?</code></b></td>
+       <td class="td_left">The question mark indicates 0 or 1
+ occurrences of the thing preceding it.</td>
+     </tr>
+     <tr>
+       <td><b><code>*</code></b></td>
+       <td class="td_left">The asterisk indicates 0 or more occurrences
+ of the thing preceding it.</td>
+     </tr>
+     <tr>
+       <td><b><code>+</code></b></td>
+       <td class="td_left">The plus sign indicates 1 or more occurrences
+ of the thing preceding it.</td>
+     </tr>
+     <tr>
+       <td><b><code>()</code></b></td>
+       <td class="td_left">Parentheses are used for grouping.</td>
+     </tr>
+     <tr>
+       <td><b><code>,</code></b></td>
+       <td class="td_left">The comma separates sequential fields.</td>
+     </tr>
+   </tbody>
+ </table>
+ <p>So, for example, consider the following specifications:</p>
+ <div class="doc_code">
+ <ol>
+   <li><code>string?</code></li>
+   <li><code>(uint32_vbr,uint32_vbr)+</code></li>
+   <li><code>(unsigned?,uint32_vbr)*</code></li>
+   <li><code>(llist(unsigned))?</code></li>
+ </ol>
+ </div>
+ <p>with the following interpretations:</p>
+ <ol>
+   <li>An optional string. Matches either nothing or a single string.</li>
+   <li>One or more pairs of uint32_vbr.</li>
+   <li>Zero or more occurrences of either an unsigned followed by a
+ uint32_vbr or just a uint32_vbr.</li>
+   <li>An optional length list of unsigned values.</li>
+ </ol>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="slots">Slots</a> </div>
+ <div class="doc_text">
+ <p>The bytecode format uses the notion of a "slot" to reference Types
+ and Values. Since the bytecode file is a <em>direct</em> representation of
+ LLVM's intermediate representation, there is a need to represent pointers in
+ the file.  Slots are used for this purpose. For example, if one has the following
+ assembly:
+ </p>
+ <div class="doc_code"><code> %MyType = type { int, sbyte }<br>
+ %MyVar = external global %MyType
+ </code></div>
+ <p>there are two definitions. The definition of <tt>%MyVar</tt> uses <tt>%MyType</tt>.
+ In the C++ IR this linkage between <tt>%MyVar</tt> and <tt>%MyType</tt>
+ is explicit through the use of C++ pointers. In bytecode, however, there's no
+ ability to store memory addresses. Instead, we compute and write out
+ slot numbers for every Type and Value written to the file.</p>
+ <p>A slot number is simply an unsigned 32-bit integer encoded in the variable
+ bit rate scheme (see <a href="#encoding">encoding</a>). This ensures that
+ low slot numbers are encoded in one byte. Through various bits of magic LLVM
+ attempts to always keep the slot numbers low. The first attempt is to associate
+ slot numbers with their "type plane". That is, Values of the same type
+ are written to the bytecode file in a list (sequentially). Their order in 
+ that list determines their slot number. This means that slot #1 doesn't mean
+ anything unless you also specify for which type you want slot #1. Types are
+ always written to the file first (in the <a href="#globaltypes">Global Type 
+ Pool</a>) and in such a way that both forward and backward references of the 
+ types can often be resolved with a single pass through the type pool. </p>
+ <p>Slot numbers are also kept small by rearranging their order. Because
+ of the structure of LLVM, certain values are much more likely to be used
+ frequently in the body of a function. For this reason, a compaction table is
+ provided in the body of a function if its use would make the function body 
+ smaller.  Suppose you have a function body that uses just the types "int*" and
+ "{double}" but uses them thousands of times. It's worthwhile to ensure that the 
+ slot numbers for these types are low so they can be encoded in a single byte 
+ (via vbr). This is exactly what the compaction table does.</p>
+ <p>In summary then, a slot number can be thought of as just a vbr-encoded index 
+ into a list of Type* or Value*. To keep slot numbers low, Value* are indexed by
+ two slot numbers: the "type plane index" (type slot) and the "value index"
+ (value slot).</p>
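+ <p>The two-level indexing summarized above can be modelled with a toy Python
+ class. This is only a sketch of the idea: <code>SlotTable</code> and its
+ methods are hypothetical, types are represented by plain strings, and neither
+ the fixed primitive type IDs nor the compaction table is modelled.</p>

```python
class SlotTable:
    """Toy model: one list of types, plus one list of values per type."""

    def __init__(self):
        self.types = []    # position in this list is the type slot
        self.planes = {}   # type slot -> list of values (the "type plane")

    def type_slot(self, type_name):
        """Return the slot for a type, assigning the next free one if new."""
        if type_name not in self.types:
            self.types.append(type_name)
        return self.types.index(type_name)

    def add_value(self, type_name, value):
        """Append a value to its type plane; return (type slot, value slot)."""
        tslot = self.type_slot(type_name)
        plane = self.planes.setdefault(tslot, [])
        plane.append(value)
        return tslot, len(plane) - 1

    def lookup(self, tslot, vslot):
        """Resolve a (type slot, value slot) pair back to the value."""
        return self.planes[tslot][vslot]
```

+ <p>Note that a value slot is meaningful only together with its type slot:
+ value slot 0 of one plane and value slot 0 of another refer to different
+ values.</p>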
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="general">General Structure</a> </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+ <p>This section provides the general structure of the LLVM bytecode
+ file format. The bytecode file format requires blocks to be in a
+ certain order and nested in a particular way so that an LLVM module can
+ be constructed efficiently from the contents of the file. This ordering
+ defines a general structure for bytecode files as shown below. The
+ table below shows the order in which all block types may appear. Please
+ note that some of the blocks are optional and some may be repeated. The
+ structure is fairly loose because optional blocks, if empty, are
+ completely omitted from the file.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th>ID</th>
+       <th>Parent</th>
+       <th>Optional?</th>
+       <th>Repeated?</th>
+       <th>Level</th>
+       <th>Block Type</th>
+       <th>Description</th>
+     </tr>
+     <tr>
+       <td>N/A</td>
+       <td>File</td>
+       <td>No</td>
+       <td>No</td>
+       <td>0</td>
+       <td class="td_left"><a href="#signature">Signature</a></td>
+       <td class="td_left">This contains the file signature (magic
+ number) that identifies the file as LLVM bytecode.</td>
+     </tr>
+     <tr>
+       <td>0x01</td>
+       <td>File</td>
+       <td>No</td>
+       <td>No</td>
+       <td>0</td>
+       <td class="td_left"><a href="#module">Module</a></td>
+       <td class="td_left">This is the top level block in a bytecode
+ file. It contains all the other blocks. </td>
+     </tr>
+     <tr>
+       <td>0x06</td>
+       <td>Module</td>
+       <td>No</td>
+       <td>No</td>
+       <td>1</td>
+       <td class="td_left">   <a href="#globaltypes">Global Type Pool</a></td>
+       <td class="td_left">This block contains all the global (module)
+ level types.</td>
+     </tr>
+     <tr>
+       <td>0x05</td>
+       <td>Module</td>
+       <td>No</td>
+       <td>No</td>
+       <td>1</td>
+       <td class="td_left">   <a href="#globalinfo">Module Globals Info</a></td>
+       <td class="td_left">This block contains the type, constness, and
+ linkage for each of the global variables in the module. It also
+ contains the type of the functions and the constant initializers.</td>
+     </tr>
+     <tr>
+       <td>0x03</td>
+       <td>Module</td>
+       <td>Yes</td>
+       <td>No</td>
+       <td>1</td>
+       <td class="td_left">   <a href="#constantpool">Module Constant Pool</a></td>
+       <td class="td_left">This block contains all the global constants
+ except function arguments, global values and constant strings.</td>
+     </tr>
+     <tr>
+       <td>0x02</td>
+       <td>Module</td>
+       <td>Yes</td>
+       <td>Yes</td>
+       <td>1</td>
+       <td class="td_left">   <a href="#functiondefs">Function Definitions</a>*</td>
+       <td class="td_left">One function block is written for each
+ function in the module. The function block contains the instructions,
+ compaction table, type constant pool, and symbol table for the function.</td>
+     </tr>
+     <tr>
+       <td>0x03</td>
+       <td>Function</td>
+       <td>Yes</td>
+       <td>No</td>
+       <td>2</td>
+       <td class="td_left">      <a
+  href="#constantpool">Function Constant Pool</a></td>
+       <td class="td_left">Any constants (including types) used solely
+ within the function are emitted here in the function constant pool. </td>
+     </tr>
+     <tr>
+       <td>0x08</td>
+       <td>Function</td>
+       <td>Yes</td>
+       <td>No</td>
+       <td>2</td>
+       <td class="td_left">      <a
+  href="#compactiontable">Compaction Table</a></td>
+       <td class="td_left">This table reduces bytecode size by providing
+ a function-local mapping of type and value slot numbers to their global
+ slot numbers.</td>
+     </tr>
+     <tr>
+       <td>0x07</td>
+       <td>Function</td>
+       <td>No</td>
+       <td>No</td>
+       <td>2</td>
+       <td class="td_left">      <a
+  href="#instructionlist">Instruction List</a></td>
+       <td class="td_left">This block contains all the instructions of
+ the function. The basic blocks are inferred by terminating
+ instructions. </td>
+     </tr>
+     <tr>
+       <td>0x04</td>
+       <td>Function</td>
+       <td>Yes</td>
+       <td>No</td>
+       <td>2</td>
+       <td class="td_left">      <a
+  href="#symtab">Function Symbol Table</a></td>
+       <td class="td_left">This symbol table provides the names for the
+ function specific values used (basic block labels mostly).</td>
+     </tr>
+     <tr>
+       <td>0x04</td>
+       <td>Module</td>
+       <td>Yes</td>
+       <td>No</td>
+       <td>1</td>
+       <td class="td_left">   <a href="#symtab">Module Symbol Table</a></td>
+       <td class="td_left">This symbol table provides the names for the
+ various entries in the file that are not function specific (global
+ vars, and functions mostly).</td>
+     </tr>
+   </tbody>
+ </table>
+ <p>Use the links in the table for details about the contents of each of
+ the block types.</p>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="blockdefs">Block Definitions</a> </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+ <p>This section provides the detailed layout of the individual block
+ types in the LLVM bytecode file format. </p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="signature">Signature Block</a> </div>
+ <div class="doc_text">
+ <p>The signature occurs in every LLVM bytecode file and is always first.
+ It simply provides a few bytes of data to identify the file as being an LLVM
+ bytecode file. This block is four bytes in length (five for compressed files) and differs from the
+ other blocks because there is no identifier and no block length at the start
+ of the block. Essentially, this block is just the "magic number" for the file.
+ </p>
+ <p>There are two types of signatures for LLVM bytecode: uncompressed and
+ compressed as shown in the table below. </p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Uncompressed</b></th>
+       <th class="td_left"><b>Compressed</b></th>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a></td>
+       <td class="td_left">Constant "l" (0x6C)</td>
+       <td class="td_left">Constant "l" (0x6C)</td>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a></td>
+       <td class="td_left">Constant "l" (0x6C)</td>
+       <td class="td_left">Constant "l" (0x6C)</td>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a></td>
+       <td class="td_left">Constant "v" (0x76)</td>
+       <td class="td_left">Constant "v" (0x76)</td>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a></td>
+       <td class="td_left">Constant "m" (0x6D)</td>
+       <td class="td_left">Constant "c" (0x63)</td>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a></td>
+       <td class="td_left">N/A</td>
+       <td class="td_left">'0'=null,'1'=gzip,'2'=bzip2</td>
+     </tr>
+   </tbody>
+ </table>
+ <p>In other words, the uncompressed signature is just the characters 'llvm'
+ while the compressed signature is the characters 'llvc' followed by an ASCII
+ digit ('0', '1', or '2') that indicates the kind of compression used. A value of
+ '0' indicates that null compression was used. This can happen when compression
+ was requested on a platform that wasn't configured for gzip or bzip2. A value of
+ '1' means that the rest of the file is compressed using the gzip algorithm and
+ should be uncompressed before interpretation. A value of '2' means that the rest
+ of the file is compressed using the bzip2 algorithm and should be uncompressed
+ before interpretation. In all cases, the data resulting from uncompression
+ should be interpreted as if it occurred immediately after the 'llvm'
+ signature (i.e. the uncompressed data begins with the 
+ <a href="#module">Module Block</a>).</p>
+ <p><b>NOTE:</b> As of LLVM 1.4, all bytecode files produced by the LLVM tools
+ are compressed by default. To disable compression, pass the 
+ <tt>--disable-compression</tt> option to the tool, if it supports it.</p>
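+ <p>A reader might check the signature along the following lines. This Python
+ sketch is illustrative only: <code>parse_signature</code> is a hypothetical
+ name, and decompressing the payload is left to the caller.</p>

```python
COMPRESSION = {ord("0"): "null", ord("1"): "gzip", ord("2"): "bzip2"}


def parse_signature(data):
    """Return (compression, payload_offset).

    compression is None for an uncompressed file, otherwise one of
    "null", "gzip" or "bzip2".
    """
    magic = data[:4]
    if magic == b"llvm":
        return None, 4
    if magic == b"llvc":
        # The fifth byte is an ASCII digit naming the compression kind.
        if len(data) < 5 or data[4] not in COMPRESSION:
            raise ValueError("unknown compression kind")
        return COMPRESSION[data[4]], 5
    raise ValueError("not an LLVM bytecode file")
```

+ <p>The returned offset marks where the Module Block begins (for uncompressed
+ files) or where the compressed payload begins.</p>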
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="module">Module Block</a> </div>
+ <div class="doc_text">
+ <p>The module block contains a small preamble and all the other blocks in
+ the file. The table below shows the structure of the module block. Note that it
+ only provides the module identifier, size of the module block, and the format
+ information. Everything else is contained in other blocks, described in other
+ sections.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#unsigned">unsigned</a><br></td>
+       <td class="td_left"><a href="#mod_header">Module Block Identifier
+           (0x01)</a></td>
+     </tr>
+     <tr>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left"><a href="#mod_header">Module Block Size</a></td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left"><a href="#format">Format Information</a></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left"><a href="#globaltypes">Global Type Pool</a></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left"><a href="#globalinfo">Module Globals Info</a></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left"><a href="#constantpool">Module Constant Pool</a></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a>*</td>
+       <td class="td_left"><a href="#functiondefs">Function Definitions</a></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left"><a href="#symtab">Module Symbol Table</a></td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="mod_header">Module Block Header</a></div>
+ <div class="doc_text">
+   <p>The block header for the module block uses a longer format than the other
+   blocks in a bytecode file. Specifically, instead of encoding the type and size
+   of the block into a 32-bit integer with 5 bits for type and 27 bits for size,
+   the module block header uses two 32-bit unsigned values, one for type, and one
+   for size. While the 2<sup>27</sup> byte limit on block size is sufficient for the blocks
+   contained in the module, it isn't sufficient for the module block itself
+   because we want to ensure that bytecode files as large as 2<sup>32</sup> bytes
+   are possible. For this reason, the module block (and only the module block)
+   uses a long format header.</p>
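+ <p>The long header can be read as two consecutive little-endian 32-bit
+ values. The Python sketch below illustrates this; the function name is
+ hypothetical and the check against 0x01 follows the Module Block table
+ above.</p>

```python
import struct

MODULE_BLOCK_ID = 0x01


def read_module_header(data, pos=0):
    """Read the long-format module header; return (size, next_pos)."""
    # Two full 32-bit fields: one for the block type, one for the size.
    block_type, size = struct.unpack_from("<II", data, pos)
    if block_type != MODULE_BLOCK_ID:
        raise ValueError("expected module block identifier 0x01")
    return size, pos + 8
```

+ <p>Because the size has its own 32-bit field, it is not limited to the 27
+ bits available in the short block header.</p>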
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="format">Format Information</a></div>
+ <div class="doc_text">
+ <p>The format information field is encoded into a <a href="#uint32_vbr">uint32_vbr</a>
+ as shown in the following table.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(0)</a></td>
+       <td class="td_left">Target is big endian?</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(1)</a></td>
+       <td class="td_left">Target pointers are 64-bit?</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(2)</a></td>
+       <td class="td_left">Target has no endianness?</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(3)</a></td>
+       <td class="td_left">Target has no pointer size?</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(4-31)</a></td>
+       <td class="td_left">Bytecode format version</td>
+     </tr>
+   </tbody>
+ </table>
+ <p>
+ Of particular note, the bytecode format number is simply a 28-bit
+ monotonically increasing integer that identifies the version of the bytecode
+ format (which is not directly related to the LLVM release number). The
+ bytecode versions defined so far are (note that this document only
+ describes the latest version, 1.3):</p>
+ <ul>
+   <li>#0: LLVM 1.0 &amp; 1.1</li>
+   <li>#1: LLVM 1.2</li>
+   <li>#2: LLVM 1.2.5 (not released)</li>
+   <li>#3: LLVM 1.3</li>
+   <li>#4: LLVM 1.3.x (not released)</li>
+   <li>#5: LLVM 1.4 and newer</li>
+ </ul>
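+ <p>Once the format information word has been decoded from its vbr form, its
+ fields can be extracted with simple masks and shifts. The Python sketch below
+ follows the bit layout in the table above; the function and key names are
+ hypothetical.</p>

```python
def decode_format_info(word):
    """Split the decoded format information word into its fields."""
    return {
        "big_endian": bool(word & 0x1),        # bit 0
        "pointers_64bit": bool(word & 0x2),    # bit 1
        "no_endianness": bool(word & 0x4),     # bit 2
        "no_pointer_size": bool(word & 0x8),   # bit 3
        "version": word >> 4,                  # bits 4-31
    }
```

+ <p>For instance, the word <code>(5 &lt;&lt; 4) | 0x2</code> describes
+ bytecode format version 5 for a little-endian target with 64-bit
+ pointers.</p>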
+ <p>Note that we plan to eventually expand the target description
+ capabilities
+ of bytecode files to <a href="http://llvm.org/PR263">target
+ triples</a>.
+ </p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="globaltypes">Global Type Pool</a> </div>
+ <div class="doc_text">
+ <p>The global type pool consists of type definitions. Their order of appearance
+ in the file determines their type slot number (0 based). Slot numbers are
+ used to replace pointers in the intermediate representation. Each slot number 
+ uniquely identifies one entry in a type plane (a collection of values of the
+ same type).  Since all values have types and are associated with the order in 
+ which the type pool is written, the global type pool <em>must</em> be written 
+ as the first block of a module. If it is not, attempts to read the file will
+ fail because both forward and backward type resolution will not be possible.</p>
+ <p>The type pool is simply a list of type definitions, as shown in the
+ table below.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">Type Pool Identifier (0x06) + Size<br>
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#llist">llist</a>(<a href="#type">type</a>)</td>
+       <td class="td_left">A length list of type definitions.</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="type">Type Definitions</a></div>
+ <div class="doc_text">
+ <p>Types in the type pool are defined using a different format for each kind
+ of type, as given in the following sections.</p>
+ <h3>Primitive Types</h3>
+ <p>The primitive types encompass the basic integer and floating point
+ types. They are encoded simply as their TypeID.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID for the primitive types (values 1 to
+ 11) <sup>1</sup></td>
+     </tr>
+   </tbody>
+ </table>
+ Notes:
+ <ol>
+   <li>The values for the Type IDs for the primitive types are provided
+ by the definition of the <code>llvm::Type::TypeID</code> enumeration
+ in <code>include/llvm/Type.h</code>. The enumeration gives the
+ following mapping:
+     <ol>
+       <li>bool</li>
+       <li>ubyte</li>
+       <li>sbyte</li>
+       <li>ushort</li>
+       <li>short</li>
+       <li>uint</li>
+       <li>int</li>
+       <li>ulong</li>
+       <li>long</li>
+       <li>float</li>
+       <li>double</li>
+     </ol>
+   </li>
+ </ol>
+ <h3>Function Types</h3>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID for function types (13)</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type slot number of function's return type.</td>
+     </tr>
+     <tr>
+       <td><a href="#llist">llist</a>(<a href="#uint24_vbr">uint24_vbr</a>)</td>
+       <td class="td_left">Type slot number of each argument's type.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a>?</td>
+       <td class="td_left">Value 0 if this is a varargs function,
+ missing otherwise.</td>
+     </tr>
+   </tbody>
+ </table>
+ <h3>Structure Types</h3>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID for structure types (14)</td>
+     </tr>
+     <tr>
+       <td><a href="#zlist">zlist</a>(<a href="#uint24_vbr">uint24_vbr</a>)</td>
+       <td class="td_left">Type slot number of each of the structure's fields.</td>
+     </tr>
+   </tbody>
+ </table>
+ <h3>Array Types</h3>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID for Array Types (15)</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type slot number of array's element type.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The number of elements in the array.</td>
+     </tr>
+   </tbody>
+ </table>
+ <h3>Pointer Types</h3>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID For Pointer Types (16)</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type slot number of pointer's element type.</td>
+     </tr>
+   </tbody>
+ </table>
+ <h3>Opaque Types</h3>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID For Opaque Types (17)</td>
+     </tr>
+   </tbody>
+ </table>
+ <h3>Packed Types</h3>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type ID for Packed Types (18)</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Slot number of packed vector's element type.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The number of elements in the packed vector.</td>
+     </tr>
+   </tbody>
+ </table>
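+ <p>The per-kind layouts above lend themselves to a small dispatch on the Type
+ ID. The following Python sketch decodes one type definition from a byte
+ string. It is an illustration under stated assumptions: the names are
+ hypothetical, a set high bit is assumed to mean that another vbr byte
+ follows, and function types (13) are deliberately left out.</p>

```python
PRIMITIVES = {1: "bool", 2: "ubyte", 3: "sbyte", 4: "ushort", 5: "short",
              6: "uint", 7: "int", 8: "ulong", 9: "long", 10: "float",
              11: "double"}


def decode_vbr(data, pos):
    """Return (value, next_pos); seven value bits per byte, low bits first."""
    value, shift = 0, 0
    while True:
        byte = data[pos]
        pos += 1
        value |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return value, pos
        shift += 7


def read_type_definition(data, pos=0):
    """Decode one type definition; return (description, next_pos)."""
    type_id, pos = decode_vbr(data, pos)
    if type_id in PRIMITIVES:
        return ("primitive", PRIMITIVES[type_id]), pos
    if type_id == 14:              # structure: zero-terminated list of slots
        fields = []
        while True:
            slot, pos = decode_vbr(data, pos)
            if slot == 0:
                return ("struct", fields), pos
            fields.append(slot)
    if type_id in (15, 18):        # array / packed: element slot, then count
        element, pos = decode_vbr(data, pos)
        count, pos = decode_vbr(data, pos)
        return ("array" if type_id == 15 else "packed", element, count), pos
    if type_id == 16:              # pointer: element type slot
        element, pos = decode_vbr(data, pos)
        return ("pointer", element), pos
    if type_id == 17:              # opaque: nothing follows the Type ID
        return ("opaque",), pos
    raise NotImplementedError("type ID %d is not handled in this sketch" % type_id)
```

+ <p>The slot numbers returned here are indices into the type pool, so a
+ complete reader would resolve them once the whole pool has been read.</p>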
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="globalinfo">Module Global Info</a>
+ </div>
+ <div class="doc_text">
+ <p>The module global info block contains the definitions of all global
+ variables including their initializers and the <em>declaration</em> of
+ all functions. The format is shown in the table below:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">Module global info identifier (0x05) + size</td>
+     </tr>
+     <tr>
+       <td><a href="#zlist">zlist</a>(<a href="#globalvar">globalvar</a>)</td>
+       <td class="td_left">A zero terminated list of global var
+ definitions occurring in the module.</td>
+     </tr>
+     <tr>
+       <td><a href="#zlist">zlist</a>(<a href="#funcfield">funcfield</a>)</td>
+       <td class="td_left">A zero terminated list of function definitions
+ occurring in the module.</td>
+     </tr>
+     <tr>
+       <td><a href="#llist">llist</a>(<a href="#string">string</a>)</td>
+       <td class="td_left">A length list
+ of strings that specify the names of the libraries that this module
+ depends upon.</td>
+     </tr>
+     <tr>
+       <td><a href="#string">string</a></td>
+       <td class="td_left">The target
+ triple for the module (blank means no target triple specified, i.e. a
+ platform independent module).</td>
+     </tr>
+     <tr>
+       <td><a href="#llist">llist</a>(<a href="#string">string</a>)</td>
+       <td class="td_left">A length list
+ of strings that defines a table of section strings for globals.  A global's
+ SectionID is an index into this table.</td>
+     </tr>
+     <tr>
+       <td><a href="#string">string</a></td>
+       <td class="td_left">The inline asm block for this module.</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="globalvar">Global Variable Field</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Global variables are written using a <a href="#uint32_vbr">uint32_vbr</a>
+ that encodes information about the global variable, an optional extension vbr,
+ and an optional initializer for the global variable.</p>
+ 
+ <p>The table below provides the bit layout of the first <a
+  href="#uint32_vbr">uint32_vbr</a> that describes the global variable.</p>
+  
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(0)</a></td>
+       <td class="td_left">Is constant?</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(1)</a></td>
+       <td class="td_left">Has initializer? Note that this bit
+ determines whether the constant initializer field (described below)
+ follows. </td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(2-4)</a></td>
+       <td class="td_left">Linkage type: 0=External, 1=Weak,
+ 2=Appending, 3=Internal, 4=LinkOnce</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(5-31)</a></td>
+       <td class="td_left">Type slot number of type for the global variable.</td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p>When the Linkage type is set to 3 (internal) and the initializer field is set
+ to 0 (an invalid combination), an extension word follows the first <a
+ href="#uint32_vbr">uint32_vbr</a>. The extension word encodes the real linkage
+ and init flag, and can include more information:</p>
+ 
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(0)</a></td>
+       <td class="td_left">Has initializer?  Indicates the real value of the "Has
+         initializer" field for the global. </td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(1-3)</a></td>
+       <td class="td_left">Linkage type: Indicates the real value of the "linkage
+         type" field for the global.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(4-8)</a></td>
+       <td class="td_left">The log-base-2 of the alignment for the global.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(9)</a></td>
+       <td class="td_left">If this bit is set, a SectionID follows this vbr.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(10-31)</a></td>
+       <td class="td_left">Currently unassigned.</td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p>If the SectionID bit is set above, the following field is included:</p>
+ 
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a>
+       </td>
+       <td class="td_left">An optional section ID number, specifying the string
+         to use for the section of the global.  This is an index (+1) of an entry
+         into the SectionID llist in the <a href="#globalinfo">Module Global
+         Info</a> block.  If this value is 0 or not present, the global has an
+         empty section string.</td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p>If the "Has initializer" field is set, the following field is included:</p>
+ 
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a>
+       </td>
+       <td class="td_left">An optional value slot number for the global 
+           variable's constant initializer.</td>
+     </tr>
+   </tbody>
+ </table>
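+ 
+ <p>As an illustration only, the bit layout of the first word can be unpacked
+ as in the sketch below. The function and dictionary key names are hypothetical
+ and are not part of LLVM. The sketch assumes the <a
+ href="#uint32_vbr">uint32_vbr</a> has already been decoded into a plain integer,
+ and it does not decode the extension word.</p>

```python
# Linkage codes as listed in the first-word table.
LINKAGE_NAMES = {0: "External", 1: "Weak", 2: "Appending", 3: "Internal", 4: "LinkOnce"}


def decode_global_var_word(word):
    """Split the first word of a global variable field into its parts.

    `word` is assumed to be the already-decoded uint32_vbr value.
    """
    return {
        "is_constant": bool(word & 1),              # bit 0
        "has_initializer": bool((word >> 1) & 1),   # bit 1
        "linkage": (word >> 2) & 0x7,               # bits 2-4
        "type_slot": (word >> 5) & 0x7FFFFFF,       # bits 5-31 (27 bits)
    }


def needs_extension_word(fields):
    """Internal linkage (3) without an initializer signals an extension word."""
    return fields["linkage"] == 3 and not fields["has_initializer"]


# A constant, initialized, weak global whose type lives in type slot 42.
example = decode_global_var_word((42 << 5) | (1 << 2) | (1 << 1) | 1)
```

+ <p>In this sketch a reader would call <tt>needs_extension_word</tt> on the
+ result before deciding whether to read a second vbr.</p>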
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="funcfield">Function Field</a>
+ </div>
+ <div class="doc_text">
+ <p>Functions are written using an <a href="#uint32_vbr">uint32_vbr</a>
+ that encodes information about the function and a set of flags.  If needed,
+ an extension word may follow this first field.</p>
+ 
+ <p>The table below provides the bit layout of the <a
+ href="#uint32_vbr">uint32_vbr</a> that describes the function.</p>
+ 
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(0-3)</a></td>
+       <td class="td_left">
+       Encodes the calling convention number of the function. The
+       CC number of the function is the value of this field minus one.
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(4)</a></td>
+       <td class="td_left">If this bit is set to 1, the indicated function is
+       external, and there is no <a href="#functiondefs">Function Definition
+       Block</a> in the bytecode file for the function.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(5-30)</a></td>
+       <td class="td_left">Type slot number of type for the function.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(31)</a></td>
+       <td class="td_left">Indicates whether an extension word follows.</td>
+     </tr>
+   </tbody>
+ </table>
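+ 
+ <p>The first-word layout for functions can be sketched the same way. The names
+ below are hypothetical, the input is assumed to be the already-decoded <a
+ href="#uint32_vbr">uint32_vbr</a> value, and the calling convention is computed
+ exactly as the table states (stored value minus one).</p>

```python
def decode_function_word(word):
    """Split the first word of a function field into its parts.

    `word` is assumed to be the already-decoded uint32_vbr value.
    """
    return {
        # Bits 0-3 store the calling convention number plus one.
        "calling_convention": (word & 0xF) - 1,
        # Bit 4: external function, no definition block in this file.
        "is_external": bool((word >> 4) & 1),
        # Bits 5-30 (26 bits): type slot number of the function type.
        "type_slot": (word >> 5) & 0x3FFFFFF,
        # Bit 31: an extension word follows.
        "has_extension": bool((word >> 31) & 1),
    }


# External function, stored CC field 1, type slot 100, with an extension word.
example = decode_function_word((1 << 31) | (100 << 5) | (1 << 4) | 1)
```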
+ 
+ <p>If bit(31) is set, an additional <a href="#uint32_vbr">uint32_vbr</a> word
+ follows with the following fields:</p>
+ 
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(0-4)</a></td>
+       <td class="td_left">The log-base-2 of the alignment for the function.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(5-9)</a></td>
+       <td class="td_left">The top nibble of the calling convention.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(10)</a></td>
+       <td class="td_left">If this bit is set, a SectionID follows this vbr.</td>
+     </tr>
+     <tr>
+       <td><a href="#bit">bit(11-31)</a></td>
+       <td class="td_left">Currently unassigned.</td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p>If the SectionID bit is set above, the following field is included:</p>
+ 
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a>
+       </td>
+       <td class="td_left">An optional section ID number, specifying the string
+         to use for the section of the function.  This is an index (+1) of an entry
+         into the SectionID llist in the <a href="#globalinfo">Module Global
+         Info</a> block.  If this value is 0 or not present, the function has an
+         empty section string.</td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="constantpool">Constant Pool</a> </div>
+ <div class="doc_text">
+ <p>A constant pool defines a set of constant values. There are
+ actually two types of constant pool blocks: one for modules and one for
+ functions. For modules, the block begins with the constant strings
+ encountered anywhere in the module. For functions, the block begins
+ with types only encountered in the function. In both cases the header
+ is identical. The tables that follow show the header, module constant
+ pool preamble, function constant pool preamble, and the part common to
+ both function and module constant pools.</p>
+ <p><b>Common Block Header</b></p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">Constant pool identifier (0x03) + size<br>
+       </td>
+     </tr>
+   </tbody>
+ </table>
+ <p><b>Module Constant Pool Preamble (constant strings)</b></p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The number of constant strings that follow.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Zero. This identifies the following "plane"
+ as containing the constant strings. This is needed to identify it
+ uniquely from other constant planes that follow. </td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a>+</td>
+       <td class="td_left">Type slot number of the constant string's type.
+ Note that the constant string's type implicitly defines the length of
+ the string. </td>
+     </tr>
+   </tbody>
+ </table>
+ <p><b>Function Constant Pool Preamble (function types)</b></p>
+ <p>The structure of the types for functions is identical to the <a
+  href="#globaltypes">Global Type Pool</a>. Please refer to that section
+ for the details. </p>
+ <p><b>Common Part (other constants)</b></p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Number of entries in this type plane.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type slot number of this plane.</td>
+     </tr>
+     <tr>
+       <td><a href="#constant">constant</a>+</td>
+       <td class="td_left">The definition of a constant (see below).</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="constant">Simple Constant Pool
+ Entries</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>Constant pool entries come in many shapes and flavors. The sections that
+ follow define the format for each of them. All constants start with a <a
+  href="#uint32_vbr">uint32_vbr</a> encoded integer that provides the
+ number of operands for the constant. For primitive, structure, and
+ array constants, this will always be zero to indicate that the form of the 
+ constant is solely determined by its type. In this case, we have the following
+ field definitions, based on type:</p>
+ 
+ <ul>
+   <li><b>Bool</b>. This is written as an <a href="#uint32_vbr">uint32_vbr</a>
+ of value 1U or 0U.</li>
+   <li><b>Signed Integers (sbyte,short,int,long)</b>. These are written
+ as an <a href="#int64_vbr">int64_vbr</a> with the corresponding value.</li>
+   <li><b>Unsigned Integers (ubyte,ushort,uint,ulong)</b>. These are
+ written as an <a href="#uint64_vbr">uint64_vbr</a> with the
+ corresponding value. </li>
+   <li><b>Floating Point</b>. Both the float and double types are
+ written literally in binary format.</li>
+   <li><b>Arrays</b>. Arrays are written simply as a list of <a
+  href="#uint32_vbr">uint32_vbr</a> encoded value slot numbers to the constant
+ element values.</li>
+   <li><b>Structures</b>. Structures are written simply as a list of <a
+  href="#uint32_vbr">uint32_vbr</a> encoded value slot numbers to the constant
+ field values of the structure.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Undef Entries</div>
+ 
+ <div class="doc_text">
+ <p>When the number of operands to the constant is one, we have an 'undef' value
+ of the specified type.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Inline Assembler Entries</div>
+ 
+ <div class="doc_text">
+ <p>Inline Assembler entries are stored in the constant pool, though they are not
+    officially LLVM constants.  These entries are marked with a value of
+    "4294967295" (all ones) for the number of operands.  They are encoded as
+    follows:</p>
+    
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#string">string</a></td>
+       <td class="td_left">The asm string.</td>
+     </tr>
+     <tr>
+       <td><a href="#string">string</a></td>
+       <td class="td_left">The constraints string.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Flags</td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p>Currently, the only defined flag, the low bit, indicates whether or not the
+    inline assembler has side effects.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Constant Expression Entries</div>
+ 
+ <div class="doc_text">
+ 
+ <p>Otherwise, we have a constant expression.  The format of the constant
+ expression is specified in the table below, and the leading operand-count field
+ is equal to the number of operands+1.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Op code of the instruction for the constant
+ expression.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The value slot number of the constant value for an
+ operand.<sup>1</sup></td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">The type slot number for the type of the constant
+ value for an operand.<sup>1</sup></td>
+     </tr>
+   </tbody>
+ </table>
+ Notes:
+ <ol>
+   <li>Both these fields are repeatable but only in pairs.</li>
+ </ol>
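+ 
+ <p>Taken together, the leading operand-count field selects one of the four
+ entry kinds described in the constant pool entry sections. The sketch below
+ illustrates that dispatch. The function name and the returned labels are
+ hypothetical, and the field is assumed to be already decoded from its vbr
+ form.</p>

```python
# Marker value for inline assembler entries (all ones in 32 bits).
INLINE_ASM_MARKER = 0xFFFFFFFF  # 4294967295


def classify_constant(operand_count_field):
    """Return (kind, operand_count) for a constant pool entry.

    `operand_count_field` is the already-decoded leading uint32_vbr.
    """
    if operand_count_field == 0:
        # Form is determined solely by the constant's type.
        return ("simple", 0)
    if operand_count_field == 1:
        return ("undef", 0)
    if operand_count_field == INLINE_ASM_MARKER:
        return ("inline_asm", 0)
    # Constant expressions store the number of operands plus one.
    return ("constant_expr", operand_count_field - 1)
```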
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="functiondefs">Function Definition</a></div>
+ <div class="doc_text">
+ <p>Function definitions contain the linkage, constant pool or
+ compaction table, instruction list, and symbol table for a function.
+ The following table shows the structure of a function definition.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a><br>
+       </td>
+       <td class="td_left">Function definition block identifier (0x02) +
+ size<br>
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The linkage type of the function: 0=External,
+ 1=Weak, 2=Appending, 3=Internal, 4=LinkOnce<sup>1</sup></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">The <a href="#constantpool">constant pool</a>
+ block for this function.<sup>2</sup></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">The <a href="#compactiontable">compaction
+ table</a> block for the function.<sup>2</sup></td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">The <a href="#instructionlist">instruction
+ list</a> for the function.</td>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a></td>
+       <td class="td_left">The function's <a href="#symtab">symbol
+ table</a> containing only those symbols pertinent to the function
+ (mostly block labels).</td>
+     </tr>
+   </tbody>
+ </table>
+ Notes:
+ <ol>
+   <li>Note that if the linkage type is "External" then none of the
+ other fields will be present as the function is defined elsewhere.</li>
+   <li>Note that only one of the constant pool or compaction table will
+ be written. Compaction tables are only written if they will actually
+ save bytecode space. If not, then a regular constant pool is written.</li>
+ </ol>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="compactiontable">Compaction Table</a>
+ </div>
+ <div class="doc_text">
+ <p>Compaction tables are part of a function definition. They are merely
+ a device for reducing the size of bytecode files. The size of a
+ bytecode file is dependent on the <em>values</em> of the slot numbers
+ used because larger values use more bytes in the variable bit rate
+ encoding scheme. Furthermore, the compressed instruction format
+ reserves only six bits for the type of the instruction. In large
+ modules, declaring hundreds or thousands of types, the values of the
+ slot numbers can be quite large. However, functions may use only a
+ small fraction of the global types. In such cases a compaction table is
+ created that maps the global type and value slot numbers to smaller
+ values used by a function. Functions will contain either a
+ function-specific constant pool <em>or</em> a compaction table but not
+ both. Compaction tables have the format shown in the table below.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The number of types that follow</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a>+</td>
+       <td class="td_left">The type slot number in the global types of
+ the type that will be referenced in the function with the index of this
+ entry in the compaction table.</td>
+     </tr>
+     <tr>
+       <td><a href="#type_len">type_len</a></td>
+       <td class="td_left">An encoding of the type and number of values
+ that follow. This field's encoding varies depending on the size of the
+ type plane. See <a href="#type_len">Type and Length</a> for further
+ details.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a>+</td>
+       <td class="td_left">The value slot number in the global values
+ that will be referenced in the function with the index of this entry in
+ the compaction table.</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="type_len">Type and Length</a></div>
+ <div class="doc_text">
+ <p>The type and length of a compaction table type plane is encoded
+ differently depending on the length of the plane. For planes of length
+ 1 or 2, the length is encoded into bits 0 and 1 of a <a
+  href="#uint32_vbr">uint32_vbr</a> and the type is encoded into bits
+ 2-31. Because type numbers are often small, this often saves an extra
+ byte per plane. If the length of the plane is greater than 2 then the
+ encoding uses a <a href="#uint32_vbr">uint32_vbr</a> for each of the
+ length and type, in that order.</p>
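+ <p>A minimal sketch of the short form only (planes of length 1 or 2) follows.
+ The helper names are hypothetical. How a reader distinguishes the short form
+ from the two-field form is not spelled out in this section, so the sketch does
+ not attempt it.</p>

```python
def pack_short_type_len(type_slot, length):
    """Pack a plane of length 1 or 2 into a single word.

    Length goes in bits 0-1 and the type slot in bits 2-31.
    """
    if length not in (1, 2):
        raise ValueError("short form only covers planes of length 1 or 2")
    if not 0 <= type_slot < (1 << 30):
        raise ValueError("type slot does not fit in bits 2-31")
    return (type_slot << 2) | length


def unpack_short_type_len(word):
    """Inverse of pack_short_type_len: returns (type_slot, length)."""
    return word >> 2, word & 0x3


packed = pack_short_type_len(5, 2)
```

+ <p>With a small type slot the packed word stays below 128, which is where the
+ one-byte saving mentioned above would come from under a 7-bits-per-byte vbr
+ scheme (an assumption about the vbr encoding, which is defined elsewhere in
+ this document).</p>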
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="instructionlist">Instruction List</a></div>
+ <div class="doc_text">
+ <p>The instructions in a function are written as a simple list. Basic
+ blocks are inferred by the terminating instruction types. The format of
+ the block is given in the following table.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a><br>
+       </td>
+       <td class="td_left">Instruction list identifier (0x07) + size<br>
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#instructions">instruction</a>+</td>
+       <td class="td_left">An instruction. Instructions have a variety
+ of formats. See <a href="#instructions">Instructions</a> for details.</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="instructions">Instructions</a></div>
+ 
+ <div class="doc_text">
+ <p>Instructions are written out one at a time as distinct units.  Each
+ instruction
+ record contains at least an <a href="#opcodes">opcode</a> and a type field, 
+ and may contain a <a href="#instoperands">list of operands</a> (whose
+ interpretation depends on the opcode). Based on the number of operands, the
+ <a href="#instencode">instruction is encoded</a> in a
+ dense format that tries to encode each instruction into 32 bits if 
+ possible. </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="opcodes">Instruction Opcodes</a></div>
+ <div class="doc_text">
+   <p>Instructions encode an opcode that identifies the kind of instruction.
+   Opcodes are an enumerated integer value. The specific values used depend on
+   the version of LLVM you're using. The opcode values are defined in the
+   <a href="http://llvm.org/cvsweb/cvsweb.cgi/llvm/include/llvm/Instruction.def">
+   <tt>include/llvm/Instruction.def</tt></a> file. You should check there for the
+   most recent definitions. The table below provides the opcodes defined as of
+   the writing of this document. The table associates each opcode mnemonic with
+   its enumeration value and the bytecode and LLVM version numbers in which the
+   opcode was introduced.</p>
+   <table>
+     <tbody>
+       <tr>
+         <th>Opcode</th>
+         <th>Number</th>
+         <th>Bytecode Version</th>
+         <th>LLVM Version</th>
+       </tr>
+       <tr><td colspan="4"><b>Terminator Instructions</b></td></tr>
+       <tr><td>Ret</td><td>1</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Br</td><td>2</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Switch</td><td>3</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Invoke</td><td>4</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Unwind</td><td>5</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Unreachable</td><td>6</td><td>1</td><td>1.4</td></tr>
+       <tr><td colspan="4"><b>Binary Operators</b></td></tr>
+       <tr><td>Add</td><td>7</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Sub</td><td>8</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Mul</td><td>9</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Div</td><td>10</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Rem</td><td>11</td><td>1</td><td>1.0</td></tr>
+       <tr><td colspan="4"><b>Logical Operators</b></td></tr>
+       <tr><td>And</td><td>12</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Or</td><td>13</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Xor</td><td>14</td><td>1</td><td>1.0</td></tr>
+       <tr><td colspan="4"><b>Binary Comparison Operators</b></td></tr>
+       <tr><td>SetEQ</td><td>15</td><td>1</td><td>1.0</td></tr>
+       <tr><td>SetNE</td><td>16</td><td>1</td><td>1.0</td></tr>
+       <tr><td>SetLE</td><td>17</td><td>1</td><td>1.0</td></tr>
+       <tr><td>SetGE</td><td>18</td><td>1</td><td>1.0</td></tr>
+       <tr><td>SetLT</td><td>19</td><td>1</td><td>1.0</td></tr>
+       <tr><td>SetGT</td><td>20</td><td>1</td><td>1.0</td></tr>
+       <tr><td colspan="4"><b>Memory Operators</b></td></tr>
+       <tr><td>Malloc</td><td>21</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Free</td><td>22</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Alloca</td><td>23</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Load</td><td>24</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Store</td><td>25</td><td>1</td><td>1.0</td></tr>
+       <tr><td>GetElementPtr</td><td>26</td><td>1</td><td>1.0</td></tr>
+       <tr><td colspan="4"><b>Other Operators</b></td></tr>
+       <tr><td>PHI</td><td>27</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Cast</td><td>28</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Call</td><td>29</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Shl</td><td>30</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Shr</td><td>31</td><td>1</td><td>1.0</td></tr>
+       <tr><td>VANext</td><td>32</td><td>1</td><td>1.0</td></tr>
+       <tr><td>VAArg</td><td>33</td><td>1</td><td>1.0</td></tr>
+       <tr><td>Select</td><td>34</td><td>2</td><td>1.2</td></tr>
+       <tr><td colspan="4">
+           <b>Pseudo Instructions<a href="#pi_note">*</a></b>
+       </td></tr>
+       <tr><td>Invoke+CC </td><td>56</td><td>5</td><td>1.5</td></tr>
+       <tr><td>Invoke+FastCC</td><td>57</td><td>5</td><td>1.5</td></tr>
+       <tr><td>Call+CC</td><td>58</td><td>5</td><td>1.5</td></tr>
+       <tr><td>Call+FastCC+TailCall</td><td>59</td><td>5</td><td>1.5</td></tr>
+       <tr><td>Call+FastCC</td><td>60</td><td>5</td><td>1.5</td></tr>
+       <tr><td>Call+CCC+TailCall</td><td>61</td><td>5</td><td>1.5</td></tr>
+       <tr><td>Load+Volatile</td><td>62</td><td>3</td><td>1.3</td></tr>
+       <tr><td>Store+Volatile</td><td>63</td><td>3</td><td>1.3</td></tr>
+     </tbody>
+   </table>
+ 
+ <p><b><a name="pi_note">* Note: </a></b>
+ These aren't really opcodes from an LLVM language perspective. They encode
+ information into other opcodes without reserving space for that information. 
+ For example, opcode=63 is a Volatile Store. The opcode for this
+ instruction is 25 (Store) but we encode it as 63 to indicate that it is a
+ Volatile Store. The same is done for the calling conventions and tail calls.
+ In each of these entries in range 56-63, the opcode is documented as the base
+ opcode (Invoke, Call, Load, Store) plus some set of modifiers, as follows:</p>
+ <dl>
+   <dt>CC</dt>
+   <dd>This means an arbitrary calling convention is specified
+   in a VBR that follows the opcode. This is used when the instruction cannot
+   be encoded with one of the more compact forms.
+   </dd>
+   <dt>FastCC</dt>
+   <dd>This indicates that the Call or Invoke is using the FastCC calling 
+   convention.</dd>
+   <dt>CCC</dt>
+   <dd>This indicates that the Call or Invoke is using the native "C" calling 
+   convention.</dd>
+   <dt>TailCall</dt>
+   <dd>This indicates that the Call has the 'tail' modifier.</dd>
+ </dl>
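+ 
+ <p>As a sketch, a reader could expand the pseudo opcodes with a lookup table
+ built directly from the opcode table above. The names
+ <tt>PSEUDO_OPCODES</tt> and <tt>expand_opcode</tt> are hypothetical and the
+ numbers are those listed in this document, which may differ in other LLVM
+ versions.</p>

```python
# Base opcode numbers taken from the opcode table.
BASE_OPCODES = {"Invoke": 4, "Load": 24, "Store": 25, "Call": 29}

# Pseudo opcode -> (base mnemonic, modifiers), as listed in the table.
PSEUDO_OPCODES = {
    56: ("Invoke", ("CC",)),
    57: ("Invoke", ("FastCC",)),
    58: ("Call", ("CC",)),
    59: ("Call", ("FastCC", "TailCall")),
    60: ("Call", ("FastCC",)),
    61: ("Call", ("CCC", "TailCall")),
    62: ("Load", ("Volatile",)),
    63: ("Store", ("Volatile",)),
}


def expand_opcode(opcode):
    """Return (base_opcode_number, modifiers) for an encoded opcode."""
    if opcode in PSEUDO_OPCODES:
        base, modifiers = PSEUDO_OPCODES[opcode]
        return BASE_OPCODES[base], modifiers
    # Ordinary opcodes carry no modifiers.
    return opcode, ()
```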
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="instoperands">Instruction
+ Operands</a></div>
+ 
+ <div class="doc_text">
+ <p>
+ Based on the instruction opcode and type, the bytecode format implicitly (to 
+ save space) specifies the interpretation of the operand list.  For most
+ instructions, the type of each operand is implicit from the type of the 
+ instruction itself (e.g. the type of operands of a binary operator must match
+ the type of the instruction).  As such, the bytecode format generally only 
+ encodes the value number of the operand, not the type.</p>
+ 
+ <p>In some cases, however, this is not sufficient.  This section enumerates
+ those cases:</p>
+ 
+ <ul>
+ <li>getelementptr: the slot numbers for sequential type indexes are shifted up
+ two bits.  This allows the low-order bits to encode the type of index used,
+ as follows: 0=uint, 1=int, 2=ulong, 3=long.</li>
+ <li>cast: the result type number is encoded as the second operand.</li>
+ <li>alloca/malloc: If the allocation has an explicit alignment, the log2 of the
+     alignment is encoded as the second operand.</li>
+ <li>call: If the tail marker and calling convention cannot be <a 
+     href="#pi_note">encoded into the opcode</a> of the call, it is passed as an
+     additional operand.  The low bit of the operand is a flag indicating whether
+     the call is a tail call.  The rest of the bits contain the calling 
+     convention number (shifted left by one bit).</li>
+ </ul>
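+ 
+ <p>Two of the cases above are simple bit packings and can be sketched as
+ follows. The function names are hypothetical, and each operand is assumed to
+ be an already-decoded integer.</p>

```python
# Index type codes stored in the two low-order bits of a getelementptr index.
GEP_INDEX_TYPES = ("uint", "int", "ulong", "long")


def decode_gep_index(operand):
    """Return (slot_number, index_type) for a sequential-type index operand."""
    return operand >> 2, GEP_INDEX_TYPES[operand & 0x3]


def decode_call_flags(operand):
    """Split the extra call operand into the tail flag and calling convention."""
    return {
        "tail_call": bool(operand & 1),       # low bit
        "calling_convention": operand >> 1,   # remaining bits
    }
```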
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="instencode">Instruction 
+ Encoding</a></div>
+ 
+ <div class="doc_text">
+ <p>For brevity, instructions are written in one of four formats,
+ depending on the number of operands to the instruction. Each
+ instruction begins with a <a href="#uint32_vbr">uint32_vbr</a> that
+ encodes the type of the instruction as well as other things. The tables
+ that follow describe the format of this first part of each instruction.</p>
+ <p><b>Instruction Format 0</b></p>
+ <p>This format is used for a few instructions that can't easily be
+ shortened because they have large numbers of operands (e.g. PHI Node or
+ getelementptr). Each of the opcode, type, and operand fields is found in
+ successive fields.</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Specifies the opcode of the instruction. Note
+ that for compatibility with the other instruction formats, the opcode
+ is shifted left by 2 bits. Bits 0 and 1 must have value zero for this
+ format.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint24_vbr">uint24_vbr</a></td>
+       <td class="td_left">Provides the type slot number of the result type of
+         the instruction.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">The number of operands that follow.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a>+</td>
+       <td class="td_left">The slot number of the value(s) for the operand(s).
+       </td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p><b>Instruction Format 1</b></p>
+ <p>This format encodes the opcode, type and a single operand into a
+ single <a href="#uint32_vbr">uint32_vbr</a> as follows:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Bits</b></th>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td>0-1</td>
+       <td>constant "1"</td>
+       <td class="td_left">These two bits must be the value 1 which identifies 
+         this as an instruction of format 1.</td>
+     </tr>
+     <tr>
+       <td>2-7</td>
+       <td><a href="#instructions">opcode</a></td>
+       <td class="td_left">Specifies the opcode of the instruction. Note that 
+         the maximum opcode value is 63.</td>
+     </tr>
+     <tr>
+       <td>8-19</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the type for this 
+         instruction. Maximum slot number is 2<sup>12</sup>-1=4095.</td>
+     </tr>
+     <tr>
+       <td>20-31</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the value for the 
+         first operand. Maximum slot number is 2<sup>12</sup>-1=4095. Note that 
+         the value 2<sup>12</sup>-1 denotes zero operands.</td>
+     </tr>
+   </tbody>
+ </table>
+ <p><b>Instruction Format 2</b></p>
+ <p>This format encodes the opcode, type and two operands into a single <a
+  href="#uint32_vbr">uint32_vbr</a> as follows:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Bits</b></th>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td>0-1</td>
+       <td>constant "2"</td>
+       <td class="td_left">These two bits must be the value 2 which identifies 
+         this as an instruction of format 2.</td>
+     </tr>
+     <tr>
+       <td>2-7</td>
+       <td><a href="#instructions">opcode</a></td>
+       <td class="td_left">Specifies the opcode of the instruction. Note that 
+         the maximum opcode value is 63.</td>
+     </tr>
+     <tr>
+       <td>8-15</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the type for this 
+         instruction. Maximum slot number is 2<sup>8</sup>-1=255.</td>
+     </tr>
+     <tr>
+       <td>16-23</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the value for the first 
+         operand. Maximum slot number is 2<sup>8</sup>-1=255.</td>
+     </tr>
+     <tr>
+       <td>24-31</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the value for the second 
+         operand. Maximum slot number is 2<sup>8</sup>-1=255.</td>
+     </tr>
+   </tbody>
+ </table>
+ <p><b>Instruction Format 3</b></p>
+ <p>This format encodes the opcode, type and three operands into a
+ single <a href="#uint32_vbr">uint32_vbr</a> as follows:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Bits</b></th>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td>0-1</td>
+       <td>constant "3"</td>
+       <td class="td_left">These two bits must be the value 3 which identifies 
+         this as an instruction of format 3.</td>
+     </tr>
+     <tr>
+       <td>2-7</td>
+       <td><a href="#instructions">opcode</a></td>
+       <td class="td_left">Specifies the opcode of the instruction. Note that 
+         the maximum opcode value is 63.</td>
+     </tr>
+     <tr>
+       <td>8-13</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the type for this 
+         instruction. Maximum slot number is 2<sup>6</sup>-1=63.</td>
+     </tr>
+     <tr>
+       <td>14-19</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the value for the first 
+         operand. Maximum slot number is 2<sup>6</sup>-1=63.</td>
+     </tr>
+     <tr>
+       <td>20-25</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the value for the second
+         operand. Maximum slot number is 2<sup>6</sup>-1=63.</td>
+     </tr>
+     <tr>
+       <td>26-31</td>
+       <td><a href="#unsigned">unsigned</a></td>
+       <td class="td_left">Specifies the slot number of the value for the third
+         operand. Maximum slot number is 2<sup>6</sup>-1=63.</td>
+     </tr>
+   </tbody>
+ </table>
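+ <p>As a rough illustration of the table above, the following C++ sketch unpacks
+ a format 3 word after it has been read as a <tt>uint32_vbr</tt>. It assumes
+ that bit 0 is the least significant bit of the word. The names
+ <tt>Format3Fields</tt>, <tt>decodeFormat3</tt> and <tt>encodeFormat3</tt> are
+ hypothetical and are not part of LLVM.</p>

```cpp
#include <cassert>
#include <cstdint>

// Fields carried by a format 3 instruction word (hypothetical helper type).
struct Format3Fields {
  unsigned opcode;    // bits 2-7
  unsigned typeSlot;  // bits 8-13
  unsigned operand1;  // bits 14-19
  unsigned operand2;  // bits 20-25
  unsigned operand3;  // bits 26-31
};

// Unpacks a format 3 word. Returns false if bits 0-1 do not hold the value 3.
// Assumption: bit 0 is the least significant bit of the word.
inline bool decodeFormat3(std::uint32_t word, Format3Fields &out) {
  if ((word & 0x3u) != 3u)
    return false;
  out.opcode   = (word >> 2)  & 0x3Fu;
  out.typeSlot = (word >> 8)  & 0x3Fu;
  out.operand1 = (word >> 14) & 0x3Fu;
  out.operand2 = (word >> 20) & 0x3Fu;
  out.operand3 = (word >> 26) & 0x3Fu;
  return true;
}

// Packs the fields back into one word. Every field must fit in 6 bits,
// which is why no slot number above 63 can use this format.
inline std::uint32_t encodeFormat3(const Format3Fields &f) {
  assert(f.opcode < 64 && f.typeSlot < 64);
  assert(f.operand1 < 64 && f.operand2 < 64 && f.operand3 < 64);
  return 3u | (static_cast<std::uint32_t>(f.opcode)   << 2)
            | (static_cast<std::uint32_t>(f.typeSlot) << 8)
            | (static_cast<std::uint32_t>(f.operand1) << 14)
            | (static_cast<std::uint32_t>(f.operand2) << 20)
            | (static_cast<std::uint32_t>(f.operand3) << 26);
}
```

+ <p>Format 2 could be handled the same way, with 8-bit fields at bits 8-15,
+ 16-23 and 24-31 and the constant 2 in bits 0-1.</p>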
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="symtab">Symbol Table</a> </div>
+ <div class="doc_text">
+ <p>A symbol table can be put out in conjunction with a module or a function. A
+ symbol table has a list of name/type associations followed by a list of
+ name/value associations. The name/value associations are organized into "type
+ planes" so that all values of a common type are listed together.  Each type 
+ plane starts with the number of entries in the plane and the type slot number
+ for all the values in that plane (so the type can be looked up in the global 
+ type pool). For each entry in a type plane, the slot number of the value and 
+ the name associated with that value are written. The format is given in the 
+ table below. </p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#block">block</a><br>
+       </td>
+       <td class="td_left">Symbol Table Identifier (0x04)</td>
+     </tr>
+     <tr>
+       <td><a href="#llist">llist</a>(<a href="#type_entry">type_entry</a>)</td>
+       <td class="td_left">A length list of symbol table entries for
+         <tt>Type</tt>s
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#llist">llist</a>(<a href="#symtab_plane">symtab_plane</a>)</td>
+       <td class="td_left">A length list of "type planes" of symbol table
+         entries for <tt>Value</tt>s</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="type_entry">Symbol Table Type
+ Entry</a>
+ </div>
+ <div class="doc_text">
+ <p>A symbol table type entry associates a name with a type. The name is provided
+ simply as an array of chars. The type is provided as a type slot number (index)
+ into the global type pool. The format is given in the following table:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint24_vbr</a></td>
+       <td class="td_left">Type slot number of the type being given a
+         name relative to the global type pool.
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Length of the character array that follows.</td>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a>+</td>
+       <td class="td_left">The characters of the name.</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="symtab_plane">Symbol Table
+ Plane</a>
+ </div>
+ <div class="doc_text">
+ <p>A symbol table plane provides the symbol table entries for all
+ values of a common type. The encoding is given in the following table:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Number of entries in this plane.</td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Type slot number of type for all values in this plane.</td>
+     </tr>
+     <tr>
+       <td><a href="#value_entry">value_entry</a>+</td>
+       <td class="td_left">The symbol table entries that associate values with
+         names.</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="value_entry">Symbol Table Value
+ Entry</a>
+ </div>
+ <div class="doc_text">
+ <p>A symbol table value entry provides the association between a value and the
+ name given to the value. The value is referenced by its slot number. The
+ format is given in the following table:</p>
+ <table>
+   <tbody>
+     <tr>
+       <th><b>Type</b></th>
+       <th class="td_left"><b>Field Description</b></th>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint24_vbr</a></td>
+       <td class="td_left">Value slot number of the value being given a name.
+       </td>
+     </tr>
+     <tr>
+       <td><a href="#uint32_vbr">uint32_vbr</a></td>
+       <td class="td_left">Length of the character array that follows.</td>
+     </tr>
+     <tr>
+       <td><a href="#char">char</a>+</td>
+       <td class="td_left">The characters of the name.</td>
+     </tr>
+   </tbody>
+ </table>
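+ <p>The plane and value entry layouts above can be sketched in C++ as follows.
+ This is only an illustration: it assumes the variable bit rate encoding
+ stores 7 value bits per byte, least significant group first, with the high
+ bit set when another byte follows. The names <tt>readVBR</tt>,
+ <tt>SymtabPlane</tt> and <tt>parsePlane</tt> are hypothetical and are not
+ LLVM APIs.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// Reads one variable bit rate unsigned value at buf[pos] and advances pos.
// Assumption: 7 value bits per byte, least significant group first, and the
// high bit of a byte is set when another byte follows.
inline std::uint32_t readVBR(const std::vector<std::uint8_t> &buf,
                             std::size_t &pos) {
  std::uint32_t value = 0;
  unsigned shift = 0;
  for (;;) {
    assert(pos < buf.size() && "truncated VBR value");
    std::uint8_t byte = buf[pos++];
    value |= static_cast<std::uint32_t>(byte & 0x7Fu) << shift;
    if ((byte & 0x80u) == 0)
      return value;
    shift += 7;
    assert(shift < 32 && "VBR value too long for 32 bits");
  }
}

// One type plane: the shared type slot plus (value slot, name) pairs.
struct SymtabPlane {
  std::uint32_t typeSlot = 0;
  std::vector<std::pair<std::uint32_t, std::string>> entries;
};

// Parses a plane: entry count, type slot, then that many value entries.
// The 24-bit value slot is read with the same routine as 32-bit fields.
inline SymtabPlane parsePlane(const std::vector<std::uint8_t> &buf,
                              std::size_t &pos) {
  SymtabPlane plane;
  std::uint32_t count = readVBR(buf, pos);
  plane.typeSlot = readVBR(buf, pos);
  for (std::uint32_t i = 0; i < count; ++i) {
    std::uint32_t valueSlot = readVBR(buf, pos);
    std::uint32_t length = readVBR(buf, pos);
    assert(length <= buf.size() - pos && "truncated name");
    std::string name(reinterpret_cast<const char *>(buf.data()) + pos, length);
    pos += length;
    plane.entries.emplace_back(valueSlot, name);
  }
  return plane;
}
```

+ <p>A symbol table type entry has the same shape as a value entry, so the
+ loop body would apply to the list of type entries as well.</p>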
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="versiondiffs">Version Differences</a>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+ <p>This section describes the differences in the Bytecode Format across
+ LLVM versions. The versions are listed in reverse order because the
+ current version is assumed to be as documented in the previous sections.
+ Each section here describes the differences between that version and the
+ one that <i>follows</i>.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="vers13">Version 1.3 Differences From 
+     1.4</a></div>
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_subsubsection">Unreachable Instruction</div>
+ <div class="doc_text">
+   <p>The LLVM <a href="LangRef.html#i_unreachable">Unreachable</a> instruction
+   was added in version 1.4 of LLVM.  This caused all instruction numbers after
+   it to shift down by one.</p>
+ </div>
+ 
+ <div class="doc_subsubsection">Function Flags</div>
+ <div class="doc_text">
+   <p>LLVM bytecode versions prior to 1.4 did not include the 5-bit offset 
+      in <a href="#funcfield">the function list</a> in the <a
+      href="#globalinfo">Module Global Info</a> block.</p>
+ </div>
+ 
+ <div class="doc_subsubsection">Undef Constant</div>
+ <div class="doc_text">
+   <p>LLVM bytecode versions prior to 1.4 did not include the 'undef' constant
+      value, which affects the encoding of <a href="#constant">Constant
+      Fields</a>.</p>
+ </div>
+ 
+ <!--
+ <div class="doc_subsubsection">Aligned Data</div>
+ <div class="doc_text">
+   <p>In version 1.3, certain data items were aligned to 32-bit boundaries. In
+   version 1.4, alignment of data was done away with completely. The need for
+   alignment has gone away and the only thing it adds is bytecode file size
+   overhead. In most cases this overhead was small. However, in functions with
+   large numbers of format 0 instructions (GEPs and PHIs with lots of parameters)
+   or regular instructions with large valued operands (e.g. because there's just
+   a lot of instructions in the function) the overhead can be extreme. In one
+   test case, the overhead was 44,000 bytes (34% of the total file size).
+   Consequently in release 1.4, the decision was made to eliminate alignment
+   altogether.</p>
+   <p>In version 1.3 format, the following bytecode constructs were aligned (i.e.
+   they were followed by one to three bytes of padding):</p>
+   <ul>
+     <li>All blocks.</li>
+     <li>Instructions using the long format (format 0).</li>
+     <li>All call instructions that called a var args function.</li>
+     <li>The target triple (a string field at the end of the module block).</li>
+     <li>The version field (immediately following the signature).</li>
+   </ul>
+   <p>None of these constructs are aligned in version 1.4</p>
+ </div>
+ -->
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="vers12">Version 1.2 Differences
+ From 1.3</a></div>
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_subsubsection">Type Derives From Value</div>
+ <div class="doc_text">
+ <p>In version 1.2, the Type class in the LLVM IR derives from the Value
+ class. This is not the case in version 1.3. Consequently, in version
+ 1.2 the notion of a "Type Type" was used to write out values that were
+ Types. The types always occupied plane 12 (corresponding to the
+ TypeTyID) of any type-planed set of values. In 1.3 this representation
+ is not convenient because the TypeTyID (12) is not present and its
+ value is now used for LabelTyID. Consequently, the data structures
+ written that involve types do so by writing all the types first and
+ then each of the value planes according to those types. In version 1.2,
+ the types would have been written intermingled with the values.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Restricted getelementptr Types</div>
+ <div class="doc_text">
+ <p>In version 1.2, the getelementptr instruction required a ubyte type
+ index for accessing a structure field and a long type index for
+ accessing an array element. Consequently, it was only possible to
+ access structures of 255 or fewer elements. Starting in version 1.3,
+ this restriction was lifted. Structures must now be indexed with uint
+ constants. Arrays may now be indexed with int, uint, long, or ulong
+ typed values. The consequence of this was that the bytecode format had
+ to change in order to accommodate the larger range of structure indices.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Short Block Headers</div>
+ <div class="doc_text">
+ <p>In version 1.2, block headers were always 8 bytes, comprising
+ both an unsigned integer type and an unsigned integer size. For very
+ small modules, these block headers turn out to be a large fraction of
+ the total bytecode file size. In an attempt to make these small files
+ smaller, the type and size information was encoded into a single
+ unsigned integer (4 bytes) comprising 5 bits for the block type
+ (maximum 31 block types) and 27 bits for the block size (max
+ ~134MBytes). These limits seemed sufficient for any blocks or sizes
+ foreseen in the future. Note that the module block, which encloses all
+ the other blocks, is still written as 8 bytes since bytecode files
+ larger than 134MBytes might be possible.</p>
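+ <p>The 5/27 bit split can be sketched in C++ as shown here. The text above
+ does not say which end of the word holds the block type, so this sketch
+ assumes the type occupies the low 5 bits and the size the upper 27 bits;
+ <tt>BlockHeader</tt>, <tt>packBlockHeader</tt> and
+ <tt>unpackBlockHeader</tt> are hypothetical names.</p>

```cpp
#include <cassert>
#include <cstdint>

// A short block header after unpacking (hypothetical helper type).
struct BlockHeader {
  unsigned type;       // 5 bits
  std::uint32_t size;  // 27 bits, in bytes
};

// Packs a block type and size into one 4-byte word.
// Assumption: the type sits in the low 5 bits, the size in the upper 27.
inline std::uint32_t packBlockHeader(unsigned type, std::uint32_t size) {
  assert(type < (1u << 5) && "block type needs more than 5 bits");
  assert(size < (1u << 27) && "block size needs more than 27 bits");
  return (size << 5) | type;
}

// Splits a 4-byte header word back into its type and size.
inline BlockHeader unpackBlockHeader(std::uint32_t word) {
  return BlockHeader{word & 0x1Fu, word >> 5};
}
```

+ <p>The largest size that fits is 2<sup>27</sup>-1 = 134,217,727 bytes, which
+ is where the ~134MBytes limit comes from.</p>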
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Dependent Libraries and Target Triples</div>
+ <div class="doc_text">
+ <p>In version 1.2, the bytecode format does not store a module's target
+ triple or dependent libraries. These fields have been added to the end of the <a
+  href="#globalinfo">module global info block</a>. The purpose of these
+ fields is to allow a front end compiler to specify that the generated
+ module is specific to a particular target triple (operating
+ system/manufacturer/processor) which makes it non-portable; and to
+ allow front end compilers to specify the list of libraries that the
+ module depends on for successful linking.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Types Restricted to 24-bits</div>
+ <div class="doc_text">
+ <p>In version 1.2, type slot identifiers were written as 32-bit VBR
+ quantities. In 1.3 this has been reduced to 24 bits in order to ensure
+ that it is not possible to overflow the type field of a global variable
+ definition. 24 bits for type slot numbers is deemed sufficient for any
+ practical use of LLVM.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="vers11">Version 1.1 Differences
+ From 1.2 </a></div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Explicit Primitive Zeros</div>
+ <div class="doc_text">
+ <p>In version 1.1, the zero value for primitives was explicitly encoded
+ into the bytecode format. Since these zero values are constant values
+ in the LLVM IR and never change, there is no reason to explicitly
+ encode them. This explicit encoding was removed in version 1.2.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Inconsistent Module Global Info</div>
+ <div class="doc_text">
+ <p>In version 1.1, the Module Global Info block was not aligned, causing
+ the next block to be read in on an unaligned boundary. This problem was
+ corrected in version 1.2.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="vers10">Version 1.0 Differences
+ From 1.1</a></div>
+ <div class="doc_text">
+ <p>None. Version 1.0 and 1.1 bytecode formats are identical.</p>
+ </div>
+ <!-- *********************************************************************** -->
+ <hr>
+ <address> <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+  src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+ <a href="http://validator.w3.org/check/referer"><img
+  src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ <a href="mailto:rspencer at x10sys.com">Reid Spencer</a> and <a
+  href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+ <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+ Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/CFEBuildInstrs.html
diff -c /dev/null llvm-www/releases/1.8/docs/CFEBuildInstrs.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/CFEBuildInstrs.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,364 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+   <link rel="stylesheet" href="llvm.css" type="text/css" media="screen">
+   <title>Bootstrapping the LLVM C/C++ Front-End</title>
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   Bootstrapping the LLVM C/C++ Front-End
+ </div>
+ 
+ <ol>
+   <li><a href="#cautionarynote">A Cautionary Note</a>
+     <ul>
+       <li><a href="#cygwin">Building under Cygwin</a></li>
+       <li><a href="#aix">Building under AIX</a></li>
+     </ul>
+   </li>
+   <li><a href="#instructions">llvm-gcc4 Instructions</a></li>
+   <li><a href="#llvm-gcc3-instructions">llvm-gcc3 Instructions</a></li>
+   <li><a href="#license">License Information</a></li>
+ </ol>
+ 
+ <div class="doc_author">    
+   <p>Written by Brian R. Gaeke and 
+      <a href="http://nondot.org/sabre">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="cautionarynote">A Cautionary Note</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>This document is intended to explain the process of building the
+ LLVM C/C++ front-end from its source code. You have to do this, for example, if
+ you are porting LLVM to a new architecture or operating system, if you are
+ working from Top-Of-Tree CVS/SVN, or if there is no precompiled snapshot
+ available.</p>
+ 
+ <p><b>NOTE:</b> This is currently a somewhat fragile, error-prone
+ process, and you should <b>only</b> try to do it if:</p>
+ 
+ <ol>
+   <li>you really, really, really can't use the binaries we distribute.</li>
+   <li>you are an elite GCC hacker.</li>
+   <li>you want to use the latest bits from CVS.</li>
+ </ol>
+ 
+ <p>We welcome patches to help make this process simpler.</p>
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_subsection">
+   <a name="cygwin">Building under Cygwin</a>
+ </div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ <p>If you are building LLVM and the GCC front-end under Cygwin, please note that
+ the LLVM and GCC makefiles do not correctly handle spaces in paths.  To deal
+ with this issue, make sure that your LLVM and GCC source and build trees are 
+ located in a top-level directory (like <tt>/cygdrive/c/llvm</tt> and 
+ <tt>/cygdrive/c/llvm-cfrontend</tt>), not in a directory that contains a space
+ (which includes your "home directory", because it lives under the "Documents 
+ and Settings" directory).  We welcome patches to fix this issue.
+ </p>
+ <p>It has been found that the GCC 3.3.3 compiler provided with recent Cygwin
+ versions is incapable of compiling the LLVM GCC front-end correctly. If your
+ Cygwin
+ installation includes GCC 3.3.3, we <i>strongly</i> recommend that you download
+ GCC 3.4.3, build it separately, and use it for compiling the LLVM GCC front-end.
+  This has been
+ shown to work correctly.</p>
+ <p>Some versions of Cygwin utilize an experimental version of GNU binutils that
+ will cause the GNU <tt>ld</tt> linker to fail an assertion when linking
+ components of the libstdc++. It is recommended that you replace the entire
+ binutils package with version 2.15 such that "<tt>ld --version</tt>" responds
+ with</p>
+ <pre>GNU ld version 2.15</pre>
+ not with:<br/>
+ <pre>GNU ld version 2.15.91 20040725</pre>
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_subsection"><a name="aix">Building under AIX</a></div>
+ <div class="doc_text">
+ <p>If you are building LLVM and the GCC front-end under AIX, do NOT use GNU
+ Binutils.  They are not stable under AIX and may produce incorrect and/or
+ invalid code.  Instead, use the system assembler and linker.
+ </p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="instructions">llvm-gcc4 Instructions</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section describes how to acquire and build llvm-gcc4, which is based on
+ the GCC 4.0.1 front-end.  This front-end supports C, C++, Objective-C, and 
+ Objective-C++.  Note that the instructions for building this front-end are
+ completely different than those for building llvm-gcc3.
+ </p>
+ 
+ <ol>
+ <li>
+ <p>Retrieve the appropriate llvm-gcc4-x.y.source.tar.gz archive from the llvm
+ web site.</p>
+ <p>It is also possible to download the sources of the llvm-gcc4 front end from
+ a read-only mirror using subversion.  To check out the code the first time use:
+ </p>
+ 
+ <tt>svn co svn://anonsvn.opensource.apple.com/svn/llvm/trunk
+ <i>dst-directory</i></tt>
+ 
+ <p>After that, the code can be updated in the destination directory using:
+ </p>
+ 
+ <tt>svn update</tt>
+ 
+ <p>The mirror is brought up to date every evening.</p>
+ </li>
+ 
+ <li>Follow the directions in the top-level README.LLVM file for up-to-date
+     instructions on how to build llvm-gcc4.</li>
+ </ol>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="llvm-gcc3-instructions">llvm-gcc3 Instructions</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <ol>
+ <li>Acquire llvm-gcc3 from <a href="GettingStarted.html#checkout">LLVM CVS</a> or
+ from a <a href="http://llvm.org/releases/">release tarball</a>.</li>
+ 
+ <li><p>Configure and build the LLVM libraries and tools. There are two ways to
+ do this: either with <i>objdir</i> == <i>srcdir</i> or
+ <i>objdir</i> != <i>srcdir</i>. It is recommended 
+ that <i>srcdir</i> be the same as <i>objdir</i> for your LLVM tree (but note
+ that you should always use <i>srcdir</i> != <i>objdir</i> for llvm-gcc):</p>
+ <ul>
+   <li>With <i>objdir</i> != <i>srcdir</i>:<pre>
+  % cd <i>objdir</i>
+  % <i>srcdir</i>/configure --prefix=/some/path/you/can/install/to [options...]
+  % gmake tools-only
+   </pre></li>
+   <li>With <i>objdir</i> == <i>srcdir</i>:<pre>
+  % cd llvm
+  % ./configure --prefix=/some/path/you/can/install/to [options...]
+  % gmake tools-only
+   </pre></li>
+ </ul>
+ <p>This will build all of the LLVM tools and libraries. The <tt>--prefix</tt> 
+ option defaults to /usr/local (per configure standards) but unless you are a 
+ system administrator, you probably won't be able to install LLVM there because
+ of permissions. Specify a path into which LLVM can be installed (e.g.
+ <tt>--prefix=/home/user/llvm</tt>).</p>
+ </li>
+ 
+ <li><p>Add the directory containing the tools to your PATH.</p>
+ <pre>
+  % set path = ( `cd llvm/Debug/bin && pwd` $path )
+ </pre></li>
+ 
+ <li><p>Unpack the C/C++ front-end source into cfrontend/src, either by
+        untar'ing a cfrontend.source.tar.gz file or checking out CVS into this
+        directory.</p></li>
+ 
+ <li><p>Make "build" and "install" directories as siblings of the "src" tree:</p>
+ <pre>
+  % pwd
+  /usr/local/example/cfrontend/src
+  % cd ..
+  % mkdir build install
+  % set CFEINSTALL = `pwd`/install
+ </pre></li>
+ 
+ 
+ <li><p>Configure, build, and install the GCC front-end:</p>
+ 
+ <p>
+ <b>Linux/x86:</b><br>
+ <b>Linux/IA-64:</b><br>
+ <b>MacOS X/PowerPC</b> (requires dlcompat library):<br>
+ <b>AIX/PowerPC:</b>
+ </p>
+ 
+ <pre>
+  % cd build
+  % ../src/configure --prefix=$CFEINSTALL --disable-threads --disable-nls \
+    --disable-shared --enable-languages=c,c++ --program-prefix=llvm-
+  % gmake all; gmake install
+ </pre>
+ 
+ <p><b>Cygwin/x86:</b></p>
+ 
+ <pre>
+  % cd build
+  % ../src/configure --prefix=$CFEINSTALL --disable-threads --disable-nls \
+    --disable-shared --enable-languages=c,c++ --disable-c-mbchar \
+    --program-prefix=llvm-
+  % gmake all; gmake install
+ </pre>
+ 
+ <p><b>Solaris/SPARC:</b></p>
+ 
+ <p>
+ The GCC front-end can be configured for either SPARC V8 (32 bit) or SPARC V9 (64
+ bit).  This changes, among other things, the sizes of integer types and the
+ macros defined for conditional compilation.
+ </p>
+ 
+ <p>
+ The SPARC V8 ABI support is more robust than the V9 ABI support and can generate
+ SPARC V9 code.  It is highly recommended that you use the V8 ABI with LLVM, as
+ shown below.  Also,
+ note that Solaris has trouble with various wide (multibyte) character
+ functions from C as referenced from C++, so we typically configure with
+ --disable-c-mbchar (cf. <a href="http://llvm.org/PR206">Bug 206</a>).
+ </p>
+ 
+ <pre>
+  % cd build
+  % ../src/configure --prefix=$CFEINSTALL --disable-threads --disable-nls \
+    --disable-shared --enable-languages=c,c++ --host=sparc-sun-solaris2.8 \
+    --disable-c-mbchar --program-prefix=llvm-
+  % gmake all; gmake install
+ </pre>
+ 
+  <p><b>Common Problem:</b> You may get error messages regarding the fact
+  that LLVM does not support inline assembly. Here are two common
+  fixes:</p>
+ 
+  <ul>
+   <li><p><b>Fix 1:</b> If you have system header files that include
+    inline assembly, you may have to modify them to remove the inline
+    assembly and install the modified versions in
+    <code>$CFEINSTALL/lib/gcc/<i>target-triplet</i>/3.4-llvm/include</code>.</p></li>
+ 
+   <li><b>Fix 2:</b> If you are building the C++ front-end on a CPU we
+    haven't tried yet, you will probably have to edit the appropriate
+    version of atomicity.h under
+    <code>src/libstdc++-v3/config/cpu/<i>name-of-cpu</i>/atomicity.h</code>
+    and apply a patch so that it does not use inline assembly.</li>
+  </ul>
+ 
+  <p><b>Porting to a new architecture:</b> If you are porting the front-end
+  to a new architecture or compiling in a configuration that we have
+  not tried previously, there are probably several changes you will have to make
+  to the GCC target to get it to work correctly.  These include:</p>
+ 
+  <ul>
+   <li>Often targets include special assembler or linker flags which
+       <tt>gccas</tt>/<tt>gccld</tt> does not understand.  In general, these can
+       just be removed.</li>
+   <li>LLVM currently does not support any floating point values other than 
+       32-bit and 64-bit IEEE floating point.  The primary effect of this is
+       that you may have to map "long double" onto "double".</li>
+   <li>The profiling hooks in GCC do not apply at all to the LLVM front-end.
+       These may need to be disabled.</li>
+   <li>No inline assembly for position independent code.  At the LLVM level,
+       everything is position independent.</li>
+   <li>We handle <tt>.init</tt> and <tt>.fini</tt> differently.</li>
+   <li>You may have to disable multilib support in your target.  Using multilib
+       support causes the GCC compiler driver to add a lot of "<tt>-L</tt>"
+       options to the link line, which do not relate to LLVM and confuse
+       <tt>gccld</tt>.  To disable multilibs, delete any
+       <tt>MULTILIB_OPTIONS</tt> lines from your target files.</li>
+   <li>Did we mention that we don't support inline assembly?  You'll probably
+       have to add some fixinclude hacks to disable it in the system
+       headers.</li>
+  </ul>
+ </li>
+ 
+ <li><p>Put <tt>$CFEINSTALL/bin</tt> into your <tt>PATH</tt> environment
+ variable.</p>
+   <ul>
+     <li>sh: <tt>export PATH=$CFEINSTALL/bin:$PATH</tt></li>
+     <li>csh: <tt>setenv PATH $CFEINSTALL/bin:$PATH</tt></li>
+   </ul>
+ </li>
+ 
+ <li><p>Go back into the LLVM source tree proper.  Rerun configure, using
+ the same options as the last time. This will cause the configuration to now find
+ the newly built llvm-gcc and llvm-g++ executables. </p></li>
+ 
+ <li><p>Rebuild your CVS tree.  This shouldn't cause the whole thing to be
+   rebuilt, but it should build the runtime libraries.  After the tree is
+   built, install the runtime libraries into your GCC front-end build tree.
+   These are the commands you need:</p>
+ <pre>
+  % gmake
+  % gmake -C runtime install-bytecode
+ </pre></li>
+ 
+ <li><p>Optionally, build a symbol table for the newly installed runtime 
+ libraries. Although this step is optional, you are strongly encouraged to 
+ do this as the symbol tables will make a significant difference in your 
+ link times. Use the <tt>llvm-ranlib</tt> tool to do this, as follows:</p>
+ <pre>
+  % cd $CFEINSTALL/lib
+  % llvm-ranlib libiberty.a
+  % llvm-ranlib libstdc++.a
+  % llvm-ranlib libsupc++.a
+  % cd $CFEINSTALL/lib/gcc/<i>target-triplet</i>/3.4-llvm
+  % llvm-ranlib libgcc.a
+  % llvm-ranlib libgcov.a
+ </pre></li>
+ 
+ <li><p>Test the newly-installed C frontend by one or more of the
+ following means:</p>
+  <ul>
+   <li> running the feature & regression tests via <tt>make check</tt></li>
+   <li> compiling and running a "hello, LLVM" program in C and C++.</li>
+   <li> running the tests found in the <tt>llvm-test</tt> CVS module</li>
+  </ul></li>
+ </ol>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="license">License Information</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ The LLVM GCC frontend is licensed to you under the GNU General Public License
+ and the GNU Lesser General Public License.  Please see the files COPYING and
+ COPYING.LIB for more details.
+ </p>
+ 
+ <p>
+ More information is <a href="FAQ.html#license">available in the FAQ</a>.
+ </p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   Brian Gaeke<br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/CodeGenerator.html
diff -c /dev/null llvm-www/releases/1.8/docs/CodeGenerator.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/CodeGenerator.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1293 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>The LLVM Target-Independent Code Generator</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   The LLVM Target-Independent Code Generator
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a>
+     <ul>
+       <li><a href="#required">Required components in the code generator</a></li>
+       <li><a href="#high-level-design">The high-level design of the code
+           generator</a></li>
+       <li><a href="#tablegen">Using TableGen for target description</a></li>
+     </ul>
+   </li>
+   <li><a href="#targetdesc">Target description classes</a>
+     <ul>
+       <li><a href="#targetmachine">The <tt>TargetMachine</tt> class</a></li>
+       <li><a href="#targetdata">The <tt>TargetData</tt> class</a></li>
+       <li><a href="#targetlowering">The <tt>TargetLowering</tt> class</a></li>
+       <li><a href="#mregisterinfo">The <tt>MRegisterInfo</tt> class</a></li>
+       <li><a href="#targetinstrinfo">The <tt>TargetInstrInfo</tt> class</a></li>
+       <li><a href="#targetframeinfo">The <tt>TargetFrameInfo</tt> class</a></li>
+       <li><a href="#targetsubtarget">The <tt>TargetSubtarget</tt> class</a></li>
+       <li><a href="#targetjitinfo">The <tt>TargetJITInfo</tt> class</a></li>
+     </ul>
+   </li>
+   <li><a href="#codegendesc">Machine code description classes</a>
+     <ul>
+     <li><a href="#machineinstr">The <tt>MachineInstr</tt> class</a></li>
+     <li><a href="#machinebasicblock">The <tt>MachineBasicBlock</tt>
+                                      class</a></li>
+     <li><a href="#machinefunction">The <tt>MachineFunction</tt> class</a></li>
+     </ul>
+   </li>
+   <li><a href="#codegenalgs">Target-independent code generation algorithms</a>
+     <ul>
+     <li><a href="#instselect">Instruction Selection</a>
+       <ul>
+       <li><a href="#selectiondag_intro">Introduction to SelectionDAGs</a></li>
+       <li><a href="#selectiondag_process">SelectionDAG Code Generation
+                                           Process</a></li>
+       <li><a href="#selectiondag_build">Initial SelectionDAG
+                                         Construction</a></li>
+       <li><a href="#selectiondag_legalize">SelectionDAG Legalize Phase</a></li>
+       <li><a href="#selectiondag_optimize">SelectionDAG Optimization
+                                            Phase: the DAG Combiner</a></li>
+       <li><a href="#selectiondag_select">SelectionDAG Select Phase</a></li>
+       <li><a href="#selectiondag_sched">SelectionDAG Scheduling and Formation
+                                         Phase</a></li>
+       <li><a href="#selectiondag_future">Future directions for the
+                                          SelectionDAG</a></li>
+       </ul></li>
+     <li><a href="#codeemit">Code Emission</a>
+         <ul>
+         <li><a href="#codeemit_asm">Generating Assembly Code</a></li>
+         <li><a href="#codeemit_bin">Generating Binary Machine Code</a></li>
+         </ul></li>
+     </ul>
+   </li>
+   <li><a href="#targetimpls">Target-specific Implementation Notes</a>
+     <ul>
+     <li><a href="#x86">The X86 backend</a></li>
+     </ul>
+   </li>
+ 
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <div class="doc_warning">
+   <p>Warning: This is a work in progress.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM target-independent code generator is a framework that provides a
+ suite of reusable components for translating the LLVM internal representation to
+ the machine code for a specified target -- either in assembly form (suitable for
+ a static compiler) or in binary machine code format (usable for a JIT compiler).
+ The LLVM target-independent code generator consists of five main components:</p>
+ 
+ <ol>
+ <li><a href="#targetdesc">Abstract target description</a> interfaces which
+ capture important properties about various aspects of the machine, independently
+ of how they will be used.  These interfaces are defined in
+ <tt>include/llvm/Target/</tt>.</li>
+ 
+ <li>Classes used to represent the <a href="#codegendesc">machine code</a> being
+ generated for a target.  These classes are intended to be abstract enough to
+ represent the machine code for <i>any</i> target machine.  These classes are
+ defined in <tt>include/llvm/CodeGen/</tt>.</li>
+ 
+ <li><a href="#codegenalgs">Target-independent algorithms</a> used to implement
+ various phases of native code generation (register allocation, scheduling, stack
+ frame representation, etc).  This code lives in <tt>lib/CodeGen/</tt>.</li>
+ 
+ <li><a href="#targetimpls">Implementations of the abstract target description
+ interfaces</a> for particular targets.  These machine descriptions make use of
+ the components provided by LLVM, and can optionally provide custom
+ target-specific passes, to build complete code generators for a specific target.
+ Target descriptions live in <tt>lib/Target/</tt>.</li>
+ 
+ <li><a href="#jit">The target-independent JIT components</a>.  The LLVM JIT is
+ completely target independent (it uses the <tt>TargetJITInfo</tt> structure to
+ interface for target-specific issues).  The code for the target-independent
+ JIT lives in <tt>lib/ExecutionEngine/JIT</tt>.</li>
+ 
+ </ol>
+ 
+ <p>
+ Depending on which part of the code generator you are interested in working on,
+ different pieces of this will be useful to you.  In any case, you should be
+ familiar with the <a href="#targetdesc">target description</a> and <a
+ href="#codegendesc">machine code representation</a> classes.  If you want to add
+ a backend for a new target, you will need to <a href="#targetimpls">implement the
+ target description</a> classes for your new target and understand the <a
+ href="LangRef.html">LLVM code representation</a>.  If you are interested in
+ implementing a new <a href="#codegenalgs">code generation algorithm</a>, it
+ should only depend on the target-description and machine code representation
+ classes, ensuring that it is portable.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+  <a name="required">Required components in the code generator</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The two pieces of the LLVM code generator are the high-level interface to the
+ code generator and the set of reusable components that can be used to build
+ target-specific backends.  The two most important interfaces (<a
+ href="#targetmachine"><tt>TargetMachine</tt></a> and <a
+ href="#targetdata"><tt>TargetData</tt></a>) are the only ones that are
+ required to be defined for a backend to fit into the LLVM system, but the others
+ must be defined if the reusable code generator components are going to be
+ used.</p>
+ 
+ <p>This design has two important implications.  The first is that LLVM can
+ support completely non-traditional code generation targets.  For example, the C
+ backend does not require register allocation, instruction selection, or any of
+ the other standard components provided by the system.  As such, it only
+ implements these two interfaces, and does its own thing.  Another example of a
+ code generator like this is a (purely hypothetical) backend that converts LLVM
+ to the GCC RTL form and uses GCC to emit machine code for a target.</p>
+ 
+ <p>This design also implies that it is possible to design and
+ implement radically different code generators in the LLVM system that do not
+ make use of any of the built-in components.  Doing so is not recommended at all,
+ but could be required for radically different targets that do not fit into the
+ LLVM machine description model: programmable FPGAs for example.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+  <a name="high-level-design">The high-level design of the code generator</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM target-independent code generator is designed to support efficient and
+ quality code generation for standard register-based microprocessors.  Code
+ generation in this model is divided into the following stages:</p>
+ 
+ <ol>
+ <li><b><a href="#instselect">Instruction Selection</a></b> - This phase
+ determines an efficient way to express the input LLVM code in the target
+ instruction set.
+ This stage produces the initial code for the program in the target instruction
+ set, then makes use of virtual registers in SSA form and physical registers that
+ represent any required register assignments due to target constraints or calling
+ conventions.  This step turns the LLVM code into a DAG of target
+ instructions.</li>
+ 
+ <li><b><a href="#selectiondag_sched">Scheduling and Formation</a></b> - This
+ phase takes the DAG of target instructions produced by the instruction selection
+ phase, determines an ordering of the instructions, then emits the instructions
+ as <tt><a href="#machineinstr">MachineInstr</a></tt>s with that ordering.  Note
+ that we describe this in the <a href="#instselect">instruction selection
+ section</a> because it operates on a <a
+ href="#selectiondag_intro">SelectionDAG</a>.
+ </li>
+ 
+ <li><b><a href="#ssamco">SSA-based Machine Code Optimizations</a></b> - This 
+ optional stage consists of a series of machine-code optimizations that 
+ operate on the SSA-form produced by the instruction selector.  Optimizations 
+ like modulo-scheduling or peephole optimization work here.
+ </li>
+ 
+ <li><b><a href="#regalloc">Register Allocation</a></b> - The
+ target code is transformed from an infinite virtual register file in SSA form 
+ to the concrete register file used by the target.  This phase introduces spill 
+ code and eliminates all virtual register references from the program.</li>
+ 
+ <li><b><a href="#proepicode">Prolog/Epilog Code Insertion</a></b> - Once the 
+ machine code has been generated for the function and the amount of stack space 
+ required is known (used for LLVM alloca's and spill slots), the prolog and 
+ epilog code for the function can be inserted and "abstract stack location 
+ references" can be eliminated.  This stage is responsible for implementing 
+ optimizations like frame-pointer elimination and stack packing.</li>
+ 
+ <li><b><a href="#latemco">Late Machine Code Optimizations</a></b> - Optimizations
+ that operate on "final" machine code can go here, such as spill code scheduling
+ and peephole optimizations.</li>
+ 
+ <li><b><a href="#codeemit">Code Emission</a></b> - The final stage actually 
+ puts out the code for the current function, either in the target assembler 
+ format or in machine code.</li>
+ 
+ </ol>
+ 
+ <p>
+ The code generator is based on the assumption that the instruction selector will
+ use an optimal pattern matching selector to create high-quality sequences of
+ native instructions.  Alternative code generator designs based on pattern 
+ expansion and
+ aggressive iterative peephole optimization are much slower.  This design 
+ permits efficient compilation (important for JIT environments) and
+ aggressive optimization (used when generating code offline) by allowing 
+ components of varying levels of sophistication to be used for any step of 
+ compilation.</p>
+ 
+ <p>
+ In addition to these stages, target implementations can insert arbitrary
+ target-specific passes into the flow.  For example, the X86 target uses a
+ special pass to handle the 80x87 floating point stack architecture.  Other
+ targets with unusual requirements can be supported with custom passes as needed.
+ </p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+  <a name="tablegen">Using TableGen for target description</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The target description classes require a detailed description of the target
+ architecture.  These target descriptions often have a large amount of common
+ information (e.g., an <tt>add</tt> instruction is almost identical to a 
+ <tt>sub</tt> instruction).
+ In order to allow the maximum amount of commonality to be factored out, the LLVM
+ code generator uses the <a href="TableGenFundamentals.html">TableGen</a> tool to
+ describe big chunks of the target machine, which allows the use of
+ domain-specific and target-specific abstractions to reduce the amount of 
+ repetition.
+ </p>
+ 
+ <p>As LLVM continues to be developed and refined, we plan to move more and more
+ of the target description to be in <tt>.td</tt> form.  Doing so gives us a
+ number of advantages.  The most important is that it makes it easier to port
+ LLVM, because it reduces the amount of C++ code that has to be written and the
+ surface area of the code generator that needs to be understood before someone
+ can get in and get something working.  Second, it is also important to us because
+ it makes it easier to change things: in particular, if tables and other things
+ are all emitted by tblgen, we only need to change one place (tblgen) to update
+ all of the targets to a new interface.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="targetdesc">Target description classes</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM target description classes (which are located in the
+ <tt>include/llvm/Target</tt> directory) provide an abstract description of the
+ target machine, independent of any particular client.  These classes are
+ designed to capture the <i>abstract</i> properties of the target (such as the
+ instructions and registers it has), and do not incorporate any particular pieces
+ of code generation algorithms.</p>
+ 
+ <p>All of the target description classes (except the <tt><a
+ href="#targetdata">TargetData</a></tt> class) are designed to be subclassed by
+ the concrete target implementation, and have virtual methods implemented.  To
+ get to these implementations, the <tt><a
+ href="#targetmachine">TargetMachine</a></tt> class provides accessors that
+ should be implemented by the target.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetmachine">The <tt>TargetMachine</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>TargetMachine</tt> class provides virtual methods that are used to
+ access the target-specific implementations of the various target description
+ classes via the <tt>get*Info</tt> methods (<tt>getInstrInfo</tt>,
+ <tt>getRegisterInfo</tt>, <tt>getFrameInfo</tt>, etc.).  This class is 
+ designed to be specialized by
+ a concrete target implementation (e.g., <tt>X86TargetMachine</tt>) which
+ implements the various virtual methods.  The only required target description
+ class is the <a href="#targetdata"><tt>TargetData</tt></a> class, but if the
+ code generator components are to be used, the other interfaces should be
+ implemented as well.</p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetdata">The <tt>TargetData</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>TargetData</tt> class is the only required target description class,
+ and it is the only class that is not extensible (you cannot derive a new 
+ class from it).  <tt>TargetData</tt> specifies information about how the target 
+ lays out memory for structures, the alignment requirements for various data 
+ types, the size of pointers in the target, and whether the target is 
+ little-endian or big-endian.</p>
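+ 
+ <p>To make the kind of information involved concrete, the following is a
+ minimal standalone sketch of how field sizes and alignments determine a
+ structure layout.  It does not use the <tt>TargetData</tt> API; the names
+ <tt>FieldInfo</tt>, <tt>Layout</tt>, <tt>alignTo</tt>, and
+ <tt>layoutStruct</tt> are hypothetical, and the padding rule shown is the
+ common "round up to the field's alignment" convention, assumed here for
+ illustration.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical description of one struct field: size and required alignment,
// both in bytes.  These are illustrative types, not LLVM classes.
struct FieldInfo {
  std::size_t Size;
  std::size_t Align;  // assumed to be nonzero
};

struct Layout {
  std::vector<std::size_t> Offsets;  // byte offset of each field
  std::size_t Size;                  // total size, including tail padding
  std::size_t Align;                 // alignment of the whole struct
};

// Round Value up to the next multiple of Align.
inline std::size_t alignTo(std::size_t Value, std::size_t Align) {
  return (Value + Align - 1) / Align * Align;
}

// Place each field at the next offset that satisfies its alignment, then pad
// the end of the struct to the largest field alignment.
inline Layout layoutStruct(const std::vector<FieldInfo> &Fields) {
  Layout L{{}, 0, 1};
  for (const FieldInfo &F : Fields) {
    L.Size = alignTo(L.Size, F.Align);
    L.Offsets.push_back(L.Size);
    L.Size += F.Size;
    if (F.Align > L.Align)
      L.Align = F.Align;
  }
  L.Size = alignTo(L.Size, L.Align);
  return L;
}
```

+ <p>Under these assumptions, a one-byte field followed by a four-byte field
+ (aligned to four bytes) places the second field at offset 4, not offset 1.</p>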
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetlowering">The <tt>TargetLowering</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>TargetLowering</tt> class is used by SelectionDAG based instruction
+ selectors primarily to describe how LLVM code should be lowered to SelectionDAG
+ operations.  Among other things, this class indicates:</p>
+ <ul><li>an initial register class to use for various ValueTypes</li>
+   <li>which operations are natively supported by the target machine</li>
+   <li>the return type of setcc operations</li>
+   <li>the type to use for shift amounts</li>
+   <li>various high-level characteristics, like whether it is profitable to turn
+       division by a constant into a multiplication sequence</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="mregisterinfo">The <tt>MRegisterInfo</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>MRegisterInfo</tt> class (which will eventually be renamed to
+ <tt>TargetRegisterInfo</tt>) is used to describe the register file of the
+ target and any interactions between the registers.</p>
+ 
+ <p>Registers are represented in the code generator by unsigned numbers.
+ Physical registers (those that actually exist in the target
+ description) are unique small numbers, and virtual registers are generally
+ large.  Note that register #0 is reserved as a flag value.</p>
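+ 
+ <p>The numbering scheme can be sketched as follows.  This is a toy model, not
+ the <tt>MRegisterInfo</tt> interface: the namespace <tt>toy</tt> and its
+ helpers are hypothetical, and the boundary value 1024 is an assumption
+ suggested by virtual register names such as <tt>%reg1024</tt> used in the
+ examples in this document.</p>

```cpp
#include <cassert>

// Toy model of the register numbering convention.  The value 1024 is an
// assumption for illustration; consult the target headers for the real one.
namespace toy {
enum : unsigned { NoRegister = 0, FirstVirtualRegister = 1024 };

// Physical registers are small nonzero numbers.
inline bool isPhysicalRegister(unsigned Reg) {
  return Reg != NoRegister && Reg < FirstVirtualRegister;
}

// Virtual registers occupy the large numbers.
inline bool isVirtualRegister(unsigned Reg) {
  return Reg >= FirstVirtualRegister;
}
}  // namespace toy
```

+ <p>Because register 0 is reserved as a flag value, it is classified as
+ neither physical nor virtual in this sketch.</p>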
+ 
+ <p>Each register in the processor description has an associated
+ <tt>TargetRegisterDesc</tt> entry, which provides a textual name for the register
+ (used for assembly output and debugging dumps) and a set of aliases (used to
+ indicate that one register overlaps with another).
+ </p>
+ 
+ <p>In addition to the per-register description, the <tt>MRegisterInfo</tt> class
+ exposes a set of processor specific register classes (instances of the
+ <tt>TargetRegisterClass</tt> class).  Each register class contains sets of
+ registers that have the same properties (for example, they are all 32-bit
+ integer registers).  Each SSA virtual register created by the instruction
+ selector has an associated register class.  When the register allocator runs, it
+ replaces virtual registers with a physical register in the set.</p>
+ 
+ <p>
+ The target-specific implementations of these classes are auto-generated from a <a
+ href="TableGenFundamentals.html">TableGen</a> description of the register file.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetinstrinfo">The <tt>TargetInstrInfo</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+   <p>The <tt>TargetInstrInfo</tt> class is used to describe the machine 
+   instructions supported by the target. It is essentially an array of 
+   <tt>TargetInstrDescriptor</tt> objects, each of which describes one
+   instruction the target supports. Descriptors define things like the mnemonic
+   for the opcode, the number of operands, the list of implicit register uses
+   and defs, whether the instruction has certain target-independent properties 
+   (accesses memory, is commutable, etc), and holds any target-specific flags.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetframeinfo">The <tt>TargetFrameInfo</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+   <p>The <tt>TargetFrameInfo</tt> class is used to provide information about the
+   stack frame layout of the target. It holds the direction of stack growth, 
+   the known stack alignment on entry to each function, and the offset to the 
+   locals area.  The offset to the local area is the offset from the stack 
+   pointer on function entry to the first location where function data (local 
+   variables, spill locations) can be stored.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetsubtarget">The <tt>TargetSubtarget</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+   <p>The <tt>TargetSubtarget</tt> class is used to provide information about the
+   specific chip set being targeted.  A sub-target informs code generation of 
+   which instructions are supported, instruction latencies and instruction 
+   execution itinerary; i.e., which processing units are used, in what order, and
+   for how long.
+   </p>
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="targetjitinfo">The <tt>TargetJITInfo</tt> class</a>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="codegendesc">Machine code description classes</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>
+ At a high level, LLVM code is translated to a machine specific representation
+ formed out of <a href="#machinefunction">MachineFunction</a>,
+ <a href="#machinebasicblock">MachineBasicBlock</a>, and <a 
+ href="#machineinstr"><tt>MachineInstr</tt></a> instances
+ (defined in include/llvm/CodeGen).  This representation is completely target
+ agnostic, representing instructions in their most abstract form: an opcode and a
+ series of operands.  This representation is designed to support both SSA
+ representation for machine code, as well as a register allocated, non-SSA form.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="machineinstr">The <tt>MachineInstr</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Target machine instructions are represented as instances of the
+ <tt>MachineInstr</tt> class.  This class is an extremely abstract way of
+ representing machine instructions.  In particular, it only keeps track of 
+ an opcode number and a set of operands.</p>
+ 
+ <p>The opcode number is a simple unsigned number that only has meaning to a 
+ specific backend.  All of the instructions for a target should be defined in 
+ the <tt>*InstrInfo.td</tt> file for the target. The opcode enum values
+ are auto-generated from this description.  The <tt>MachineInstr</tt> class does
+ not have any information about how to interpret the instruction (i.e., what the 
+ semantics of the instruction are): for that you must refer to the 
+ <tt><a href="#targetinstrinfo">TargetInstrInfo</a></tt> class.</p> 
+ 
+ <p>The operands of a machine instruction can be of several different types:
+ they can be a register reference, constant integer, basic block reference, etc.
+ In addition, a machine operand should be marked as a def or a use of the value
+ (though only registers are allowed to be defs).</p>
+ 
+ <p>By convention, the LLVM code generator orders instruction operands so that
+ all register definitions come before the register uses, even on architectures
+ that are normally printed in other orders.  For example, the SPARC add 
+ instruction: "<tt>add %i1, %i2, %i3</tt>" adds the "%i1", and "%i2" registers
+ and stores the result into the "%i3" register.  In the LLVM code generator,
+ the operands should be stored as "<tt>%i3, %i1, %i2</tt>": with the destination
+ first.</p>
+ 
+ <p>Keeping destination (definition) operands at the beginning of the operand 
+ list has several advantages.  In particular, the debugging printer will print 
+ the instruction like this:</p>
+ 
+ <pre>
+   %i3 = add %i1, %i2
+ </pre>
+ 
+ <p>The printer uses this form when the first operand is a def.  It is also
+ easier to <a href="#buildmi">create instructions</a> whose only def is the
+ first operand.</p>
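+ 
+ <p>The defs-first convention can be illustrated with a small standalone
+ sketch.  <tt>ToyOperand</tt>, <tt>ToyInstr</tt>, and <tt>printInstr</tt> are
+ hypothetical stand-ins written for this example; they are not the
+ <tt>MachineInstr</tt> classes and do not reproduce the real debugging
+ printer.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-ins for a machine operand and instruction; not the LLVM classes.
struct ToyOperand {
  std::string Reg;
  bool IsDef;
};

struct ToyInstr {
  std::string Opcode;
  std::vector<ToyOperand> Operands;  // by convention: defs first, then uses
};

// Print "def = opcode use, use" when operand 0 is a def, and
// "opcode use, use" otherwise.
inline std::string printInstr(const ToyInstr &MI) {
  std::string Out;
  std::size_t FirstUse = 0;
  if (!MI.Operands.empty() && MI.Operands[0].IsDef) {
    Out = MI.Operands[0].Reg + " = ";
    FirstUse = 1;
  }
  Out += MI.Opcode;
  for (std::size_t I = FirstUse; I < MI.Operands.size(); ++I)
    Out += (I == FirstUse ? " " : ", ") + MI.Operands[I].Reg;
  return Out;
}
```

+ <p>With the destination stored first, the printer only has to inspect operand
+ 0 to decide whether to print an assignment.</p>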
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="buildmi">Using the <tt>MachineInstrBuilder.h</tt> functions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Machine instructions are created by using the <tt>BuildMI</tt> functions,
+ located in the <tt>include/llvm/CodeGen/MachineInstrBuilder.h</tt> file.  The
+ <tt>BuildMI</tt> functions make it easy to build arbitrary machine 
+ instructions.  Usage of the <tt>BuildMI</tt> functions looks like this: 
+ </p>
+ 
+ <pre>
+   // Create a 'DestReg = mov 42' (rendered in X86 assembly as 'mov DestReg, 42')
+   // instruction.  The '1' specifies how many operands will be added.
+   MachineInstr *MI = BuildMI(X86::MOV32ri, 1, DestReg).addImm(42);
+ 
+   // Create the same instr, but insert it at the end of a basic block.
+   MachineBasicBlock &MBB = ...
+   BuildMI(MBB, X86::MOV32ri, 1, DestReg).addImm(42);
+ 
+   // Create the same instr, but insert it before a specified iterator point.
+   MachineBasicBlock::iterator MBBI = ...
+   BuildMI(MBB, MBBI, X86::MOV32ri, 1, DestReg).addImm(42);
+ 
+   // Create a 'cmp Reg, 0' instruction, no destination reg.
+   MI = BuildMI(X86::CMP32ri, 2).addReg(Reg).addImm(0);
+   // Create an 'sahf' instruction which takes no operands and stores nothing.
+   MI = BuildMI(X86::SAHF, 0);
+ 
+   // Create a self looping branch instruction.
+   BuildMI(MBB, X86::JNE, 1).addMBB(&MBB);
+ </pre>
+ 
+ <p>
+ The key thing to remember with the <tt>BuildMI</tt> functions is that you have
+ to specify the number of operands that the machine instruction will take. This
+ allows for efficient memory allocation.  Also note that operands 
+ default to being uses of values, not definitions.  If you need to add a definition
+ operand (other than the optional destination register), you must explicitly 
+ mark it as such.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="fixedregs">Fixed (preassigned) registers</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>One important issue that the code generator needs to be aware of is the
+ presence of fixed registers.  In particular, there are often places in the 
+ instruction stream where the register allocator <em>must</em> arrange for a
+ particular value to be in a particular register.  This can occur due to 
+ limitations of the instruction set (e.g., the X86 can only do a 32-bit divide 
+ with the <tt>EAX</tt>/<tt>EDX</tt> registers), or external factors like calling
+ conventions.  In any case, the instruction selector should emit code that 
+ copies a virtual register into or out of a physical register when needed.</p>
+ 
+ <p>For example, consider this simple LLVM example:</p>
+ 
+ <pre>
+   int %test(int %X, int %Y) {
+     %Z = div int %X, %Y
+     ret int %Z
+   }
+ </pre>
+ 
+ <p>The X86 instruction selector produces this machine code for the div 
+ and ret (use 
+ "<tt>llc X.bc -march=x86 -print-machineinstrs</tt>" to get this):</p>
+ 
+ <pre>
+         ;; Start of div
+         %EAX = mov %reg1024           ;; Copy X (in reg1024) into EAX
+         %reg1027 = sar %reg1024, 31
+         %EDX = mov %reg1027           ;; Sign extend X into EDX
+         idiv %reg1025                 ;; Divide by Y (in reg1025)
+         %reg1026 = mov %EAX           ;; Read the result (Z) out of EAX
+ 
+         ;; Start of ret
+         %EAX = mov %reg1026           ;; 32-bit return value goes in EAX
+         ret
+ </pre>
+ 
+ <p>By the end of code generation, the register allocator has coalesced
+ the registers and deleted the resultant identity moves, producing the
+ following code:</p>
+ 
+ <pre>
+         ;; X is in EAX, Y is in ECX
+         mov %EAX, %EDX
+         sar %EDX, 31
+         idiv %ECX
+         ret 
+ </pre>
+ 
+ <p>This approach is extremely general (if it can handle the X86 architecture, 
+ it can handle anything!) and allows all of the target specific
+ knowledge about the instruction stream to be isolated in the instruction 
+ selector.  Note that physical registers should have a short lifetime for good 
+ code generation, and all physical registers are assumed dead on entry and
+ exit of basic blocks (before register allocation).  Thus if you need a value
+ to be live across basic block boundaries, it <em>must</em> live in a virtual 
+ register.</p>
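+ 
+ <p>The reason the selected code copies X into <tt>EAX</tt> and its sign bits
+ into <tt>EDX</tt> can be seen in a small model of the divide.  The sketch
+ below is an illustration only: <tt>ToyRegs</tt>, <tt>toyIDiv</tt>, and
+ <tt>toyDivide</tt> are hypothetical names, and the model assumes a nonzero
+ divisor and a quotient that fits in 32 bits (cases in which the real
+ instruction would fault).</p>

```cpp
#include <cassert>
#include <cstdint>

// Toy register file holding only the two registers the divide uses.
struct ToyRegs {
  std::int32_t EAX = 0;
  std::int32_t EDX = 0;
};

// Model of a 32-bit signed divide: the 64-bit dividend is EDX:EAX, the
// quotient is written to EAX and the remainder to EDX.
// Assumes Divisor != 0 and that the quotient fits in 32 bits.
inline void toyIDiv(ToyRegs &R, std::int32_t Divisor) {
  std::uint64_t High = static_cast<std::uint32_t>(R.EDX);
  std::uint64_t Low = static_cast<std::uint32_t>(R.EAX);
  std::int64_t Dividend = static_cast<std::int64_t>((High << 32) | Low);
  R.EAX = static_cast<std::int32_t>(Dividend / Divisor);
  R.EDX = static_cast<std::int32_t>(Dividend % Divisor);
}

// Mirrors the selected sequence: copy X into EAX, sign extend it into EDX,
// divide by Y, then read the result back out of EAX.
inline std::int32_t toyDivide(std::int32_t X, std::int32_t Y) {
  ToyRegs R;
  R.EAX = X;               // %EAX = mov X
  R.EDX = X < 0 ? -1 : 0;  // same bit pattern as 'sar X, 31'
  toyIDiv(R, Y);           // idiv Y
  return R.EAX;            // Z = mov %EAX
}
```

+ <p>Both fixed registers are written immediately before the divide and read
+ immediately after it, which keeps the physical register lifetimes short.</p>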
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ssa">Machine code SSA form</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>MachineInstr</tt>s are initially selected in SSA-form, and
+ are maintained in SSA-form until register allocation happens.  For the most 
+ part, this is trivially simple since LLVM is already in SSA form: LLVM PHI nodes
+ become machine code PHI nodes, and virtual registers are only allowed to have a
+ single definition.</p>
+ 
+ <p>After register allocation, machine code is no longer in SSA-form, as there 
+ are no virtual registers left in the code.</p>
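+ 
+ <p>The single-definition rule can be expressed as a simple check.  The sketch
+ below is a standalone illustration, not an LLVM pass:
+ <tt>hasSingleVirtualDefs</tt> is a hypothetical helper, and treating numbers
+ of 1024 and above as virtual registers is an assumption based on names such as
+ <tt>%reg1024</tt> in the examples above.</p>

```cpp
#include <cassert>
#include <set>
#include <vector>

// Given the register defined by each instruction, in program order, return
// true if no virtual register is defined more than once.  Physical registers
// (assumed here to be numbered below 1024) may be redefined freely.
inline bool hasSingleVirtualDefs(const std::vector<unsigned> &DefinedRegs) {
  const unsigned FirstVirtual = 1024;  // assumption for illustration
  std::set<unsigned> Seen;
  for (unsigned Reg : DefinedRegs) {
    if (Reg < FirstVirtual)
      continue;
    if (!Seen.insert(Reg).second)
      return false;  // second definition of the same virtual register
  }
  return true;
}
```

+ <p>In the divide example, <tt>EAX</tt> is written twice while each virtual
+ register is written once, so that code satisfies this check.</p>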
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="machinebasicblock">The <tt>MachineBasicBlock</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>MachineBasicBlock</tt> class contains a list of machine instructions
+ (<a href="#machineinstr">MachineInstr</a> instances).  It roughly corresponds to
+ the LLVM code input to the instruction selector, but there can be a one-to-many
+ mapping (i.e. one LLVM basic block can map to multiple machine basic blocks).
+ The MachineBasicBlock class has a "<tt>getBasicBlock</tt>" method, which returns
+ the LLVM basic block that it comes from.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="machinefunction">The <tt>MachineFunction</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>MachineFunction</tt> class contains a list of machine basic blocks
+ (<a href="#machinebasicblock">MachineBasicBlock</a> instances).  It corresponds
+ one-to-one with the LLVM function input to the instruction selector.  In
+ addition to a list of basic blocks, the <tt>MachineFunction</tt> contains
+ the MachineConstantPool, MachineFrameInfo, MachineFunctionInfo,
+ SSARegMap, and a set of live in and live out registers for the function.  See
+ <tt>MachineFunction.h</tt> for more information.
+ </p>
+ 
+ </div>
+ 
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="codegenalgs">Target-independent code generation algorithms</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section documents the phases described in the <a
+ href="#high-level-design">high-level design of the code generator</a>.  It
+ explains how they work and some of the rationale behind their design.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="instselect">Instruction Selection</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ Instruction Selection is the process of translating LLVM code presented to the
+ code generator into target-specific machine instructions.  There are several
+ well-known ways to do this in the literature.  In LLVM there are two main forms:
+ the SelectionDAG based instruction selector framework and an old-style 'simple'
+ instruction selector (which effectively peephole selects each LLVM instruction
+ into a series of machine instructions).  We recommend that all targets use the
+ SelectionDAG infrastructure.
+ </p>
+ 
+ <p>Portions of the DAG instruction selector are generated from the target 
+ description files (<tt>*.td</tt>).  Eventually, we aim for the entire
+ instruction selector to be generated from these <tt>.td</tt> files.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_intro">Introduction to SelectionDAGs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ The SelectionDAG provides an abstraction for code representation in a way that 
+ is amenable to instruction selection using automatic techniques
+ (e.g. dynamic-programming based optimal pattern matching selectors).  It is also
+ well suited to other phases of code generation; in particular,
+ instruction scheduling (SelectionDAG's are very close to scheduling DAGs
+ post-selection).  Additionally, the SelectionDAG provides a host representation
+ where a large variety of very-low-level (but target-independent) 
+ <a href="#selectiondag_optimize">optimizations</a> may be
+ performed: ones which require extensive information about the instructions
+ efficiently supported by the target.
+ </p>
+ 
+ <p>
+ The SelectionDAG is a Directed-Acyclic-Graph whose nodes are instances of the
+ <tt>SDNode</tt> class.  The primary payload of the <tt>SDNode</tt> is its 
+ operation code (Opcode) that indicates what operation the node performs and
+ the operands to the operation.
+ The various operation node types are described at the top of the
+ <tt>include/llvm/CodeGen/SelectionDAGNodes.h</tt> file.</p>
+ 
+ <p>Although most operations define a single value, each node in the graph may 
+ define multiple values.  For example, a combined div/rem operation will define
+ both the quotient and the remainder. Many other situations require multiple
+ values as well.  Each node also has some number of operands, which are edges 
+ to the node defining the used value.  Because nodes may define multiple values,
+ edges are represented by instances of the <tt>SDOperand</tt> class, which is 
+ a <tt>&lt;SDNode, unsigned&gt;</tt> pair, indicating the node and result
+ value being used, respectively.  Each value produced by an SDNode has an 
+ associated MVT::ValueType, indicating what type the value is.
+ </p>
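+ 
+ <p>The node-plus-result-number idea can be sketched in a few lines.  The types
+ below (<tt>ToySDNode</tt>, <tt>ToySDOperand</tt>, <tt>ToyVT</tt>) are
+ hypothetical simplifications written for this example and are not the
+ declarations in <tt>SelectionDAGNodes.h</tt>.</p>

```cpp
#include <cassert>
#include <string>
#include <vector>

struct ToySDNode;

// Simplified value types for the sketch.
enum class ToyVT { i32, f64 };

// An edge names both the producing node and which of its results is used.
struct ToySDOperand {
  const ToySDNode *Node;
  unsigned ResNo;
};

struct ToySDNode {
  std::string Opcode;
  std::vector<ToyVT> ValueTypes;        // one entry per value the node defines
  std::vector<ToySDOperand> Operands;   // edges to the values this node uses
};

// The type of an edge is the type of the selected result of its node.
inline ToyVT valueTypeOf(const ToySDOperand &Op) {
  return Op.Node->ValueTypes[Op.ResNo];
}
```

+ <p>A combined div/rem node would then define two values, and two users would
+ refer to the same node with result numbers 0 and 1.</p>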
+ 
+ <p>
+ SelectionDAGs contain two different kinds of values: those that represent data
+ flow and those that represent control flow dependencies.  Data values are simple
+ edges with an integer or floating point value type.  Control edges are
+ represented as "chain" edges which are of type MVT::Other.  These edges provide
+ an ordering between nodes that have side effects (such as
+ loads/stores/calls/return/etc).  All nodes that have side effects should take a
+ token chain as input and produce a new one as output.  By convention, token
+ chain inputs are always operand #0, and chain results are always the last
+ value produced by an operation.</p>
+ 
+ <p>
+ A SelectionDAG has designated "Entry" and "Root" nodes.  The Entry node is
+ always a marker node with an Opcode of ISD::EntryToken.  The Root node is the
+ final side-effecting node in the token chain. For example, in a single basic
+ block function, this would be the return node.
+ </p>
+ 
+ <p>
+ One important concept for SelectionDAGs is the notion of a "legal" vs. "illegal"
+ DAG.  A legal DAG for a target is one that only uses supported operations and
+ supported types.  On a 32-bit PowerPC, for example, a DAG with any values of i1,
+ i8, i16,
+ or i64 type would be illegal, as would a DAG that uses a SREM or UREM operation.
+ The <a href="#selectiondag_legalize">legalize</a>
+ phase is responsible for turning an illegal DAG into a legal DAG.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_process">SelectionDAG Instruction Selection Process</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ SelectionDAG-based instruction selection consists of the following steps:
+ </p>
+ 
+ <ol>
+ <li><a href="#selectiondag_build">Build initial DAG</a> - This stage performs
+     a simple translation from the input LLVM code to an illegal SelectionDAG.
+     </li>
+ <li><a href="#selectiondag_optimize">Optimize SelectionDAG</a> - This stage
+     performs simple optimizations on the SelectionDAG to simplify it and
+     recognize meta instructions (like rotates and div/rem pairs) for
+     targets that support these meta operations.  This makes the resultant code
+     more efficient and the 'select instructions from DAG' phase (below) simpler.
+ </li>
+ <li><a href="#selectiondag_legalize">Legalize SelectionDAG</a> - This stage
+     converts the illegal SelectionDAG to a legal SelectionDAG, by eliminating
+     unsupported operations and data types.</li>
+ <li><a href="#selectiondag_optimize">Optimize SelectionDAG (#2)</a> - This
+     second run of the SelectionDAG optimizer optimizes the newly legalized DAG, to
+     eliminate inefficiencies introduced by legalization.</li>
+ <li><a href="#selectiondag_select">Select instructions from DAG</a> - Finally,
+     the target instruction selector matches the DAG operations to target
+     instructions.  This process translates the target-independent input DAG into
+     another DAG of target instructions.</li>
+ <li><a href="#selectiondag_sched">SelectionDAG Scheduling and Formation</a>
+     - The last phase assigns a linear order to the instructions in the 
+     target-instruction DAG and emits them into the MachineFunction being
+     compiled.  This step uses traditional prepass scheduling techniques.</li>
+ </ol>
+ 
+ <p>After all of these steps are complete, the SelectionDAG is destroyed and the
+ rest of the code generation passes are run.</p>
+ 
+ <p>One great way to visualize what is going on here is to take advantage of a 
+ few LLC command line options.  In particular, the <tt>-view-isel-dags</tt>
+ option pops up a window with the SelectionDAG input to the Select phase for all
+ of the code compiled (if you only get errors printed to the console while using
+ this, you probably <a href="ProgrammersManual.html#ViewGraph">need to configure
+ your system</a> to add support for it).  The <tt>-view-sched-dags</tt> option
+ views the SelectionDAG output from the Select phase and input to the Scheduler
+ phase.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_build">Initial SelectionDAG Construction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ The initial SelectionDAG is naively peephole expanded from the LLVM input by
+ the <tt>SelectionDAGLowering</tt> class in the SelectionDAGISel.cpp file.  The 
+ intent of this pass is to expose as many low-level, target-specific details 
+ to the SelectionDAG as possible.  This pass is mostly hard-coded (e.g. an LLVM 
+ add turns into an SDNode add while a getelementptr is expanded into the obvious
+ arithmetic). This pass requires target-specific hooks to lower calls and
+ returns, varargs, etc.  For these features, the <a 
+ href="#targetlowering">TargetLowering</a> interface is
+ used.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_legalize">SelectionDAG Legalize Phase</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The Legalize phase is in charge of converting a DAG to only use the types and
+ operations that are natively supported by the target.  This involves two major
+ tasks:</p>
+ 
+ <ol>
+ <li><p>Convert values of unsupported types to values of supported types.</p>
+     <p>There are two main ways of doing this: converting small types to 
+        larger types ("promoting"), and breaking up large integer types
+        into smaller ones ("expanding").  For example, a target might require
+        that all f32 values are promoted to f64 and that all i1/i8/i16 values
+        are promoted to i32.  The same target might require that all i64 values
+        be expanded into i32 values.  These changes can insert sign and zero
+        extensions as 
+        needed to make sure that the final code has the same behavior as the 
+        input.</p>
+     <p>A target implementation tells the legalizer which types are supported
+        (and which register class to use for them) by calling the
+        "addRegisterClass" method in its TargetLowering constructor.</p>
+ </li>
+ 
+ <li><p>Eliminate operations that are not supported by the target.</p>
+     <p>Targets often have weird constraints, such as not supporting every
+        operation on every supported datatype (e.g. X86 does not support byte
+        conditional moves and PowerPC does not support sign-extending loads from
+        a 16-bit memory location).  Legalize takes care of this by open-coding
+        another sequence of operations to emulate the operation ("expansion"), by
+        promoting to a larger type that supports the operation
+        ("promotion"), or by using a target-specific hook to implement the
+        legalization ("custom").</p>
+     <p>A target implementation tells the legalizer which operations are not
+        supported (and which of the above three actions to take) by calling the
+        "setOperationAction" method in its TargetLowering constructor.</p>
+ </li>
+ </ol>
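+ 
+ <p>The bookkeeping behind the second task can be pictured as a table from
+ (operation, type) pairs to one of the actions above.  The sketch below is a
+ toy model under that assumption; <tt>ToyLowering</tt>, <tt>setAction</tt>, and
+ <tt>getAction</tt> are hypothetical names and do not reproduce the real
+ <tt>TargetLowering</tt> signatures.</p>

```cpp
#include <cassert>
#include <map>
#include <utility>

// Hypothetical names for illustration; the real interface is TargetLowering.
enum class Type { i1, i8, i16, i32, i64, f32, f64 };
enum class Op { Add, SRem, Select };
enum class Action { Legal, Promote, Expand, Custom };

class ToyLowering {
  std::map<std::pair<Op, Type>, Action> Actions;

public:
  // Record that Opc on values of type VT needs the given treatment.
  void setAction(Op Opc, Type VT, Action A) {
    Actions[std::make_pair(Opc, VT)] = A;
  }

  // Anything the target did not mention is assumed to be legal.
  Action getAction(Op Opc, Type VT) const {
    auto It = Actions.find(std::make_pair(Opc, VT));
    return It == Actions.end() ? Action::Legal : It->second;
  }
};
```

+ <p>A target without a remainder instruction would register
+ <tt>Op::SRem</tt> as <tt>Action::Expand</tt> in its constructor; the legalizer
+ would then consult the table for each node it visits.</p>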
+ 
+ <p>
+ Prior to the existence of the Legalize pass, we required that every
+ target <a href="#selectiondag_optimize">selector</a> support and handle every
+ operator and type even if they were not natively supported.  The introduction of
+ the Legalize phase allows all of the 
+ canonicalization patterns to be shared across targets, and makes it very 
+ easy to optimize the canonicalized code because it is still in the form of 
+ a DAG.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_optimize">SelectionDAG Optimization Phase: the DAG
+   Combiner</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ The SelectionDAG optimization phase is run twice for code generation: once
+ immediately after the DAG is built and once after legalization.  The first run
+ of the pass allows the initial code to be cleaned up (e.g. performing 
+ optimizations that depend on knowing that the operators have restricted type 
+ inputs).  The second run of the pass cleans up the messy code generated by the 
+ Legalize pass, which allows Legalize to be very simple (it can focus on making
+ code legal instead of focusing on generating <i>good</i> and legal code).
+ </p>
+ 
+ <p>
+ One important class of optimizations performed is optimizing inserted sign and
+ zero extension instructions.  We currently use ad-hoc techniques, but could move
+ to more rigorous techniques in the future.  Here are some good
+ papers on the subject:</p>
+ 
+ <p>
+ "<a href="http://www.eecs.harvard.edu/~nr/pubs/widen-abstract.html">Widening
+ integer arithmetic</a>"<br>
+ Kevin Redwine and Norman Ramsey<br>
+ International Conference on Compiler Construction (CC) 2004
+ </p>
+ 
+ 
+ <p>
+  "<a href="http://portal.acm.org/citation.cfm?doid=512529.512552">Effective
+  sign extension elimination</a>"<br>
+  Motohiro Kawahito, Hideaki Komatsu, and Toshio Nakatani<br>
+  Proceedings of the ACM SIGPLAN 2002 Conference on Programming Language Design
+  and Implementation.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_select">SelectionDAG Select Phase</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The Select phase is the bulk of the target-specific code for instruction
+ selection.  This phase takes a legal SelectionDAG as input,
+ pattern matches the instructions supported by the target to this DAG, and
+ produces a new DAG of target code.  For example, consider the following LLVM
+ fragment:</p>
+ 
+ <pre>
+    %t1 = add float %W, %X
+    %t2 = mul float %t1, %Y
+    %t3 = add float %t2, %Z
+ </pre>
+ 
+ <p>This LLVM code corresponds to a SelectionDAG that looks basically like this:
+ </p>
+ 
+ <pre>
+   (fadd:f32 (fmul:f32 (fadd:f32 W, X), Y), Z)
+ </pre>
+ 
+ <p>If a target supports floating point multiply-and-add (FMA) operations, one
+ of the adds can be merged with the multiply.  On the PowerPC, for example, the
+ output of the instruction selector might look like this DAG:</p>
+ 
+ <pre>
+   (FMADDS (FADDS W, X), Y, Z)
+ </pre>
+ 
+ <p>
+ The FMADDS instruction is a ternary instruction that multiplies its first two
+ operands and adds the third (as single-precision floating-point numbers).  The
+ FADDS instruction is a simple binary single-precision add instruction.  To
+ perform this pattern match, the PowerPC backend includes the following
+ instruction definitions:
+ </p>
+ 
+ <pre>
+ def FMADDS : AForm_1<59, 29,
+                     (ops F4RC:$FRT, F4RC:$FRA, F4RC:$FRC, F4RC:$FRB),
+                     "fmadds $FRT, $FRA, $FRC, $FRB",
+                     [<b>(set F4RC:$FRT, (fadd (fmul F4RC:$FRA, F4RC:$FRC),
+                                            F4RC:$FRB))</b>]>;
+ def FADDS : AForm_2<59, 21,
+                     (ops F4RC:$FRT, F4RC:$FRA, F4RC:$FRB),
+                     "fadds $FRT, $FRA, $FRB",
+                     [<b>(set F4RC:$FRT, (fadd F4RC:$FRA, F4RC:$FRB))</b>]>;
+ </pre>
+ 
+ <p>The portion of the instruction definition in bold indicates the pattern used
+ to match the instruction.  The DAG operators (like <tt>fmul</tt>/<tt>fadd</tt>)
+ are defined in the <tt>lib/Target/TargetSelectionDAG.td</tt> file.  
+ "<tt>F4RC</tt>" is the register class of the input and result values.</p>
+ 
+ <p>The TableGen DAG instruction selector generator reads the instruction 
+ patterns in the .td and automatically builds parts of the pattern matching code
+ for your target.  It has the following strengths:</p>
+ 
+ <ul>
+ <li>At compiler-compiler time, it analyzes your instruction patterns and tells
+     you if your patterns make sense or not.</li>
+ <li>It can handle arbitrary constraints on operands for the pattern match.  In
+     particular, it is straight-forward to say things like "match any immediate
+     that is a 13-bit sign-extended value".  For examples, see the 
+     <tt>immSExt16</tt> and related tblgen classes in the PowerPC backend.</li>
+ <li>It knows several important identities for the patterns defined.  For
+     example, it knows that addition is commutative, so it allows the 
+     <tt>FMADDS</tt> pattern above to match "<tt>(fadd X, (fmul Y, Z))</tt>" as
+     well as "<tt>(fadd (fmul X, Y), Z)</tt>", without the target author having
+     to specially handle this case.</li>
+ <li>It has a full-featured type-inferencing system.  In particular, you should
+     rarely have to explicitly tell the system what type parts of your patterns
+     are.  In the FMADDS case above, we didn't have to tell tblgen that all of
+     the nodes in the pattern are of type 'f32'.  It was able to infer and
+     propagate this knowledge from the fact that F4RC has type 'f32'.</li>
+ <li>Targets can define their own (and rely on built-in) "pattern fragments".
+     Pattern fragments are chunks of reusable patterns that get inlined into your
+     patterns during compiler-compiler time.  For example, the integer "(not x)"
+     operation is actually defined as a pattern fragment that expands as
+     "(xor x, -1)", since the SelectionDAG does not have a native 'not'
+     operation.  Targets can define their own short-hand fragments as they see
+     fit.  See the definition of 'not' and 'ineg' for examples.</li>
+ <li>In addition to instructions, targets can specify arbitrary patterns that
+     map to one or more instructions, using the 'Pat' class.  For example,
+     the PowerPC has no way to load an arbitrary integer immediate into a
+     register in one instruction. To tell tblgen how to do this, it defines:
+     
+     <pre>
+     // Arbitrary immediate support.  Implement in terms of LIS/ORI.
+     def : Pat<(i32 imm:$imm),
+               (ORI (LIS (HI16 imm:$imm)), (LO16 imm:$imm))>;
+     </pre>
+     
+     If none of the single-instruction patterns for loading an immediate into a
+     register match, this will be used.  This rule says "match an arbitrary i32
+     immediate, turning it into an ORI ('or a 16-bit immediate') and an LIS
+     ('load 16-bit immediate, where the immediate is shifted to the left 16
+     bits') instruction".  To make this work, the LO16/HI16 node transformations
+     are used to manipulate the input immediate (in this case, take the high or
+     low 16-bits of the immediate).
+     </li>
+ <li>While the system does automate a lot, it still allows you to write custom
+     C++ code to match special cases, in case there is something that is hard
+     to express.</li>
+ </ul>
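+ 
+ <p>The arithmetic behind the LIS/ORI pattern in the list above can be checked
+ with a small model.  The functions below are illustrative stand-ins written for
+ this example (they are not tblgen output or backend code): <tt>hi16</tt> and
+ <tt>lo16</tt> model the node transformations, and <tt>lis</tt> and
+ <tt>ori</tt> model the two instructions as described in the text.</p>

```cpp
#include <cassert>
#include <cstdint>

// Illustrative models of the HI16/LO16 node transformations.
inline uint32_t hi16(uint32_t Imm) { return Imm >> 16; }
inline uint32_t lo16(uint32_t Imm) { return Imm & 0xFFFFu; }

// LIS: load a 16-bit immediate shifted left by 16 bits.
inline uint32_t lis(uint32_t Imm16) { return Imm16 << 16; }

// ORI: or a register with a 16-bit immediate.
inline uint32_t ori(uint32_t Reg, uint32_t Imm16) { return Reg | Imm16; }

// (ORI (LIS (HI16 imm)), (LO16 imm)) rebuilds the original 32-bit value.
inline uint32_t materialize(uint32_t Imm) {
  return ori(lis(hi16(Imm)), lo16(Imm));
}
```

+ <p>Because the high and low halves occupy disjoint bits, the final "or"
+ reassembles any 32-bit immediate exactly.</p>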
+ 
+ <p>
+ While it has many strengths, the system currently has some limitations,
+ primarily because it is a work in progress and is not yet finished:
+ </p>
+ 
+ <ul>
+ <li>Overall, there is no way to define or match SelectionDAG nodes that define
+     multiple values (e.g. ADD_PARTS, LOAD, CALL, etc).  This is the biggest
+     reason that you currently still <i>have to</i> write custom C++ code for
+     your instruction selector.</li>
+ <li>There is no great way to support matching complex addressing modes yet.  In the
+     future, we will extend pattern fragments to allow them to define multiple
+     values (e.g. the four operands of the <a href="#x86_memory">X86 addressing
+     mode</a>).  In addition, we'll extend fragments so that a fragment can match
+     multiple different patterns.</li>
+ <li>We don't automatically infer flags like isStore/isLoad yet.</li>
+ <li>We don't automatically generate the set of supported registers and
+     operations for the <a href="#selectiondag_legalize">Legalizer</a> yet.</li>
+ <li>We don't have a way of tying in custom legalized nodes yet.</li>
+ </ul>
+ 
+ <p>Despite these limitations, the instruction selector generator is still quite
+ useful for most of the binary and logical operations in typical instruction
+ sets.  If you run into any problems or can't figure out how to do something, 
+ please let Chris know!</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_sched">SelectionDAG Scheduling and Formation Phase</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The scheduling phase takes the DAG of target instructions from the selection
+ phase and assigns an order.  The scheduler can pick an order depending on
+ various constraints of the machine (e.g. order for minimal register pressure or
+ try to cover instruction latencies).  Once an order is established, the DAG is
+ converted to a list of <a href="#machineinstr">MachineInstr</a>s and the
+ SelectionDAG is destroyed.
+ </p>
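+ 
+ <p>At its simplest, assigning an order means emitting every node after the
+ nodes that define its operands.  The sketch below shows only that baseline,
+ using a plain worklist-based topological sort; it is not the LLVM scheduler,
+ which additionally weighs register pressure and latencies.  The function name
+ <tt>linearize</tt> and the index-based graph encoding are assumptions made for
+ this example.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Uses[i] lists the nodes whose results node i consumes.
// Returns node indices in an order where every node follows its operands.
inline std::vector<std::size_t>
linearize(const std::vector<std::vector<std::size_t>> &Uses) {
  const std::size_t N = Uses.size();
  std::vector<std::size_t> Pending(N);            // operands not yet emitted
  std::vector<std::vector<std::size_t>> Users(N); // reverse edges
  for (std::size_t I = 0; I < N; ++I) {
    Pending[I] = Uses[I].size();
    for (std::size_t Op : Uses[I])
      Users[Op].push_back(I);
  }

  // Nodes with no operands can be emitted immediately.
  std::vector<std::size_t> Ready, Order;
  for (std::size_t I = 0; I < N; ++I)
    if (Pending[I] == 0)
      Ready.push_back(I);

  // Emit a ready node, then release any user whose operands are all emitted.
  while (!Ready.empty()) {
    std::size_t Node = Ready.back();
    Ready.pop_back();
    Order.push_back(Node);
    for (std::size_t U : Users[Node])
      if (--Pending[U] == 0)
        Ready.push_back(U);
  }
  return Order;
}
```

+ <p>A real scheduler differs mainly in how it picks among the ready nodes;
+ here the choice is arbitrary.</p>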
+ 
+ <p>Note that this phase is logically separate from the instruction selection
+ phase, but is tied to it closely in the code because it operates on
+ SelectionDAGs.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="selectiondag_future">Future directions for the SelectionDAG</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ol>
+ <li>Optional function-at-a-time selection.</li>
+ <li>Auto-generate entire selector from .td file.</li>
+ </ol>
+ 
+ </div>
+  
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ssamco">SSA-based Machine Code Optimizations</a>
+ </div>
+ <div class="doc_text"><p>To Be Written</p></div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="regalloc">Register Allocation</a>
+ </div>
+ <div class="doc_text"><p>To Be Written</p></div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="proepicode">Prolog/Epilog Code Insertion</a>
+ </div>
+ <div class="doc_text"><p>To Be Written</p></div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="latemco">Late Machine Code Optimizations</a>
+ </div>
+ <div class="doc_text"><p>To Be Written</p></div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="codeemit">Code Emission</a>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="codeemit_asm">Generating Assembly Code</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="codeemit_bin">Generating Binary Machine Code</a>
+ </div>
+ 
+ <div class="doc_text">
+    <p>For the JIT or .o file writer</p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="targetimpls">Target-specific Implementation Notes</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section of the document explains features or design decisions that
+ are specific to the code generator for a particular target.</p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="x86">The X86 backend</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ The X86 code generator lives in the <tt>lib/Target/X86</tt> directory.  This
+ code generator currently targets a generic P6-like processor.  As such, it
+ produces a few P6-and-above instructions (like conditional moves), but it does
+ not make use of newer features like MMX or SSE.  In the future, the X86 backend
+ will have sub-target support added for specific processor families and 
+ implementations.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="x86_tt">X86 Target Triples Supported</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ The following are the known target triples that are supported by the X86 
+ backend.  This is not an exhaustive list, but it would be useful to add those
+ that people test.
+ </p>
+ 
+ <ul>
+ <li><b>i686-pc-linux-gnu</b> - Linux</li>
+ <li><b>i386-unknown-freebsd5.3</b> - FreeBSD 5.3</li>
+ <li><b>i686-pc-cygwin</b> - Cygwin on Win32</li>
+ <li><b>i686-pc-mingw32</b> - MingW on Win32</li>
+ <li><b>i686-apple-darwin*</b> - Apple Darwin on X86</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="x86_memory">Representing X86 addressing modes in MachineInstrs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The x86 has a very flexible way of accessing memory.  It is capable of
+ forming memory addresses of the following expression directly in integer
+ instructions (which use ModR/M addressing):</p>
+ 
+ <pre>
+    Base+[1,2,4,8]*IndexReg+Disp32
+ </pre>
+ 
+ <p>In order to represent this, LLVM tracks no less than 4 operands for each
+ memory operand of this form.  This means that the "load" form of 'mov' has the
+ following <tt>MachineOperand</tt>s in this order:</p>
+ 
+ <pre>
+ Index:        0     |    1        2       3           4
+ Meaning:   DestReg, | BaseReg,  Scale, IndexReg, Displacement
+ OperandTy: VirtReg, | VirtReg, UnsImm, VirtReg,   SignExtImm
+ </pre>
+ 
+ <p>Stores, and all other instructions, treat the four memory operands in the 
+ same way, in the same order.</p>
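+ 
+ <p>As a sanity check on the operand order, the address computed from the four
+ memory operands can be modelled directly.  <tt>X86MemOperand</tt> and
+ <tt>effectiveAddress</tt> are hypothetical names for this sketch; the fields
+ hold register <i>values</i> rather than register numbers so that the
+ arithmetic can be shown.</p>

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical struct mirroring the four memory operands, in order.
struct X86MemOperand {
  uint32_t Base;  // value held in BaseReg
  uint32_t Scale; // 1, 2, 4, or 8
  uint32_t Index; // value held in IndexReg
  int32_t Disp;   // sign-extended displacement
};

// Base + Scale*Index + Disp, with 32-bit wraparound.
inline uint32_t effectiveAddress(const X86MemOperand &M) {
  return M.Base + M.Scale * M.Index + static_cast<uint32_t>(M.Disp);
}
```

+ <p>For example, a base of 0x1000, a scale of 4, an index of 3, and a
+ displacement of -8 address 0x1004.</p>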
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="x86_names">Instruction naming</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ An instruction name consists of the base name, a default operand size, and
+ a character per operand with an optional special size. For example:</p>
+ 
+ <p>
+ <tt>ADD8rr</tt> -> add, 8-bit register, 8-bit register<br>
+ <tt>IMUL16rmi</tt> -> imul, 16-bit register, 16-bit memory, 16-bit immediate<br>
+ <tt>IMUL16rmi8</tt> -> imul, 16-bit register, 16-bit memory, 8-bit immediate<br>
+ <tt>MOVSX32rm16</tt> -> movsx, 32-bit register, 16-bit memory
+ </p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/CodingStandards.html
diff -c /dev/null llvm-www/releases/1.8/docs/CodingStandards.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/CodingStandards.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,679 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+   <title>A Few Coding Standards</title>
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   A Few Coding Standards
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#mechanicalissues">Mechanical Source Issues</a>
+     <ol>
+       <li><a href="#sourceformating">Source Code Formatting</a>
+         <ol>
+           <li><a href="#scf_commenting">Commenting</a></li>
+           <li><a href="#scf_commentformat">Comment Formatting</a></li>
+           <li><a href="#scf_includes"><tt>#include</tt> Style</a></li>
+           <li><a href="#scf_codewidth">Source Code Width</a></li>
+           <li><a href="#scf_spacestabs">Use Spaces Instead of Tabs</a></li>
+           <li><a href="#scf_indentation">Indent Code Consistently</a></li>
+         </ol></li>
+       <li><a href="#compilerissues">Compiler Issues</a>
+         <ol>
+           <li><a href="#ci_warningerrors">Treat Compiler Warnings Like
+               Errors</a></li>
+           <li><a href="#ci_portable_code">Write Portable Code</a></li>
+           <li><a href="#ci_class_struct">Use of class/struct Keywords</a></li>
+         </ol></li>
+     </ol></li>
+   <li><a href="#styleissues">Style Issues</a>
+     <ol>
+       <li><a href="#macro">The High Level Issues</a>
+         <ol>
+           <li><a href="#hl_module">A Public Header File <b>is</b> a
+               Module</a></li>
+           <li><a href="#hl_dontinclude">#include as Little as Possible</a></li>
+           <li><a href="#hl_privateheaders">Keep "internal" Headers
+               Private</a></li>
+         </ol></li>
+       <li><a href="#micro">The Low Level Issues</a>
+         <ol>
+           <li><a href="#ll_assert">Assert Liberally</a></li>
+           <li><a href="#ll_ns_std">Do not use 'using namespace std'</a></li>
+           <li><a href="#ll_virtual_anch">Provide a virtual method anchor for classes in headers</a></li>
+           <li><a href="#ll_preincrement">Prefer Preincrement</a></li>
+           <li><a href="#ll_avoidendl">Avoid <tt>std::endl</tt></a></li>
+         </ol></li>
+     </ol></li>
+   <li><a href="#seealso">See Also</a></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document attempts to describe a few coding standards that are being used
+ in the LLVM source tree.  Although no coding standards should be regarded as
+ absolute requirements to be followed in all instances, coding standards can be
+ useful.</p>
+ 
+ <p>This document intentionally does not prescribe fixed standards for religious
+ issues such as brace placement and space usage.  For issues like this, follow
+ the golden rule:</p>
+ 
+ <blockquote>
+ 
+ <p><b><a name="goldenrule">If you are adding a significant body of source to a
+ project, feel free to use whatever style you are most comfortable with.  If you
+ are extending, enhancing, or bug fixing already implemented code, use the style
+ that is already being used so that the source is uniform and easy to
+ follow.</a></b></p>
+ 
+ </blockquote>
+ 
+ <p>The ultimate goal of these guidelines is to increase the readability and
+ maintainability of our common source base. If you have suggestions for topics to
+ be included, please mail them to <a
+ href="mailto:sabre at nondot.org">Chris</a>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="mechanicalissues">Mechanical Source Issues</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="sourceformating">Source Code Formatting</a>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="scf_commenting">Commenting</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Comments are one critical part of readability and maintainability.  Everyone
+ knows they should comment, so should you.  Although we all should probably
+ comment our code more than we do, there are a few very critical places where
+ documentation is very useful:</p>
+ 
+ <b>File Headers</b>
+ 
+ <p>Every source file should have a header on it that
+ describes the basic purpose of the file.  If a file does not have a header, it
+ should not be checked into CVS.  Most source trees will probably have a standard
+ file header format.  The standard format for the LLVM source tree looks like
+ this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ //===-- llvm/Instruction.h - Instruction class definition -------*- C++ -*-===//
+ // 
+ //                     The LLVM Compiler Infrastructure
+ //
+ // This file was developed by the LLVM research group and is distributed under
+ // the University of Illinois Open Source License. See LICENSE.TXT for details.
+ // 
+ //===----------------------------------------------------------------------===//
+ //
+ // This file contains the declaration of the Instruction class, which is the
+ // base class for all of the VM instructions.
+ //
+ //===----------------------------------------------------------------------===//
+ </pre>
+ </div>
+ 
+ <p>A few things to note about this particular format:  The "<tt>-*- C++
+ -*-</tt>" string on the first line is there to tell Emacs that the source file
+ is a C++ file, not a C file (Emacs assumes .h files are C files by default).
+ Note that this tag is not necessary in .cpp files.  The name of the file is also
+ on the first line, along with a very short description of the purpose of the
+ file.  This is important when printing out code and flipping through lots of
+ pages.</p>
+ 
+ <p>The next section in the file is a concise note that defines the license that
+ the file is released under.  This makes it perfectly clear what terms the source
+ code can be distributed under.</p>
+ 
+ <p>The main body of the description does not have to be very long in most cases.
+ Here it's only two lines.  If an algorithm is being implemented or something
+ tricky is going on, a reference to the paper where it is published should be
+ included, as well as any notes or "gotchas" in the code to watch out for.</p>
+ 
+ <b>Class overviews</b>
+ 
+ <p>Classes are one fundamental part of a good object oriented design.  As such,
+ a class definition should have a comment block that explains what the class is
+ used for... if it's not obvious.  If it's so completely obvious your grandma
+ could figure it out, it's probably safe to leave it out.  Naming classes
+ something sane goes a long way towards avoiding writing documentation.</p>
+ 
+ 
+ <b>Method information</b>
+ 
+ <p>Methods defined in a class (as well as any global functions) should also be
+ documented properly.  A quick note about what it does and a description of the
+ borderline behaviour is all that is necessary here (unless something
+ particularly tricky or insidious is going on).  The hope is that people can
+ figure out how to use your interfaces without reading the code itself... that is
+ the goal metric.</p>
+ 
+ <p>Good things to talk about here are what happens when something unexpected
+ happens: does the method return null?  Abort?  Format your hard disk?</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="scf_commentformat">Comment Formatting</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>In general, prefer C++ style (<tt>//</tt>) comments.  They take less space,
+ require less typing, don't have nesting problems, etc.  There are a few cases
+ when it is useful to use C style (<tt>/* */</tt>) comments however:</p>
+ 
+ <ol>
+   <li>When writing C code: Obviously if you are writing C code, use C style
+       comments.</li>
+   <li>When writing a header file that may be <tt>#include</tt>d by a C source
+       file.</li>
+   <li>When writing a source file that is used by a tool that only accepts C
+       style comments.</li>
+ </ol>
+ 
+ <p>To comment out a large block of code, use <tt>#if 0</tt> and <tt>#endif</tt>.
+ These nest properly and are better behaved in general than C style comments.</p>
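+ 
+ <p>A minimal sketch of the difference (the function name <tt>answer</tt> is
+ made up for this example): the disabled region below contains both a C style
+ comment and a nested <tt>#if 0</tt>, and the preprocessor still skips exactly
+ the intended lines.</p>

```cpp
#include <cassert>

inline int answer() {
  int Result = 42;
#if 0
  /* A C style comment inside the disabled region does no harm. */
  Result = 0;
#if 0
  Result = -1; // a nested disabled region is skipped as well
#endif
  Result = 1; // still disabled: the inner #endif closed only the inner #if
#endif
  return Result;
}
```

+ <p>Wrapping the same region in <tt>/* */</tt> instead would end the comment at
+ the first <tt>*/</tt>, leaving the remaining statements active.</p>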
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="scf_includes"><tt>#include</tt> Style</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Immediately after the <a href="#scf_commenting">header file comment</a> (and
+ include guards if working on a header file), the <a
+ href="#hl_dontinclude">minimal</a> list of <tt>#include</tt>s required by the
+ file should be listed.  We prefer these <tt>#include</tt>s to be listed in this
+ order:</p>
+ 
+ <ol>
+   <li><a href="#mmheader">Main Module header</a></li>
+   <li><a href="#hl_privateheaders">Local/Private Headers</a></li>
+   <li><tt>llvm/*</tt></li>
+   <li><tt>llvm/Analysis/*</tt></li>
+   <li><tt>llvm/Assembly/*</tt></li>
+   <li><tt>llvm/Bytecode/*</tt></li>
+   <li><tt>llvm/CodeGen/*</tt></li>
+   <li>...</li>
+   <li><tt>Support/*</tt></li>
+   <li><tt>Config/*</tt></li>
+   <li>System <tt>#includes</tt></li>
+ </ol>
+ 
+ <p>... and each category should be sorted by name.</p>
+ 
+ <p><a name="mmheader">The "Main Module Header"</a> file applies to .cpp files
+ which implement an interface defined by a .h file.  This <tt>#include</tt>
+ should always be included <b>first</b> regardless of where it lives on the file
+ system.  By including a header file first in the .cpp files that implement the
+ interfaces, we ensure that the header does not have any hidden dependencies
+ which are not explicitly #included in the header, but should be.  It is also a
+ form of documentation in the .cpp file to indicate where the interfaces it
+ implements are defined.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="scf_codewidth">Source Code Width</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Write your code to fit within 80 columns of text.  This helps those of us who
+ like to print out code and look at your code in an xterm without resizing
+ it.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="scf_spacestabs">Use Spaces Instead of Tabs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>In all cases, prefer spaces to tabs in source files.  People have different
+ preferred indentation levels, and different styles of indentation that they
+ like... this is fine.  What isn't is that different editors/viewers expand tabs
+ out to different tab stops.  This can cause your code to look completely
+ unreadable, and it is not worth dealing with.</p>
+ 
+ <p>As always, follow the <a href="#goldenrule">Golden Rule</a> above: follow the
+ style of existing code if you are modifying and extending it.  If you like four
+ spaces of indentation, <b>DO NOT</b> do that in the middle of a chunk of code
+ with two spaces of indentation.  Also, do not reindent a whole source file: it
+ makes for incredible diffs that are absolutely worthless.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="scf_indentation">Indent Code Consistently</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Okay, your first year of programming you were told that indentation is
+ important.  If you didn't believe and internalize this then, now is the time.
+ Just do it.</p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="compilerissues">Compiler Issues</a>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ci_warningerrors">Treat Compiler Warnings Like Errors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If your code has compiler warnings in it, something is wrong: you aren't
+ casting values correctly, you have "questionable" constructs in your code, or
+ you are doing something legitimately wrong.  Compiler warnings can cover up
+ legitimate errors in output and make dealing with a translation unit
+ difficult.</p>
+ 
+ <p>It is not possible to prevent all warnings from all compilers, nor is it
+ desirable.  Instead, pick a standard compiler (like <tt>gcc</tt>) that provides
+ a good thorough set of warnings, and stick to them.  At least in the case of
+ <tt>gcc</tt>, it is possible to work around any spurious warnings by changing the
+ syntax of the code slightly.  For example, a warning that annoys me occurs when
+ I write code like this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ if (V = getValue()) {
+   ...
+ }
+ </pre>
+ </div>
+ 
+ <p><tt>gcc</tt> will warn me that I probably want to use the <tt>==</tt>
+ operator, and that I probably mistyped it.  In most cases, I haven't, and I
+ really don't want the spurious warnings.  To fix this particular problem, I
+ rewrite the code like this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ if ((V = getValue())) {
+   ...
+ }
+ </pre>
+ </div>
+ 
+ <p>...which shuts <tt>gcc</tt> up.  Any <tt>gcc</tt> warning that annoys you can
+ be fixed by massaging the code appropriately.</p>
+ 
+ <p>These are the <tt>gcc</tt> warnings that I prefer to enable: <tt>-Wall
+ -Winline -W -Wwrite-strings -Wno-unused</tt></p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ci_portable_code">Write Portable Code</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>In almost all cases, it is possible and within reason to write completely
+ portable code.  If there are cases where it isn't possible to write portable
+ code, isolate it behind a well defined (and well documented) interface.</p>
+ 
+ <p>In practice, this means that you shouldn't assume much about the host
+ compiler, including its support for "high tech" features like partial
+ specialization of templates.  In fact, Visual C++ 6 could be an important target
+ for our work in the future, and we don't want to have to rewrite all of our code
+ to support it.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+ <a name="ci_class_struct">Use of <tt>class</tt> and <tt>struct</tt> Keywords</a>
+ </div>
+ <div class="doc_text">
+ 
+ <p>In C++, the <tt>class</tt> and <tt>struct</tt> keywords can be used almost
+ interchangeably. The only difference is when they are used to declare a class:
+ <tt>class</tt> makes all members private by default while <tt>struct</tt> makes
+ all members public by default.</p>
+ 
+ <p>Unfortunately, not all compilers follow the rules and some will generate
+ different symbols based on whether <tt>class</tt> or <tt>struct</tt> was used to
+ declare the symbol.  This can lead to problems at link time.</p> 
+ 
+ <p>So, the rule for LLVM is to always use the <tt>class</tt> keyword, unless
+ <b>all</b> members are public, in which case <tt>struct</tt> is allowed.</p>
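+ <p>A minimal sketch of the rule, using two hypothetical types (<tt>Point</tt>
+ and <tt>Counter</tt> are invented for this example):</p>

```cpp
#include <cassert>

// Every member is public, so 'struct' is allowed.
struct Point {
  int X;
  int Y;
};

// This type has private state, so it is declared with 'class'.
class Counter {
  unsigned Count; // private by default in a class
public:
  Counter() : Count(0) {}
  void increment() { ++Count; }
  unsigned getCount() const { return Count; }
};
```

+ <p>Whichever keyword is chosen, any forward declaration of the type should use
+ the same keyword as its definition, so that compilers which treat the two
+ differently see a consistent name.</p>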
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="styleissues">Style Issues</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="macro">The High Level Issues</a>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="hl_module">A Public Header File <b>is</b> a Module</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>C++ doesn't do too well in the modularity department.  There is no real
+ encapsulation or data hiding (unless you use expensive protocol classes), but it
+ is what we have to work with.  When you write a public header file (in the LLVM
+ source tree, they live in the top level "include" directory), you are defining a
+ module of functionality.</p>
+ 
+ <p>Ideally, modules should be completely independent of each other, and their
+ header files should only include the absolute minimum number of headers
+ possible. A module is not just a class, a function, or a namespace: <a
+ href="http://www.cuj.com/articles/2000/0002/0002c/0002c.htm">it's a collection
+ of these</a> that defines an interface.  This interface may be several
+ functions, classes or data structures, but the important issue is how they work
+ together.</p>
+ 
+ <p>In general, a module should be implemented with one or more <tt>.cpp</tt>
+ files.  Each of these <tt>.cpp</tt> files should include the header that defines
+ their interface first.  This ensures that all of the dependencies of the module
+ header have been properly added to the module header itself, and are not
+ implicit.  System headers should be included after user headers for a
+ translation unit.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="hl_dontinclude"><tt>#include</tt> as Little as Possible</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include</tt> hurts compile time performance.  Don't do it unless you
+ have to, especially in header files.</p>
+ 
+ <p>But wait, sometimes you need to have the definition of a class to use it, or
+ to inherit from it.  In these cases go ahead and <tt>#include</tt> that header
+ file.  Be aware however that there are many cases where you don't need to have
+ the full definition of a class.  If you are using a pointer or reference to a
+ class, you don't need the header file.  If you are simply returning a class
+ instance from a prototyped function or method, you don't need it.  In fact, for
+ most cases, you simply don't need the definition of a class... and not
+ <tt>#include</tt>'ing speeds up compilation.</p>
+ 
+ <p>It is easy to go overboard on this recommendation, however.  You
+ <b>must</b> include all of the header files that you are using, either directly
+ or indirectly (through another header file).  To make sure that you don't
+ accidentally forget to include a header file in your module header, make sure to
+ include your module header <b>first</b> in the implementation file (as mentioned
+ above).  This way there won't be any hidden dependencies that you'll find out
+ about later...</p>
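+ <p>The forward declaration technique described above can be sketched in a
+ single file as follows.  <tt>Widget</tt>, <tt>getWidgetSize</tt> and
+ <tt>makeWidget</tt> are hypothetical names; in a real project the first part
+ would sit in a header and the second part in the implementation file.</p>

```cpp
#include <cassert>

// --- what a header would contain ---
// A forward declaration is enough for pointers, references, and function
// prototypes, so no #include of the (hypothetical) Widget header is needed.
class Widget;

unsigned getWidgetSize(const Widget *W); // uses Widget only by pointer
Widget makeWidget(unsigned Size);        // only a prototype returning Widget

// --- what the implementation file would contain ---
// Here the full definition really is required.
class Widget {
  unsigned Size;
public:
  explicit Widget(unsigned S) : Size(S) {}
  unsigned getSize() const { return Size; }
};

unsigned getWidgetSize(const Widget *W) { return W->getSize(); }
Widget makeWidget(unsigned Size) { return Widget(Size); }
```

+ <p>Clients that only pass <tt>Widget</tt> pointers around never pay for parsing
+ the full class definition.</p>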
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="hl_privateheaders">Keep "internal" Headers Private</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Many modules have a complex implementation that causes them to use more than
+ one implementation (<tt>.cpp</tt>) file.  It is often tempting to put the
+ internal communication interface (helper classes, extra functions, etc) in the
+ public module header file.  Don't do this.</p>
+ 
+ <p>If you really need to do something like this, put a private header file in
+ the same directory as the source files, and include it locally.  This ensures
+ that your private interface remains private and undisturbed by outsiders.</p>
+ 
+ <p>Note however, that it's okay to put extra implementation methods in a public
+ class itself... just make them private (or protected), and all is well.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="micro">The Low Level Issues</a>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ll_assert">Assert Liberally</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Use the "<tt>assert</tt>" function to its fullest.  Check all of your
+ preconditions and assumptions; you never know when a bug (not necessarily even
+ yours) might be caught early by an assertion, which reduces debugging time
+ dramatically.  The "<tt><cassert></tt>" header file is probably already
+ included by the header files you are using, so it doesn't cost anything to use
+ it.</p>
+ 
+ <p>To further assist with debugging, make sure to put some kind of error message
+ in the assertion statement (which is printed if the assertion is tripped). This
+ helps the poor debugger make sense of why an assertion is being made and
+ enforced, and hopefully what to do about it.  Here is one complete example:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ inline Value *getOperand(unsigned i) { 
+   assert(i < Operands.size() && "getOperand() out of range!");
+   return Operands[i]; 
+ }
+ </pre>
+ </div>
+ 
+ <p>Here are some more examples:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ assert(Ty->isPointerType() && "Can't allocate a non pointer type!");
+ 
+ assert((Opcode == Shl || Opcode == Shr) && "ShiftInst Opcode invalid!");
+ 
+ assert(idx < getNumSuccessors() && "Successor # out of range!");
+ 
+ assert(V1.getType() == V2.getType() && "Constant types must be identical!");
+ 
+ assert(isa<PHINode>(Succ->front()) && "Only works on PHId BBs!");
+ </pre>
+ </div>
+ 
+ <p>You get the idea...</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ll_ns_std">Do not use 'using namespace std'</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>In LLVM, we prefer to explicitly prefix all identifiers from the standard
+ namespace with an "std::" prefix, rather than rely on "using namespace std;".
+ </p>
+ 
+ <p> In header files, adding a 'using namespace XXX' directive pollutes the 
+ namespace of any source file that includes the header.  This is clearly a bad
+ thing.</p>
+ 
+ <p>In implementation files (e.g. .cpp files) the rule is more of a stylistic
+ rule, but is still important.  Basically, using explicit namespace prefixes 
+ makes
+ the code <b>more clear</b> - because it is immediately obvious what facilities
+ are being used and where they are coming from - and <b>more portable</b> -
+ because namespace clashes cannot occur between LLVM code and other namespaces.
+ The portability rule is important because different standard library 
+ implementations expose different symbols (potentially ones they shouldn't) and 
+ future revisions to the C++ standard will add more symbols to the std 
+ namespace.  As such, we never 'using namespace std;' in LLVM.</p>
+ 
+ <p>The exception to the general rule (i.e. it's not an exception for the std 
+ namespace) is for implementation files.  For example, all of the code in the
+ LLVM project implements code that lives in the 'llvm' namespace.  As such, it
+ is ok, and actually more clear, for the .cpp files to have a 'using namespace 
+ llvm' directive at their top, after the #includes.  The general form of this
+ rule is that any .cpp file that implements code in any namespace may use that
+ namespace (and its parents), but should not use any others.</p>
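+ <p>A minimal sketch of this rule for an implementation file.  The namespace
+ <tt>mylib</tt> and the function <tt>joinNames</tt> are hypothetical stand-ins
+ for a project namespace such as <tt>llvm</tt>:</p>

```cpp
#include <cassert>
#include <string>
#include <vector>

// --- normally declared in the project's header ---
namespace mylib {
std::string joinNames(const std::vector<std::string> &Names);
}

// --- implementation file ---
// Using the namespace that this file implements is fine...
using namespace mylib;

// ...but names from the standard library keep their explicit std:: prefix.
std::string mylib::joinNames(const std::vector<std::string> &Names) {
  std::string Result;
  for (std::vector<std::string>::const_iterator I = Names.begin(),
       E = Names.end(); I != E; ++I) {
    if (!Result.empty())
      Result += ", ";
    Result += *I;
  }
  return Result;
}
```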
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ll_virtual_anch">Provide a virtual method anchor for classes in headers</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If a class is defined in a header file and has a v-table (either it has 
+ virtual methods or it derives from classes with virtual methods), it must 
+ always have at least one out-of-line virtual method in the class.  Without 
+ this, the compiler will copy the vtable and RTTI into every .o file that
+ #includes the header, bloating .o file sizes and increasing link times.
+ </p>
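+ <p>One way to provide such an anchor is sketched below.  The class
+ <tt>Shape</tt> and the method name <tt>anchor</tt> are assumptions made for
+ this example, and the exact vtable placement rules depend on the compiler and
+ ABI in use.</p>

```cpp
#include <cassert>

// --- would live in a header (hypothetical Shape.h) ---
class Shape {
public:
  virtual ~Shape() {}
  virtual int getNumSides() const { return 0; }
private:
  // Declared here but defined out-of-line in exactly one .cpp file, so the
  // compiler has a single translation unit in which to emit the vtable.
  virtual void anchor();
};

// --- would live in exactly one implementation file ---
void Shape::anchor() {}
```

+ <p>The anchor method does nothing at run time; it exists only so that the class
+ has one virtual method that is not defined inline.</p>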
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ll_preincrement">Prefer Preincrement</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Hard and fast rule: Preincrement (<tt>++X</tt>) is never slower than
+ postincrement (<tt>X++</tt>) and could very well be a lot faster than it.  Use
+ preincrementation whenever possible.</p>
+ 
+ <p>The semantics of postincrement include making a copy of the value being
+ incremented, returning it, and then preincrementing the "work value".  For
+ primitive types, this isn't a big deal... but for iterators, it can be a huge
+ issue (for example, some iterators contain stack and set objects in them...
+ copying an iterator could invoke the copy ctor's of these as well).  In general,
+ get in the habit of always using preincrement, and you won't have a problem.</p>
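+ <p>The extra copy made by postincrement can be observed with a small
+ experiment.  <tt>CountingIter</tt> below is a hypothetical iterator-like type
+ that counts how many times it is copied; it is a sketch, not a real
+ iterator.</p>

```cpp
#include <cassert>

// Hypothetical iterator-like type that counts its own copies.
struct CountingIter {
  int Pos;
  static int Copies;

  explicit CountingIter(int P) : Pos(P) {}
  CountingIter(const CountingIter &RHS) : Pos(RHS.Pos) { ++Copies; }

  // Preincrement: modify in place and return a reference, no copy needed.
  CountingIter &operator++() {
    ++Pos;
    return *this;
  }

  // Postincrement: must save the old value before incrementing.
  CountingIter operator++(int) {
    CountingIter Old(*this); // at least one copy is made here
    ++Pos;
    return Old;
  }
};
int CountingIter::Copies = 0;

// Return the number of copies made by a single preincrement.
int copiesForPreincrement() {
  CountingIter::Copies = 0;
  CountingIter I(0);
  ++I;
  return CountingIter::Copies;
}

// Return the number of copies made by a single postincrement.
int copiesForPostincrement() {
  CountingIter::Copies = 0;
  CountingIter I(0);
  I++;
  return CountingIter::Copies;
}
```

+ <p>Preincrement makes no copies here, while postincrement makes at least one
+ (the exact number depends on whether the compiler elides the copy of the return
+ value).</p>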
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="ll_avoidendl">Avoid <tt>std::endl</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>std::endl</tt> modifier, when used with iostreams, outputs a newline
+ to the output stream specified.  In addition to doing this, however, it also
+ flushes the output stream.  In other words, these are equivalent:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ std::cout << std::endl;
+ std::cout << '\n' << std::flush;
+ </pre>
+ </div>
+ 
+ <p>Most of the time, you probably have no reason to flush the output stream, so
+ it's better to use a literal <tt>'\n'</tt>.</p>
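+ <p>As a sketch of the recommendation, the hypothetical helper
+ <tt>printLines</tt> below writes several lines with <tt>'\n'</tt> and flushes
+ only once at the end, instead of once per line:</p>

```cpp
#include <cassert>
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical helper: print the numbers 0..N-1, one per line.
void printLines(std::ostream &OS, unsigned N) {
  for (unsigned i = 0; i != N; ++i)
    OS << i << '\n';  // newline only, no flush on every iteration
  OS << std::flush;   // a single explicit flush, if one is wanted at all
}
```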
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="seealso">See Also</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>A lot of these comments and recommendations have been culled from other
+ sources.  Two particularly important books for our work are:</p>
+ 
+ <ol>
+ 
+ <li><a href="http://www.aw-bc.com/catalog/academic/product/0,1144,0201310155,00.html">Effective
+ C++</a> by Scott Meyers.  There is an online version of the book (only some
+ chapters though) <a
+ href="http://www.awlonline.com/cseng/meyerscddemo/">available as well</a>.  Also
+ interesting and useful are "More Effective C++" and "Effective STL" by the same
+ author.</li>
+ 
+ <li><a href="http://cseng.aw.com/book/0,3828,0201633620,00.html">Large-Scale C++
+ Software Design</a> by John Lakos</li>
+ 
+ </ol>
+ 
+ <p>If you get some free time, and you haven't read them: do so, you might learn
+ something.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/CommandLine.html
diff -c /dev/null llvm-www/releases/1.8/docs/CommandLine.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/CommandLine.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1930 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>CommandLine 2.0 Library Manual</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   CommandLine 2.0 Library Manual
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a></li>
+ 
+   <li><a href="#quickstart">Quick Start Guide</a>
+     <ol>
+       <li><a href="#bool">Boolean Arguments</a></li>
+       <li><a href="#alias">Argument Aliases</a></li>
+       <li><a href="#onealternative">Selecting an alternative from a
+                                     set of possibilities</a></li>
+       <li><a href="#namedalternatives">Named alternatives</a></li>
+       <li><a href="#list">Parsing a list of options</a></li>
+       <li><a href="#bits">Collecting options as a set of flags</a></li>
+       <li><a href="#description">Adding freeform text to help output</a></li>
+     </ol></li>
+ 
+   <li><a href="#referenceguide">Reference Guide</a>
+     <ol>
+       <li><a href="#positional">Positional Arguments</a>
+         <ul>
+         <li><a href="#--">Specifying positional options with hyphens</a></li>
+         <li><a href="#getPosition">Determining absolute position with
+           getPosition</a></li>
+         <li><a href="#cl::ConsumeAfter">The <tt>cl::ConsumeAfter</tt>
+              modifier</a></li>
+         </ul></li>
+ 
+       <li><a href="#storage">Internal vs External Storage</a></li>
+ 
+       <li><a href="#attributes">Option Attributes</a></li>
+ 
+       <li><a href="#modifiers">Option Modifiers</a>
+         <ul>
+         <li><a href="#hiding">Hiding an option from <tt>--help</tt> 
+             output</a></li>
+         <li><a href="#numoccurrences">Controlling the number of occurrences
+                                      required and allowed</a></li>
+         <li><a href="#valrequired">Controlling whether or not a value must be
+                                    specified</a></li>
+         <li><a href="#formatting">Controlling other formatting options</a></li>
+         <li><a href="#misc">Miscellaneous option modifiers</a></li>
+         </ul></li>
+ 
+       <li><a href="#toplevel">Top-Level Classes and Functions</a>
+         <ul>
+         <li><a href="#cl::ParseCommandLineOptions">The 
+             <tt>cl::ParseCommandLineOptions</tt> function</a></li>
+         <li><a href="#cl::ParseEnvironmentOptions">The 
+             <tt>cl::ParseEnvironmentOptions</tt> function</a></li>
+         <li><a href="#cl::SetVersionPrinter">The cl::SetVersionPrinter
+           function</a></li>
+         <li><a href="#cl::opt">The <tt>cl::opt</tt> class</a></li>
+         <li><a href="#cl::list">The <tt>cl::list</tt> class</a></li>
+         <li><a href="#cl::bits">The <tt>cl::bits</tt> class</a></li>
+         <li><a href="#cl::alias">The <tt>cl::alias</tt> class</a></li>
+         <li><a href="#cl::extrahelp">The <tt>cl::extrahelp</tt> class</a></li>
+         </ul></li>
+ 
+       <li><a href="#builtinparsers">Builtin parsers</a>
+         <ul>
+         <li><a href="#genericparser">The Generic <tt>parser<t></tt>
+             parser</a></li>
+         <li><a href="#boolparser">The <tt>parser<bool></tt>
+             specialization</a></li>
+         <li><a href="#stringparser">The <tt>parser<string></tt>
+             specialization</a></li>
+         <li><a href="#intparser">The <tt>parser<int></tt>
+             specialization</a></li>
+         <li><a href="#doubleparser">The <tt>parser<double></tt> and
+             <tt>parser<float></tt> specializations</a></li>
+         </ul></li>
+     </ol></li>
+   <li><a href="#extensionguide">Extension Guide</a>
+     <ol>
+       <li><a href="#customparser">Writing a custom parser</a></li>
+       <li><a href="#explotingexternal">Exploiting external storage</a></li>
+       <li><a href="#dynamicopts">Dynamically adding command line 
+           options</a></li>
+     </ol></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document describes the CommandLine argument processing library.  It will
+ show you how to use it, and what it can do.  The CommandLine library uses a
+ declarative approach to specifying the command line options that your program
+ takes.  By default, these options declarations implicitly hold the value parsed
+ for the option declared (of course this <a href="#storage">can be
+ changed</a>).</p>
+ 
+ <p>Although there are a <b>lot</b> of command line argument parsing libraries
+ out there in many different languages, none of them fit well with what I needed.
+ By looking at the features and problems of other libraries, I designed the
+ CommandLine library to have the following features:</p>
+ 
+ <ol>
+ <li>Speed: The CommandLine library is very quick and uses few resources.  The
+ parsing time of the library is directly proportional to the number of arguments
+ parsed, not the number of options recognized.  Additionally, command line
+ argument values are captured transparently into user defined global variables,
+ which can be accessed like any other variable (and with the same
+ performance).</li>
+ 
+ <li>Type Safe: As a user of CommandLine, you don't have to worry about
+ remembering the type of arguments that you want (is it an int?  a string? a
+ bool? an enum?) and keep casting it around.  Not only does this help prevent
+ error prone constructs, it also leads to dramatically cleaner source code.</li>
+ 
+ <li>No subclasses required: To use CommandLine, you instantiate variables that
+ correspond to the arguments that you would like to capture, you don't subclass a
+ parser.  This means that you don't have to write <b>any</b> boilerplate
+ code.</li>
+ 
+ <li>Globally accessible: Libraries can specify command line arguments that are
+ automatically enabled in any tool that links to the library.  This is possible
+ because the application doesn't have to keep a "list" of arguments to pass to
+ the parser.  This also makes supporting <a href="#dynamicopts">dynamically
+ loaded options</a> trivial.</li>
+ 
+ <li>Cleaner: CommandLine supports enum and other types directly, meaning that
+ there is less error and more security built into the library.  You don't have to
+ worry about whether your integral command line argument accidentally got
+ assigned a value that is not valid for your enum type.</li>
+ 
+ <li>Powerful: The CommandLine library supports many different types of
+ arguments, from simple <a href="#boolparser">boolean flags</a> to <a
+ href="#cl::opt">scalar arguments</a> (<a href="#stringparser">strings</a>, <a
+ href="#intparser">integers</a>, <a href="#genericparser">enums</a>, <a
+ href="#doubleparser">doubles</a>), to <a href="#cl::list">lists of
+ arguments</a>.  This is possible because CommandLine is...</li>
+ 
+ <li>Extensible: It is very simple to add a new argument type to CommandLine.
+ Simply specify the parser that you want to use with the command line option when
+ you declare it.  <a href="#customparser">Custom parsers</a> are no problem.</li>
+ 
+ <li>Labor Saving: The CommandLine library cuts down on the amount of grunt work
+ that you, the user, have to do.  For example, it automatically provides a
+ <tt>--help</tt> option that shows the available command line options for your
+ tool.  Additionally, it does most of the basic correctness checking for
+ you.</li>
+ 
+ <li>Capable: The CommandLine library can handle lots of different forms of
+ options often found in real programs.  For example, <a
+ href="#positional">positional</a> arguments, <tt>ls</tt> style <a
+ href="#cl::Grouping">grouping</a> options (to allow processing '<tt>ls
+ -lad</tt>' naturally), <tt>ld</tt> style <a href="#cl::Prefix">prefix</a>
+ options (to parse '<tt>-lmalloc -L/usr/lib</tt>'), and <a
+ href="#cl::ConsumeAfter">interpreter style options</a>.</li>
+ 
+ </ol>
+ 
+ <p>This document will hopefully let you jump in and start using CommandLine in
+ your utility quickly and painlessly.  Additionally it should be a simple
+ reference manual to figure out how stuff works.  If it is failing in some area
+ (or you want an extension to the library), nag the author, <a
+ href="mailto:sabre at nondot.org">Chris Lattner</a>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="quickstart">Quick Start Guide</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section of the manual runs through a simple CommandLine'ification of a
+ basic compiler tool.  This is intended to show you how to jump into using the
+ CommandLine library in your own program, and show you some of the cool things it
+ can do.</p>
+ 
+ <p>To start out, you need to include the CommandLine header file into your
+ program:</p>
+ 
+ <div class="doc_code"><pre>
+   #include "llvm/Support/CommandLine.h"
+ </pre></div>
+ 
+ <p>Additionally, you need to add this as the first line of your main
+ program:</p>
+ 
+ <div class="doc_code"><pre>
+ int main(int argc, char **argv) {
+   <a href="#cl::ParseCommandLineOptions">cl::ParseCommandLineOptions</a>(argc, argv);
+   ...
+ }
+ </pre></div>
+ 
+ <p>... which actually parses the arguments and fills in the variable
+ declarations.</p>
+ 
+ <p>Now that you are ready to support command line arguments, we need to tell the
+ system which ones we want, and what type of argument they are.  The CommandLine
+ library uses a declarative syntax to model command line arguments with the
+ global variable declarations that capture the parsed values.  This means that
+ for every command line option that you would like to support, there should be a
+ global variable declaration to capture the result.  For example, in a compiler,
+ we would like to support the unix standard '<tt>-o <filename></tt>' option
+ to specify where to put the output.  With the CommandLine library, this is
+ represented like this:</p>
+ 
+ <a name="value_desc_example"></a>
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><string> OutputFilename("<i>o</i>", <a href="#cl::desc">cl::desc</a>("<i>Specify output filename</i>"), <a href="#cl::value_desc">cl::value_desc</a>("<i>filename</i>"));
+ </pre></div>
+ 
+ <p>This declares a global variable "<tt>OutputFilename</tt>" that is used to
+ capture the result of the "<tt>o</tt>" argument (first parameter).  We specify
+ that this is a simple scalar option by using the "<tt><a
+ href="#cl::opt">cl::opt</a></tt>" template (as opposed to the <a
+ href="#list">"<tt>cl::list</tt>" template</a>), and tell the CommandLine library
+ that the data type that we are parsing is a string.</p>
+ 
+ <p>The second and third parameters (which are optional) are used to specify what
+ to output for the "<tt>--help</tt>" option.  In this case, we get a line that
+ looks like this:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: compiler [options]
+ 
+ OPTIONS:
+   -help             - display available options (--help-hidden for more)
+   <b>-o <filename>     - Specify output filename</b>
+ </pre></div>
+ 
+ <p>Because we specified that the command line option should parse using the
+ <tt>string</tt> data type, the variable declared is automatically usable as a
+ real string in all contexts that a normal C++ string object may be used.  For
+ example:</p>
+ 
+ <div class="doc_code"><pre>
+   ...
+   ofstream Output(OutputFilename.c_str());
+   if (Output.good()) ...
+   ...
+ </pre></div>
+ 
+ <p>There are many different options that you can use to customize the command
+ line option handling library, but the above example shows the general interface
+ to these options.  The options can be specified in any order, and are specified
+ with helper functions like <a href="#cl::desc"><tt>cl::desc(...)</tt></a>, so
+ there are no positional dependencies to remember.  The available options are
+ discussed in detail in the <a href="#referenceguide">Reference Guide</a>.</p>
+ 
+ <p>Continuing the example, we would like to have our compiler take an input
+ filename as well as an output filename, but we do not want the input filename to
+ be specified with a hyphen (i.e., not <tt>-filename.c</tt>).  To support this
+ style of argument, the CommandLine library allows for <a
+ href="#positional">positional</a> arguments to be specified for the program.
+ These positional arguments are filled with command line parameters that are not
+ in option form.  We use this feature like this:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><string> InputFilename(<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::desc">cl::desc</a>("<i><input file></i>"), <a href="#cl::init">cl::init</a>("<i>-</i>"));
+ </pre></div>
+ 
+ <p>This declaration indicates that the first positional argument should be
+ treated as the input filename.  Here we use the <tt><a
+ href="#cl::init">cl::init</a></tt> option to specify an initial value for the
+ command line option, which is used if the option is not specified (if you do not
+ specify a <tt><a href="#cl::init">cl::init</a></tt> modifier for an option, then
+ the default constructor for the data type is used to initialize the value).
+ Command line options default to being optional, so if we would like to require
+ that the user always specify an input filename, we would add the <tt><a
+ href="#cl::Required">cl::Required</a></tt> flag, and we could eliminate the
+ <tt><a href="#cl::init">cl::init</a></tt> modifier, like this:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><string> InputFilename(<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::desc">cl::desc</a>("<i><input file></i>"), <b><a href="#cl::Required">cl::Required</a></b>);
+ </pre></div>
+ 
+ <p>Again, the CommandLine library does not require the options to be specified
+ in any particular order, so the above declaration is equivalent to:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><string> InputFilename(<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::Required">cl::Required</a>, <a href="#cl::desc">cl::desc</a>("<i><input file></i>"));
+ </pre></div>
+ 
+ <p>By simply adding the <tt><a href="#cl::Required">cl::Required</a></tt> flag,
+ the CommandLine library will automatically issue an error if the argument is not
+ specified, which shifts all of the command line option verification code out of
+ your application into the library.  This is just one example of how using flags
+ can alter the default behaviour of the library, on a per-option basis.  By
+ adding one of the declarations above, the <tt>--help</tt> option synopsis is now
+ extended to:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: compiler [options] <b><input file></b>
+ 
+ OPTIONS:
+   -help             - display available options (--help-hidden for more)
+   -o <filename>     - Specify output filename
+ </pre></div>
+ 
+ <p>... indicating that an input filename is expected.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="bool">Boolean Arguments</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>In addition to input and output filenames, we would like the compiler example
+ to support three boolean flags: "<tt>-f</tt>" to force overwriting of the output
+ file, "<tt>--quiet</tt>" to enable quiet mode, and "<tt>-q</tt>" for backwards
+ compatibility with some of our users.  We can support these by declaring options
+ of boolean type like this:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><bool> Force ("<i>f</i>", <a href="#cl::desc">cl::desc</a>("<i>Overwrite output files</i>"));
+ <a href="#cl::opt">cl::opt</a><bool> Quiet ("<i>quiet</i>", <a href="#cl::desc">cl::desc</a>("<i>Don't print informational messages</i>"));
+ <a href="#cl::opt">cl::opt</a><bool> Quiet2("<i>q</i>", <a href="#cl::desc">cl::desc</a>("<i>Don't print informational messages</i>"), <a href="#cl::Hidden">cl::Hidden</a>);
+ </pre></div>
+ 
+ <p>This does what you would expect: it declares three boolean variables
+ ("<tt>Force</tt>", "<tt>Quiet</tt>", and "<tt>Quiet2</tt>") to recognize these
+ options.  Note that the "<tt>-q</tt>" option is specified with the "<a
+ href="#cl::Hidden"><tt>cl::Hidden</tt></a>" flag.  This modifier prevents it
+ from being shown by the standard "<tt>--help</tt>" output (note that it is still
+ shown in the "<tt>--help-hidden</tt>" output).</p>
+ 
+ <p>The CommandLine library uses a <a href="#builtinparsers">different parser</a>
+ for different data types.  For example, in the string case, the argument passed
+ to the option is copied literally into the content of the string variable... we
+ obviously cannot do that in the boolean case, however, so we must use a smarter
+ parser.  In the case of the boolean parser, it allows no options (in which case
+ it assigns the value of true to the variable), or it allows the values
+ "<tt>true</tt>" or "<tt>false</tt>" to be specified, allowing any of the
+ following inputs:</p>
+ 
+ <div class="doc_code"><pre>
+  compiler -f          # No value, 'Force' == true
+  compiler -f=true     # Value specified, 'Force' == true
+  compiler -f=TRUE     # Value specified, 'Force' == true
+  compiler -f=FALSE    # Value specified, 'Force' == false
+ </pre></div>
+ 
+ <p>... you get the idea.  The <a href="#boolparser">bool parser</a> just turns
+ the string values into boolean values, and rejects things like '<tt>compiler
+ -f=foo</tt>'.  Similarly, the <a href="#doubleparser">float</a>, <a
+ href="#doubleparser">double</a>, and <a href="#intparser">int</a> parsers work
+ like you would expect, using the '<tt>strtol</tt>' and '<tt>strtod</tt>' C
+ library calls to parse the string value into the specified data type.</p>
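+ 
+ <p>The value handling described above can be sketched in plain C++.  This is
+ only an illustration under stated assumptions, not the library's own parser
+ code: the helper names <tt>parseBoolValue</tt> and <tt>parseIntValue</tt> are
+ hypothetical, and the boolean helper accepts only the spellings shown in the
+ examples above.</p>
+ 
```cpp
#include <cstdlib>
#include <string>

// Hypothetical helper (not part of the CommandLine API): converts the text
// that follows "-f=" into a bool. An empty string models a flag given with
// no value. Returns false when the text is not an accepted spelling.
bool parseBoolValue(const std::string &Text, bool &Result) {
  if (Text.empty() || Text == "true" || Text == "TRUE") {
    Result = true;
    return true;
  }
  if (Text == "false" || Text == "FALSE") {
    Result = false;
    return true;
  }
  return false;  // something like "foo" is rejected
}

// Hypothetical helper: integer parsing built on strtol, as the text
// describes. The whole string must be consumed for the parse to succeed.
// Overflow is not checked in this sketch.
bool parseIntValue(const std::string &Text, int &Result) {
  if (Text.empty())
    return false;
  char *End = 0;
  long Value = std::strtol(Text.c_str(), &End, 0);
  if (*End != '\0')
    return false;  // trailing garbage such as "4x"
  Result = static_cast<int>(Value);
  return true;
}
```
+ 
+ <p>Both helpers report failure through their return value, so a caller can
+ print an error for input such as '<tt>-f=foo</tt>' and leave the stored value
+ untouched.</p>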
+ 
+ <p>With the declarations above, "<tt>compiler --help</tt>" emits this:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: compiler [options] <input file>
+ 
+ OPTIONS:
+   <b>-f     - Overwrite output files</b>
+   -o     - Override output filename
+   <b>-quiet - Don't print informational messages</b>
+   -help  - display available options (--help-hidden for more)
+ </pre></div>
+ 
+ <p>and "<tt>compiler --help-hidden</tt>" prints this:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: compiler [options] <input file>
+ 
+ OPTIONS:
+   -f     - Overwrite output files
+   -o     - Override output filename
+   <b>-q     - Don't print informational messages</b>
+   -quiet - Don't print informational messages
+   -help  - display available options (--help-hidden for more)
+ </pre></div>
+ 
+ <p>This brief example has shown you how to use the '<tt><a
+ href="#cl::opt">cl::opt</a></tt>' class to parse simple scalar command line
+ arguments.  In addition to simple scalar arguments, the CommandLine library also
+ provides primitives to support CommandLine option <a href="#alias">aliases</a>,
+ and <a href="#list">lists</a> of options.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="alias">Argument Aliases</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>So far, the example works well, except for the fact that we need to check the
+ quiet condition like this now:</p>
+ 
+ <div class="doc_code"><pre>
+ ...
+   if (!Quiet && !Quiet2) printInformationalMessage(...);
+ ...
+ </pre></div>
+ 
+ <p>... which is a real pain!  Instead of defining two values for the same
+ condition, we can use the "<tt><a href="#cl::alias">cl::alias</a></tt>" class to make the "<tt>-q</tt>"
+ option an <b>alias</b> for the "<tt>-quiet</tt>" option, instead of providing
+ a value itself:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><bool> Force ("<i>f</i>", <a href="#cl::desc">cl::desc</a>("<i>Overwrite output files</i>"));
+ <a href="#cl::opt">cl::opt</a><bool> Quiet ("<i>quiet</i>", <a href="#cl::desc">cl::desc</a>("<i>Don't print informational messages</i>"));
+ <a href="#cl::alias">cl::alias</a>     QuietA("<i>q</i>", <a href="#cl::desc">cl::desc</a>("<i>Alias for -quiet</i>"), <a href="#cl::aliasopt">cl::aliasopt</a>(Quiet));
+ </pre></div>
+ 
+ <p>The third line (which is the only one we modified from above) defines a
+ "<tt>-q</tt>" alias that updates the "<tt>Quiet</tt>" variable (as specified by
+ the <tt><a href="#cl::aliasopt">cl::aliasopt</a></tt> modifier) whenever it is
+ specified.  Because aliases do not hold state, the only thing the program has to
+ query is the <tt>Quiet</tt> variable now.  Another nice feature of aliases is
+ that they automatically hide themselves from the <tt>-help</tt> output
+ (although, again, they are still visible in the <tt>--help-hidden</tt>
+ output).</p>
+ 
+ <p>Now the application code can simply use:</p>
+ 
+ <div class="doc_code"><pre>
+ ...
+   if (!Quiet) printInformationalMessage(...);
+ ...
+ </pre></div>
+ 
+ <p>... which is much nicer!  The "<tt><a href="#cl::alias">cl::alias</a></tt>"
+ can be used to specify an alternative name for any variable type, and has many
+ uses.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="onealternative">Selecting an alternative from a set of
+   possibilities</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>So far, we have seen how the CommandLine library handles builtin types like
+ <tt>std::string</tt>, <tt>bool</tt> and <tt>int</tt>, but how does it handle
+ things it doesn't know about, like enums or '<tt>int*</tt>'s?</p>
+ 
+ <p>The answer is that it uses a table driven generic parser (unless you specify
+ your own parser, as described in the <a href="#extensionguide">Extension
+ Guide</a>).  This parser maps literal strings to whatever type is required, and
+ requires you to tell it what this mapping should be.</p>
+ 
+ <p>Let's say that we would like to add four optimization levels to our
+ optimizer, using the standard flags "<tt>-g</tt>", "<tt>-O1</tt>",
+ "<tt>-O2</tt>", and "<tt>-O3</tt>".  We could easily implement this with boolean
+ options like above, but there are several problems with this strategy:</p>
+ 
+ <ol>
+ <li>A user could specify more than one of the options at a time, for example,
+ "<tt>opt -O3 -O2</tt>".  The CommandLine library would not be able to catch this
+ erroneous input for us.</li>
+ 
+ <li>We would have to test 4 different variables to see which ones are set.</li>
+ 
+ <li>This doesn't map to the numeric levels that we want... so we cannot easily
+ see if some level >= "<tt>-O1</tt>" is enabled.</li>
+ 
+ </ol>
+ 
+ <p>To cope with these problems, we can use an enum value, and have the
+ CommandLine library fill it in with the appropriate level directly, which is
+ used like this:</p>
+ 
+ <div class="doc_code"><pre>
+ enum OptLevel {
+   g, O1, O2, O3
+ };
+ 
+ <a href="#cl::opt">cl::opt</a><OptLevel> OptimizationLevel(<a href="#cl::desc">cl::desc</a>("<i>Choose optimization level:</i>"),
+   <a href="#cl::values">cl::values</a>(
+     clEnumVal(g , "<i>No optimizations, enable debugging</i>"),
+     clEnumVal(O1, "<i>Enable trivial optimizations</i>"),
+     clEnumVal(O2, "<i>Enable default optimizations</i>"),
+     clEnumVal(O3, "<i>Enable expensive optimizations</i>"),
+    clEnumValEnd));
+ 
+ ...
+   if (OptimizationLevel >= O2) doPartialRedundancyElimination(...);
+ ...
+ </pre></div>
+ 
+ <p>This declaration defines a variable "<tt>OptimizationLevel</tt>" of the
+ "<tt>OptLevel</tt>" enum type.  This variable can be assigned any of the values
+ that are listed in the declaration (note that the declaration list must be
+ terminated with the "<tt>clEnumValEnd</tt>" argument!).  The CommandLine
+ library enforces that the user can only specify one of the options, and it
+ ensures that only valid enum values can be specified.  The
+ "<tt>clEnumVal</tt>" macros ensure that the command line arguments match the
+ enum values.  With this option added, our help output now is:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: compiler [options] <input file>
+ 
+ OPTIONS:
+   <b>Choose optimization level:
+     -g          - No optimizations, enable debugging
+     -O1         - Enable trivial optimizations
+     -O2         - Enable default optimizations
+     -O3         - Enable expensive optimizations</b>
+   -f            - Overwrite output files
+   -help         - display available options (--help-hidden for more)
+   -o <filename> - Specify output filename
+   -quiet        - Don't print informational messages
+ </pre></div>
+ 
+ <p>In this case, it is sort of awkward that flag names correspond directly to
+ enum names, because we probably don't want an enum definition named "<tt>g</tt>"
+ in our program.  Because of this, we can alternatively write this example like
+ this:</p>
+ 
+ <div class="doc_code"><pre>
+ enum OptLevel {
+   Debug, O1, O2, O3
+ };
+ 
+ <a href="#cl::opt">cl::opt</a><OptLevel> OptimizationLevel(<a href="#cl::desc">cl::desc</a>("<i>Choose optimization level:</i>"),
+   <a href="#cl::values">cl::values</a>(
+    clEnumValN(Debug, "g", "<i>No optimizations, enable debugging</i>"),
+     clEnumVal(O1        , "<i>Enable trivial optimizations</i>"),
+     clEnumVal(O2        , "<i>Enable default optimizations</i>"),
+     clEnumVal(O3        , "<i>Enable expensive optimizations</i>"),
+    clEnumValEnd));
+ 
+ ...
+   if (OptimizationLevel == Debug) outputDebugInfo(...);
+ ...
+ </pre></div>
+ 
+ <p>By using the "<tt>clEnumValN</tt>" macro instead of "<tt>clEnumVal</tt>", we
+ can directly specify the name that the flag should get.  In general a direct
+ mapping is nice, but sometimes you can't or don't want to preserve the mapping,
+ which is when you would use it.</p>
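+ 
+ <p>To make the idea of a table driven parser concrete, here is a minimal
+ sketch in plain C++.  It is an assumption about the general technique, not
+ the library's implementation: the <tt>EnumEntry</tt> struct, the
+ <tt>OptLevelTable</tt> array and the <tt>lookupOptLevel</tt> function are
+ hypothetical names.  Each row pairs a flag name with an enum value and a help
+ string, which is the same information that <tt>clEnumVal</tt> and
+ <tt>clEnumValN</tt> supply in the declarations above.</p>
+ 
```cpp
#include <cstddef>
#include <string>

enum OptLevel { Debug, O1, O2, O3 };

// One row of the mapping: flag name, enum value, help text.
struct EnumEntry {
  const char *Name;
  OptLevel Value;
  const char *Help;
};

// Hypothetical table mirroring the clEnumValN/clEnumVal list above. The
// first row shows a flag name ("g") that differs from the enum name (Debug).
static const EnumEntry OptLevelTable[] = {
  { "g",  Debug, "No optimizations, enable debugging" },
  { "O1", O1,    "Enable trivial optimizations" },
  { "O2", O2,    "Enable default optimizations" },
  { "O3", O3,    "Enable expensive optimizations" },
};

// Looks up a flag name (given without the leading '-'). Returns false when
// the name is not in the table, so invalid values can be rejected.
bool lookupOptLevel(const std::string &Name, OptLevel &Result) {
  const std::size_t N = sizeof(OptLevelTable) / sizeof(OptLevelTable[0]);
  for (std::size_t i = 0; i != N; ++i) {
    if (Name == OptLevelTable[i].Name) {
      Result = OptLevelTable[i].Value;
      return true;
    }
  }
  return false;
}
```
+ 
+ <p>Because the result is an ordinary enum value, comparisons such as
+ <tt>Level >= O2</tt> work on it directly, which is the property the text
+ above relies on.</p>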
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="namedalternatives">Named Alternatives</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Another useful argument form is a named alternative style.  We shall use this
+ style in our compiler to specify different debug levels that can be used.
+ Instead of each debug level being its own switch, we want to support the
+ following options, of which only one can be specified at a time:
+ "<tt>--debug_level=none</tt>", "<tt>--debug_level=quick</tt>",
+ "<tt>--debug_level=detailed</tt>".  To do this, we use the exact same format as
+ our optimization level flags, but we also specify an option name.  For this
+ case, the code looks like this:</p>
+ 
+ <div class="doc_code"><pre>
+ enum DebugLev {
+   nodebuginfo, quick, detailed
+ };
+ 
+ // Enable Debug Options to be specified on the command line
+ <a href="#cl::opt">cl::opt</a><DebugLev> DebugLevel("<i>debug_level</i>", <a href="#cl::desc">cl::desc</a>("<i>Set the debugging level:</i>"),
+   <a href="#cl::values">cl::values</a>(
+     clEnumValN(nodebuginfo, "none", "<i>disable debug information</i>"),
+      clEnumVal(quick,               "<i>enable quick debug information</i>"),
+      clEnumVal(detailed,            "<i>enable detailed debug information</i>"),
+     clEnumValEnd));
+ </pre></div>
+ 
+ <p>This definition defines an enumerated command line variable of type "<tt>enum
+ DebugLev</tt>", which works exactly the same way as before.  The difference here
+ is just the interface exposed to the user of your program and the help output by
+ the "<tt>--help</tt>" option:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: compiler [options] <input file>
+ 
+ OPTIONS:
+   Choose optimization level:
+     -g          - No optimizations, enable debugging
+     -O1         - Enable trivial optimizations
+     -O2         - Enable default optimizations
+     -O3         - Enable expensive optimizations
+   <b>-debug_level  - Set the debugging level:
+     =none       - disable debug information
+     =quick      - enable quick debug information
+     =detailed   - enable detailed debug information</b>
+   -f            - Overwrite output files
+   -help         - display available options (--help-hidden for more)
+   -o <filename> - Specify output filename
+   -quiet        - Don't print informational messages
+ </pre></div>
+ 
+ <p>Again, the only structural difference between the debug level declaration and
+ the optimization level declaration is that the debug level declaration includes
+ an option name (<tt>"debug_level"</tt>), which automatically changes how the
+ library processes the argument.  The CommandLine library supports both forms so
+ that you can choose the form most appropriate for your application.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="list">Parsing a list of options</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Now that we have the standard run-of-the-mill argument types out of the way,
+ let's get a little wild and crazy.  Let's say that we want our optimizer to accept
+ a <b>list</b> of optimizations to perform, allowing duplicates.  For example, we
+ might want to run: "<tt>compiler -dce -constprop -inline -dce -strip</tt>".  In
+ this case, the order of the arguments and the number of appearances is very
+ important.  This is what the "<tt><a href="#cl::list">cl::list</a></tt>"
+ template is for.  First, start by defining an enum of the optimizations that you
+ would like to perform:</p>
+ 
+ <div class="doc_code"><pre>
+ enum Opts {
+   // 'inline' is a C++ keyword, so name it 'inlining'
+   dce, constprop, inlining, strip
+ };
+ </pre></div>
+ 
+ <p>Then define your "<tt><a href="#cl::list">cl::list</a></tt>" variable:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::list">cl::list</a><Opts> OptimizationList(<a href="#cl::desc">cl::desc</a>("<i>Available Optimizations:</i>"),
+   <a href="#cl::values">cl::values</a>(
+     clEnumVal(dce               , "<i>Dead Code Elimination</i>"),
+     clEnumVal(constprop         , "<i>Constant Propagation</i>"),
+    clEnumValN(inlining, "<i>inline</i>", "<i>Procedure Integration</i>"),
+     clEnumVal(strip             , "<i>Strip Symbols</i>"),
+   clEnumValEnd));
+ </pre></div>
+ 
+ <p>This defines a variable that is conceptually of the type
+ "<tt>std::vector<enum Opts></tt>".  Thus, you can access it with standard
+ vector methods:</p>
+ 
+ <div class="doc_code"><pre>
+   for (unsigned i = 0; i != OptimizationList.size(); ++i)
+     switch (OptimizationList[i])
+        ...
+ </pre></div>
+ 
+ <p>... to iterate through the list of options specified.</p>
+ 
+ <p>Note that the "<tt><a href="#cl::list">cl::list</a></tt>" template is
+ completely general and may be used with any data types or other arguments that
+ you can use with the "<tt><a href="#cl::opt">cl::opt</a></tt>" template.  One
+ especially useful way to use a list is to capture all of the positional
+ arguments together if there may be more than one specified.  In the case of a
+ linker, for example, the linker takes several '<tt>.o</tt>' files, and needs to
+ capture them into a list.  This is naturally specified as:</p>
+ 
+ <div class="doc_code"><pre>
+ ...
+ <a href="#cl::list">cl::list</a><std::string> InputFilenames(<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::desc">cl::desc</a>("<Input files>"), <a href="#cl::OneOrMore">cl::OneOrMore</a>);
+ ...
+ </pre></div>
+ 
+ <p>This variable works just like a "<tt>vector<string></tt>" object.  As
+ such, accessing the list is simple, just like above.  In this example, we used
+ the <tt><a href="#cl::OneOrMore">cl::OneOrMore</a></tt> modifier to inform the
+ CommandLine library that it is an error if the user does not specify any
+ <tt>.o</tt> files on our command line.  Again, this just reduces the amount of
+ checking we have to do.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="bits">Collecting options as a set of flags</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Instead of collecting sets of options in a list, it is also possible to
+ gather information for enum values in a <b>bit vector</b>.  The representation
+ used by the <a href="#bits"><tt>cl::bits</tt></a> class is an <tt>unsigned</tt>
+ integer.  An enum value is represented by a 0/1 in the enum's ordinal value bit
+ position: 1 indicates that the enum was specified, 0 that it was not.  As each
+ specified value is parsed, the resulting enum's bit is set in the option's bit
+ vector:</p>
+ 
+ <div class="doc_code"><pre>
+   <i>bits</i> |= 1 << (unsigned)<i>enum</i>;
+ </pre></div>
+ 
+ <p>Options that are specified multiple times are redundant.  Any instances after
+ the first are discarded.</p>
+ 
+ <p>Reworking the above list example, we could replace <a href="#list">
+ <tt>cl::list</tt></a> with <a href="#bits"><tt>cl::bits</tt></a>:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::bits">cl::bits</a><Opts> OptimizationBits(<a href="#cl::desc">cl::desc</a>("<i>Available Optimizations:</i>"),
+   <a href="#cl::values">cl::values</a>(
+     clEnumVal(dce               , "<i>Dead Code Elimination</i>"),
+     clEnumVal(constprop         , "<i>Constant Propagation</i>"),
+    clEnumValN(inlining, "<i>inline</i>", "<i>Procedure Integration</i>"),
+     clEnumVal(strip             , "<i>Strip Symbols</i>"),
+   clEnumValEnd));
+ </pre></div>
+ 
+ <p>To test whether <tt>constprop</tt> was specified, we can use the
+ <tt>cl::bits::isSet</tt> function:</p>
+ 
+ <div class="doc_code"><pre>
+   if (OptimizationBits.isSet(constprop)) {
+     ...
+   }
+ </pre></div>
+ 
+ <p>It's also possible to get the raw bit vector using the
+ <tt>cl::bits::getBits</tt> function:</p>
+ 
+ <div class="doc_code"><pre>
+   unsigned bits = OptimizationBits.getBits();
+ </pre></div>
+ 
+ <p>Finally, if external storage is used, then the location specified must be of
+ <b>type</b> <tt>unsigned</tt>. In all other ways a <a
+ href="#bits"><tt>cl::bits</tt></a> option is morally equivalent to a <a
+ href="#list"> <tt>cl::list</tt></a> option.</p>
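+ 
+ <p>The bit vector representation can be tried out without the library.  The
+ following is a small sketch under the assumption that the storage is a single
+ <tt>unsigned</tt> as described above; the <tt>OptBits</tt> struct and its
+ <tt>add</tt> method are hypothetical stand-ins, while <tt>isSet</tt> and
+ <tt>getBits</tt> are named after the functions mentioned in this section.</p>
+ 
```cpp
enum Opts { dce, constprop, inlining, strip };

// Hypothetical stand-in for the storage behind a bit-vector option.
struct OptBits {
  unsigned Bits;

  OptBits() : Bits(0) {}

  // Record one parsed value: set the bit at the enum's ordinal position.
  // Adding the same value twice leaves the vector unchanged.
  void add(Opts O) { Bits |= 1u << static_cast<unsigned>(O); }

  // Test whether a given enum value was recorded.
  bool isSet(Opts O) const {
    return (Bits & (1u << static_cast<unsigned>(O))) != 0;
  }

  // Expose the raw bit vector.
  unsigned getBits() const { return Bits; }
};
```
+ 
+ <p>Note that this representation records neither order nor repetition, so it
+ fits cases where only the presence of each value matters.</p>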
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="description">Adding freeform text to help output</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>As our program grows and becomes more mature, we may decide to put summary
+ information about what it does into the help output.  The help output is styled
+ to look similar to a Unix <tt>man</tt> page, providing concise information about
+ a program.  Unix <tt>man</tt> pages, however, often have a description about what
+ the program does.  To add this to your CommandLine program, simply pass a third
+ argument to the <a
+ href="#cl::ParseCommandLineOptions"><tt>cl::ParseCommandLineOptions</tt></a>
+ call in main.  This additional argument is then printed as the overview
+ information for your program, allowing you to include any additional information
+ that you want.  For example:</p>
+ 
+ <div class="doc_code"><pre>
+ int main(int argc, char **argv) {
+   <a href="#cl::ParseCommandLineOptions">cl::ParseCommandLineOptions</a>(argc, argv, " CommandLine compiler example\n\n"
+                               "  This program blah blah blah...\n");
+   ...
+ }
+ </pre></div>
+ 
+ <p>would yield the help output:</p>
+ 
+ <div class="doc_code"><pre>
+ <b>OVERVIEW: CommandLine compiler example
+ 
+   This program blah blah blah...</b>
+ 
+ USAGE: compiler [options] <input file>
+ 
+ OPTIONS:
+   ...
+   -help             - display available options (--help-hidden for more)
+   -o <filename>     - Specify output filename
+ </pre></div>
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="referenceguide">Reference Guide</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Now that you know the basics of how to use the CommandLine library, this
+ section will give you the detailed information you need to tune how command line
+ options work, as well as information on more "advanced" command line option
+ processing capabilities.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="positional">Positional Arguments</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Positional arguments are those arguments that are not named, and are not
+ specified with a hyphen.  Positional arguments should be used when an option is
+ specified by its position alone.  For example, the standard Unix <tt>grep</tt>
+ tool takes a regular expression argument, and an optional filename to search
+ through (which defaults to standard input if a filename is not specified).
+ Using the CommandLine library, this would be specified as:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><string> Regex   (<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::desc">cl::desc</a>("<i><regular expression></i>"), <a href="#cl::Required">cl::Required</a>);
+ <a href="#cl::opt">cl::opt</a><string> Filename(<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::desc">cl::desc</a>("<i><input file></i>"), <a href="#cl::init">cl::init</a>("<i>-</i>"));
+ </pre></div>
+ 
+ <p>Given these two option declarations, the <tt>--help</tt> output for our grep
+ replacement would look like this:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: spiffygrep [options] <b><regular expression> <input file></b>
+ 
+ OPTIONS:
+   -help - display available options (--help-hidden for more)
+ </pre></div>
+ 
+ <p>... and the resultant program could be used just like the standard
+ <tt>grep</tt> tool.</p>
+ 
+ <p>Positional arguments are sorted by their order of construction.  This means
+ that command line options will be ordered according to how they are listed in a
+ .cpp file, but will not have an ordering defined if the positional arguments
+ are defined in multiple .cpp files.  The fix for this problem is simply to
+ define all of your positional arguments in one .cpp file.</p>
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="--">Specifying positional options with hyphens</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Sometimes you may want to specify a value to your positional argument that
+ starts with a hyphen (for example, searching for '<tt>-foo</tt>' in a file).  At
+ first, you will have trouble doing this, because it will try to find an argument
+ named '<tt>-foo</tt>', and will fail (and single quotes will not save you).
+ Note that the system <tt>grep</tt> has the same problem:</p>
+ 
+ <div class="doc_code"><pre>
+   $ spiffygrep '-foo' test.txt
+   Unknown command line argument '-foo'.  Try: 'spiffygrep --help'
+ 
+   $ grep '-foo' test.txt
+   grep: illegal option -- f
+   grep: illegal option -- o
+   grep: illegal option -- o
+   Usage: grep -hblcnsviw pattern file . . .
+ </pre></div>
+ 
+ <p>The solution for this problem is the same for both your tool and the system
+ version: use the '<tt>--</tt>' marker.  When the user specifies '<tt>--</tt>' on
+ the command line, it is telling the program that all options after the
+ '<tt>--</tt>' should be treated as positional arguments, not options.  Thus, we
+ can use it like this:</p>
+ 
+ <div class="doc_code"><pre>
+   $ spiffygrep -- -foo test.txt
+     ...output...
+ </pre></div>
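+ 
+ <p>The effect of the '<tt>--</tt>' marker can be sketched as a simple
+ classification pass over the arguments.  This is an illustration of the
+ convention, not the library's code: <tt>SplitArgs</tt> and
+ <tt>splitArgs</tt> are hypothetical names, and treating a lone
+ '<tt>-</tt>' as a positional argument is an assumption of the sketch.</p>
+ 
```cpp
#include <cstddef>
#include <string>
#include <vector>

// Result of the classification pass.
struct SplitArgs {
  std::vector<std::string> Options;     // arguments in option form
  std::vector<std::string> Positional;  // everything else
};

// Hypothetical helper: classifies arguments (program name excluded).
// Once "--" is seen, every later argument is positional, even if it
// starts with a hyphen. The "--" marker itself is dropped.
SplitArgs splitArgs(const std::vector<std::string> &Args) {
  SplitArgs Out;
  bool OnlyPositional = false;
  for (std::size_t i = 0; i != Args.size(); ++i) {
    const std::string &A = Args[i];
    if (!OnlyPositional && A == "--") {
      OnlyPositional = true;
      continue;
    }
    // Assumption: a lone "-" is treated as positional.
    if (!OnlyPositional && A.size() > 1 && A[0] == '-')
      Out.Options.push_back(A);
    else
      Out.Positional.push_back(A);
  }
  return Out;
}
```
+ 
+ <p>With this sketch, the arguments <tt>-- -foo test.txt</tt> yield two
+ positional arguments and no options, whereas <tt>-foo test.txt</tt> yields one
+ option and one positional argument.</p>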
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="getPosition">Determining absolute position with getPosition()</a>
+ </div>
+ <div class="doc_text">
+   <p>Sometimes an option can affect or modify the meaning of another option. For
+   example, consider <tt>gcc</tt>'s <tt>-x LANG</tt> option. This tells
+   <tt>gcc</tt> to ignore the suffix of subsequent positional arguments and force
+   the file to be interpreted as if it contained source code in language
+   <tt>LANG</tt>. In order to handle this properly, you need to know the
+   absolute position of each argument, especially those in lists, so their 
+   interaction(s) can be applied correctly. This is also useful for options like 
+   <tt>-llibname</tt> which is actually a positional argument that starts with 
+   a dash.</p>
+   <p>So, generally, the problem is that you have two <tt>cl::list</tt> variables
+   that interact in some way. To ensure the correct interaction, you can use the
+   <tt>cl::list::getPosition(optnum)</tt> method. This method returns the
+   absolute position (as found on the command line) of the <tt>optnum</tt>
+   item in the <tt>cl::list</tt>.</p>
+   <p>The idiom for usage is like this:</p>
+   
+   <div class="doc_code"><pre>
+   static cl::list<std::string> Files(cl::Positional, cl::OneOrMore);
+   static cl::list<std::string> Libraries("l", cl::ZeroOrMore);
+ 
+   int main(int argc, char**argv) {
+     // ...
+     std::vector<std::string>::iterator fileIt = Files.begin();
+     std::vector<std::string>::iterator libIt  = Libraries.begin();
+     unsigned libPos = 0, filePos = 0;
+     while ( 1 ) {
+       if ( libIt != Libraries.end() )
+         libPos = Libraries.getPosition( libIt - Libraries.begin() );
+       else
+         libPos = 0;
+       if ( fileIt != Files.end() )
+         filePos = Files.getPosition( fileIt - Files.begin() );
+       else
+         filePos = 0;
+ 
+       if ( filePos != 0 && (libPos == 0 || filePos < libPos) ) {
+         // Source File Is next
+         ++fileIt;
+       }
+       else if ( libPos != 0 && (filePos == 0 || libPos < filePos) ) {
+         // Library is next
+         ++libIt;
+       }
+       else
+         break; // we're done with the list
+     }
+   }</pre></div>
+ 
+   <p>Note that, for compatibility reasons, the <tt>cl::opt</tt> also supports an
+   <tt>unsigned getPosition()</tt> method that will provide the absolute position
+   of that option. You can apply the same approach as above with a 
+   <tt>cl::opt</tt> and a <tt>cl::list</tt> option as you can with two lists.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::ConsumeAfter">The <tt>cl::ConsumeAfter</tt> modifier</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::ConsumeAfter</tt> <a href="#formatting">formatting option</a> is
+ used to construct programs that use "interpreter style" option processing.  With
+ this style of option processing, all arguments specified after the last
+ positional argument are treated as special interpreter arguments that are not
+ interpreted by the command line argument processor.</p>
+ 
+ <p>As a concrete example, let's say we are developing a replacement for the
+ standard Unix Bourne shell (<tt>/bin/sh</tt>).  To run <tt>/bin/sh</tt>, first
+ you specify options to the shell itself (like <tt>-x</tt> which turns on trace
+ output), then you specify the name of the script to run, then you specify
+ arguments to the script.  These arguments to the script are parsed by the Bourne
+ shell command line option processor, but are not interpreted as options to the
+ shell itself.  Using the CommandLine library, we would specify this as:</p>
+ 
+ <div class="doc_code"><pre>
+ <a href="#cl::opt">cl::opt</a><string> Script(<a href="#cl::Positional">cl::Positional</a>, <a href="#cl::desc">cl::desc</a>("<i><input script></i>"), <a href="#cl::init">cl::init</a>("-"));
+ <a href="#cl::list">cl::list</a><string>  Argv(<a href="#cl::ConsumeAfter">cl::ConsumeAfter</a>, <a href="#cl::desc">cl::desc</a>("<i><program arguments>...</i>"));
+ <a href="#cl::opt">cl::opt</a><bool>    Trace("<i>x</i>", <a href="#cl::desc">cl::desc</a>("<i>Enable trace output</i>"));
+ </pre></div>
+ 
+ <p>which automatically provides the help output:</p>
+ 
+ <div class="doc_code"><pre>
+ USAGE: spiffysh [options] <b><input script> <program arguments>...</b>
+ 
+ OPTIONS:
+   -help - display available options (--help-hidden for more)
+   <b>-x    - Enable trace output</b>
+ </pre></div>
+ 
+ <p>At runtime, if we run our new shell replacement as `<tt>spiffysh -x test.sh
+ -a -x -y bar</tt>', the <tt>Trace</tt> variable will be set to true, the
+ <tt>Script</tt> variable will be set to "<tt>test.sh</tt>", and the
+ <tt>Argv</tt> list will contain <tt>["-a", "-x", "-y", "bar"]</tt>, because they
+ were specified after the last positional argument (which is the script
+ name).</p>
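+ 
+ <p>The splitting described above can be modelled without the library.  The
+ following is only a simplified sketch: <tt>ShellArgs</tt> and
+ <tt>splitInterpreterArgs</tt> are hypothetical names that are not part of
+ CommandLine, and the real parser handles many more cases (unknown options,
+ values, help output).</p>
+ 

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical model of the three options declared for spiffysh.
struct ShellArgs {
  bool Trace = false;             // models the -x option
  std::string Script = "-";       // models the positional script name
  std::vector<std::string> Argv;  // models the cl::ConsumeAfter list
};

// Hypothetical helper, not part of CommandLine: everything after the first
// positional argument is collected verbatim instead of being parsed.
ShellArgs splitInterpreterArgs(const std::vector<std::string> &Args) {
  ShellArgs Result;
  std::size_t I = 0;
  // Leading dash-prefixed arguments are options to the shell itself.
  // (This sketch silently ignores anything other than -x.)
  for (; I < Args.size() && Args[I].size() > 1 && Args[I][0] == '-'; ++I)
    if (Args[I] == "-x")
      Result.Trace = true;
  // The first remaining argument is the script name.
  if (I < Args.size())
    Result.Script = Args[I++];
  // Everything else is passed through untouched, even if it looks like -x.
  for (; I < Args.size(); ++I)
    Result.Argv.push_back(Args[I]);
  return Result;
}
```

+ 
+ <p>Calling the helper with the arguments from the example above leaves the
+ second <tt>-x</tt> in <tt>Argv</tt> rather than treating it as a shell
+ option.</p>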
+ 
+ <p>There are several limitations to when <tt>cl::ConsumeAfter</tt> options can
+ be specified.  For example, only one <tt>cl::ConsumeAfter</tt> can be specified
+ per program, there must be at least one <a href="#positional">positional
+ argument</a> specified, there must not be any <a href="#cl::list">cl::list</a>
+ positional arguments, and the <tt>cl::ConsumeAfter</tt> option should be a <a
+ href="#cl::list">cl::list</a> option.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="storage">Internal vs External Storage</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>By default, all command line options automatically hold the value that they
+ parse from the command line.  This is very convenient in the common case,
+ especially when combined with the ability to define command line options in the
+ files that use them.  This is called the internal storage model.</p>
+ 
+ <p>Sometimes, however, it is nice to separate the command line option processing
+ code from the storage of the value parsed.  For example, let's say that we have a
+ '<tt>-debug</tt>' option that we would like to use to enable debug information
+ across the entire body of our program.  In this case, the boolean value
+ controlling the debug code should be globally accessible (in a header file, for
+ example) yet the command line option processing code should not be exposed to
+ all of these clients (requiring lots of .cpp files to #include
+ <tt>CommandLine.h</tt>).</p>
+ 
+ <p>To do this, set up your .h file with your option, like this for example:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ <i>// DebugFlag.h - Get access to the '-debug' command line option
+ //
+ 
+ // DebugFlag - This boolean is set to true if the '-debug' command line option
+ // is specified.  This should probably not be referenced directly, instead, use
+ // the DEBUG macro below.
+ //</i>
+ extern bool DebugFlag;
+ 
+ <i>// DEBUG macro - This macro should be used by code to emit debug information.
+ // If the '-debug' option is specified on the command line, and if this is a
+ // debug build, then the code specified as the option to the macro will be
+ // executed.  Otherwise it will not be.  Example:
+ //
+ // DEBUG(std::cerr << "Bitset contains: " << Bitset << "\n");
+ //</i>
+ <span class="doc_hilite">#ifdef NDEBUG
+ #define DEBUG(X)
+ #else
+ #define DEBUG(X)</span> do { if (DebugFlag) { X; } } while (0)
+ <span class="doc_hilite">#endif</span>
+ </pre>
+ </div>
+ 
+ <p>This allows clients to blissfully use the <tt>DEBUG()</tt> macro, or the
+ <tt>DebugFlag</tt> explicitly if they want to.  Now we just need to be able to
+ set the <tt>DebugFlag</tt> boolean when the option is set.  To do this, we pass
+ an additional argument to our command line argument processor, and we specify
+ where to fill in with the <a href="#cl::location">cl::location</a>
+ attribute:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ bool DebugFlag;                  <i>// the actual value</i>
+ static <a href="#cl::opt">cl::opt</a><bool, true>       <i>// The parser</i>
+ Debug("<i>debug</i>", <a href="#cl::desc">cl::desc</a>("<i>Enable debug output</i>"), <a href="#cl::Hidden">cl::Hidden</a>, <a href="#cl::location">cl::location</a>(DebugFlag));
+ </pre>
+ </div>
+ 
+ <p>In the above example, we specify "<tt>true</tt>" as the second argument to
+ the <tt><a href="#cl::opt">cl::opt</a></tt> template, indicating that the
+ template should not maintain a copy of the value itself.  In addition to this,
+ we specify the <tt><a href="#cl::location">cl::location</a></tt> attribute, so
+ that <tt>DebugFlag</tt> is automatically set.</p>
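+ 
+ <p>To see what client code gains from this split, here is a small standalone
+ sketch.  It repeats the <tt>DEBUG</tt> macro from above and sets
+ <tt>DebugFlag</tt> by hand, standing in for the parser writing through
+ <tt>cl::location</tt>.  The function <tt>countDebugRuns</tt> is a hypothetical
+ client used only for illustration.</p>
+ 

```cpp
// Normally declared in DebugFlag.h and defined next to the cl::opt.
bool DebugFlag = false;

#ifdef NDEBUG
#define DEBUG(X)
#else
#define DEBUG(X) do { if (DebugFlag) { X; } } while (0)
#endif

// Hypothetical client: uses DEBUG without including CommandLine.h.
// Returns how many times the debug-only statement actually ran.
int countDebugRuns(bool Enable) {
  DebugFlag = Enable;  // stands in for the parser storing the parsed value
  int Runs = 0;
  for (int I = 0; I < 3; ++I)
    DEBUG(++Runs);  // safe as an unbraced loop body in both configurations
  return Runs;
}
```

+ 
+ <p>In a debug build the statement runs only while <tt>DebugFlag</tt> is true.
+ In a build with <tt>NDEBUG</tt> defined it is compiled out entirely.</p>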
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="attributes">Option Attributes</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>This section describes the basic attributes that you can specify on
+ options.</p>
+ 
+ <ul>
+ 
+ <li>The option name attribute (which is required for all options, except <a
+ href="#positional">positional options</a>) specifies what the option name is.
+ This option is specified in simple double quotes:
+ 
+ <pre>
+ <a href="#cl::opt">cl::opt</a><<b>bool</b>> Quiet("<i>quiet</i>");
+ </pre>
+ 
+ </li>
+ 
+ <li><a name="cl::desc">The <b><tt>cl::desc</tt></b></a> attribute specifies a
+ description for the option to be shown in the <tt>--help</tt> output for the
+ program.</li>
+ 
+ <li><a name="cl::value_desc">The <b><tt>cl::value_desc</tt></b></a> attribute
+ specifies a string that can be used to fine tune the <tt>--help</tt> output for
+ a command line option.  Look <a href="#value_desc_example">here</a> for an
+ example.</li>
+ 
+ <li><a name="cl::init">The <b><tt>cl::init</tt></b></a> attribute specifies an
+ initial value for a <a href="#cl::opt">scalar</a> option.  If this attribute is
+ not specified then the command line option value defaults to the value created
+ by the default constructor for the type. <b>Warning</b>: If you specify both
+ <b><tt>cl::init</tt></b> and <b><tt>cl::location</tt></b> for an option,
+ you must specify <b><tt>cl::location</tt></b> first, so that when the
+ command-line parser sees <b><tt>cl::init</tt></b>, it knows where to put the
+ initial value. (You will get an error at runtime if you don't put them in
+ the right order.)</li>
+ 
+ <li><a name="cl::location">The <b><tt>cl::location</tt></b></a> attribute specifies where to
+ store the value for a parsed command line option if using external storage.  See
+ the section on <a href="#storage">Internal vs External Storage</a> for more
+ information.</li>
+ 
+ <li><a name="cl::aliasopt">The <b><tt>cl::aliasopt</tt></b></a> attribute
+ specifies which option a <tt><a href="#cl::alias">cl::alias</a></tt> option is
+ an alias for.</li>
+ 
+ <li><a name="cl::values">The <b><tt>cl::values</tt></b></a> attribute specifies
+ the string-to-value mapping to be used by the generic parser.  It takes a
+ <b>clEnumValEnd terminated</b> list of (option, value, description) triplets
+ that specify the option name, the value mapped to, and the description shown in the
+ <tt>--help</tt> for the tool.  Because the generic parser is used most
+ frequently with enum values, two macros are often useful:
+ 
+ <ol>
+ 
+ <li><a name="clEnumVal">The <b><tt>clEnumVal</tt></b></a> macro is used as a
+ nice simple way to specify a triplet for an enum.  This macro automatically
+ makes the option name be the same as the enum name.  The first option to the
+ macro is the enum, the second is the description for the command line
+ option.</li>
+ 
+ <li><a name="clEnumValN">The <b><tt>clEnumValN</tt></b></a> macro is used to
+ specify enum options where the option name doesn't equal the enum name.  For
+ this macro, the first argument is the enum value, the second is the flag name,
+ and the third is the description.</li>
+ 
+ </ol>
+ 
+ You will get a compile time error if you try to use cl::values with a parser
+ that does not support it.</li>
+ 
+ </ul>
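+ 
+ <p>The triplets that <tt>cl::values</tt> describes can be pictured as a small
+ lookup table.  The sketch below is an illustration only: <tt>EnumTriplet</tt>,
+ <tt>lookupValue</tt>, <tt>makeTable</tt> and the two <tt>SKETCH_</tt> macros
+ are hypothetical stand-ins, not the library's own definitions of
+ <tt>clEnumVal</tt> and <tt>clEnumValN</tt>.</p>
+ 

```cpp
#include <string>
#include <vector>

enum OptLevel { O0, O1, O2 };

// One (option name, value, description) triplet.
struct EnumTriplet {
  std::string Name;
  OptLevel Value;
  std::string Description;
};

// Stand-in for clEnumVal: the option name is the enum name itself.
#define SKETCH_ENUM_VAL(ENUM, DESC) EnumTriplet{#ENUM, ENUM, DESC}
// Stand-in for clEnumValN: the option name is given explicitly.
#define SKETCH_ENUM_VAL_N(ENUM, NAME, DESC) EnumTriplet{NAME, ENUM, DESC}

// Builds the string-to-value mapping used by the lookups below.
std::vector<EnumTriplet> makeTable() {
  return {
    SKETCH_ENUM_VAL(O2, "Enable default optimizations"),
    SKETCH_ENUM_VAL_N(O1, "fast", "Enable trivial optimizations"),
  };
}

// Returns true and stores the mapped value if Name is in the table.
bool lookupValue(const std::vector<EnumTriplet> &Table,
                 const std::string &Name, OptLevel &Out) {
  for (const EnumTriplet &T : Table)
    if (T.Name == Name) {
      Out = T.Value;
      return true;
    }
  return false;  // unknown string: a parser would report an error here
}
```

+ 
+ <p>With this table, the string <tt>O2</tt> maps to <tt>O2</tt>, the string
+ <tt>fast</tt> maps to <tt>O1</tt>, and any other string is rejected.</p>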
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="modifiers">Option Modifiers</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Option modifiers are the flags and expressions that you pass into the
+ constructors for <tt><a href="#cl::opt">cl::opt</a></tt> and <tt><a
+ href="#cl::list">cl::list</a></tt>.  These modifiers give you the ability to
+ tweak how options are parsed and how <tt>--help</tt> output is generated to fit
+ your application well.</p>
+ 
+ <p>These options fall into five main categories:</p>
+ 
+ <ol>
+ <li><a href="#hiding">Hiding an option from <tt>--help</tt> output</a></li>
+ <li><a href="#numoccurrences">Controlling the number of occurrences
+                              required and allowed</a></li>
+ <li><a href="#valrequired">Controlling whether or not a value must be
+                            specified</a></li>
+ <li><a href="#formatting">Controlling other formatting options</a></li>
+ <li><a href="#misc">Miscellaneous option modifiers</a></li>
+ </ol>
+ 
+ <p>It is not possible to specify two options from the same category (you'll get
+ a runtime error) to a single option, except for options in the miscellaneous
+ category.  The CommandLine library specifies defaults for all of these settings
+ that are the most useful in practice and the most common, which means that you
+ usually shouldn't have to worry about these.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="hiding">Hiding an option from <tt>--help</tt> output</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::NotHidden</tt>, <tt>cl::Hidden</tt>, and
+ <tt>cl::ReallyHidden</tt> modifiers are used to control whether or not an option
+ appears in the <tt>--help</tt> and <tt>--help-hidden</tt> output for the
+ compiled program:</p>
+ 
+ <ul>
+ 
+ <li><a name="cl::NotHidden">The <b><tt>cl::NotHidden</tt></b></a> modifier
+ (which is the default for <tt><a href="#cl::opt">cl::opt</a></tt> and <tt><a
+ href="#cl::list">cl::list</a></tt> options), indicates the option is to appear
+ in both help listings.</li>
+ 
+ <li><a name="cl::Hidden">The <b><tt>cl::Hidden</tt></b></a> modifier (which is the
+ default for <tt><a href="#cl::alias">cl::alias</a></tt> options), indicates that
+ the option should not appear in the <tt>--help</tt> output, but should appear in
+ the <tt>--help-hidden</tt> output.</li>
+ 
+ <li><a name="cl::ReallyHidden">The <b><tt>cl::ReallyHidden</tt></b></a> modifier,
+ indicates that the option should not appear in any help output.</li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="numoccurrences">Controlling the number of occurrences required and
+   allowed</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>This group of options is used to control how many times an option is allowed
+ (or required) to be specified on the command line of your program.  Specifying a
+ value for this setting allows the CommandLine library to do error checking for
+ you.</p>
+ 
+ <p>The allowed values for this option group are:</p>
+ 
+ <ul>
+ 
+ <li><a name="cl::Optional">The <b><tt>cl::Optional</tt></b></a> modifier (which
+ is the default for the <tt><a href="#cl::opt">cl::opt</a></tt> and <tt><a
+ href="#cl::alias">cl::alias</a></tt> classes) indicates that your program will
+ allow either zero or one occurrence of the option to be specified.</li>
+ 
+ <li><a name="cl::ZeroOrMore">The <b><tt>cl::ZeroOrMore</tt></b></a> modifier
+ (which is the default for the <tt><a href="#cl::list">cl::list</a></tt> class)
+ indicates that your program will allow the option to be specified zero or more
+ times.</li>
+ 
+ <li><a name="cl::Required">The <b><tt>cl::Required</tt></b></a> modifier
+ indicates that the specified option must be specified exactly one time.</li>
+ 
+ <li><a name="cl::OneOrMore">The <b><tt>cl::OneOrMore</tt></b></a> modifier
+ indicates that the option must be specified at least one time.</li>
+ 
+ <li>The <b><tt>cl::ConsumeAfter</tt></b> modifier is described in the <a
+ href="#positional">Positional arguments section</a>.</li>
+ 
+ </ul>
+ 
+ <p>If an option is not specified, then the value of the option is equal to the
+ value specified by the <tt><a href="#cl::init">cl::init</a></tt> attribute.  If
+ the <tt><a href="#cl::init">cl::init</a></tt> attribute is not specified, the
+ option value is initialized with the default constructor for the data type.</p>
+ 
+ <p>If an option is specified multiple times for an option of the <tt><a
+ href="#cl::opt">cl::opt</a></tt> class, only the last value will be
+ retained.</p>
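+ 
+ <p>The error checking these modifiers enable amounts to comparing an
+ occurrence count against a range.  A minimal sketch, assuming a hypothetical
+ <tt>occurrencesOk</tt> helper (the enumerator names mirror the modifiers but
+ are redeclared here so the example stands alone):</p>
+ 

```cpp
// Local copies of the modifier names, declared here for illustration only.
enum NumOccurrences { Optional, ZeroOrMore, Required, OneOrMore };

// Hypothetical helper: is Count a legal number of occurrences for Kind?
bool occurrencesOk(NumOccurrences Kind, unsigned Count) {
  switch (Kind) {
  case Optional:   return Count <= 1;  // zero or one
  case ZeroOrMore: return true;        // any number
  case Required:   return Count == 1;  // exactly one
  case OneOrMore:  return Count >= 1;  // at least one
  }
  return false;  // not reached for valid enumerators
}
```
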
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="valrequired">Controlling whether or not a value must be specified</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>This group of options is used to control whether or not the option allows a
+ value to be present.  In the case of the CommandLine library, a value is either
+ specified with an equal sign (e.g. '<tt>-index-depth=17</tt>') or as a trailing
+ string (e.g. '<tt>-o a.out</tt>').</p>
+ 
+ <p>The allowed values for this option group are:</p>
+ 
+ <ul>
+ 
+ <li><a name="cl::ValueOptional">The <b><tt>cl::ValueOptional</tt></b></a> modifier
+ (which is the default for <tt>bool</tt> typed options) specifies that it is
+ acceptable to have a value, or not.  A boolean argument can be enabled just by
+ appearing on the command line, or it can have an explicit '<tt>-foo=true</tt>'.
+ If an option is specified with this mode, it is illegal for the value to be
+ provided without the equal sign.  Therefore '<tt>-foo true</tt>' is illegal.  To
+ get this behavior, you must use the <a
+ href="#cl::ValueRequired">cl::ValueRequired</a> modifier.</li>
+ 
+ <li><a name="cl::ValueRequired">The <b><tt>cl::ValueRequired</tt></b></a> modifier
+ (which is the default for all other types except for <a
+ href="#onealternative">unnamed alternatives using the generic parser</a>)
+ specifies that a value must be provided.  This mode informs the command line
+ library that if an option is not provided with an equal sign, the next
+ argument provided must be the value.  This allows things like '<tt>-o
+ a.out</tt>' to work.</li>
+ 
+ <li><a name="cl::ValueDisallowed">The <b><tt>cl::ValueDisallowed</tt></b></a>
+ modifier (which is the default for <a href="#onealternative">unnamed
+ alternatives using the generic parser</a>) indicates that it is a runtime error
+ for the user to specify a value.  This can be provided to disallow users from
+ providing values to boolean options (like '<tt>-foo=true</tt>').</li>
+ 
+ </ul>
+ 
+ <p>In general, the default values for this option group work just like you would
+ want them to.  As mentioned above, you can specify the <a
+ href="#cl::ValueDisallowed">cl::ValueDisallowed</a> modifier to a boolean
+ argument to restrict your command line parser.  These options are mostly useful
+ when <a href="#extensionguide">extending the library</a>.</p>
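+ 
+ <p>The three modes differ only in where a value may come from.  The sketch
+ below models that decision under simplifying assumptions; the helper
+ <tt>extractValue</tt> is hypothetical and is not how the library is
+ structured internally.</p>
+ 

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Local copies of the modifier names, declared here for illustration only.
enum ValueExpected { ValueOptional, ValueRequired, ValueDisallowed };

// Hypothetical helper: finds the value for the option at Args[I].
// Returns false on error and advances I past a consumed trailing argument.
bool extractValue(ValueExpected Mode, const std::vector<std::string> &Args,
                  std::size_t &I, std::string &Value, bool &HasValue) {
  const std::string &Arg = Args[I];
  std::string::size_type Eq = Arg.find('=');
  HasValue = Eq != std::string::npos;
  Value = HasValue ? Arg.substr(Eq + 1) : std::string();
  switch (Mode) {
  case ValueOptional:  // "-foo" or "-foo=true"; never looks at the next argument
    return true;
  case ValueRequired:  // "-o=a.out" or "-o a.out"
    if (!HasValue) {
      if (I + 1 >= Args.size())
        return false;  // no value available
      Value = Args[++I];
      HasValue = true;
    }
    return true;
  case ValueDisallowed:  // "-foo=true" is an error
    return !HasValue;
  }
  return false;
}
```

+ 
+ <p>Note that in the <tt>ValueOptional</tt> case a following
+ '<tt>true</tt>' is left alone, which is why '<tt>-foo true</tt>' does not set
+ the value of '<tt>-foo</tt>'.</p>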
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="formatting">Controlling other formatting options</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The formatting option group is used to specify that the command line option
+ has special abilities and is otherwise different from other command line
+ arguments.  As usual, you can only specify at most one of these arguments.</p>
+ 
+ <ul>
+ 
+ <li><a name="cl::NormalFormatting">The <b><tt>cl::NormalFormatting</tt></b></a>
+ modifier (which is the default for all options) specifies that this option is
+ "normal".</li>
+ 
+ <li><a name="cl::Positional">The <b><tt>cl::Positional</tt></b></a> modifier
+ specifies that this is a positional argument, that does not have a command line
+ option associated with it.  See the <a href="#positional">Positional
+ Arguments</a> section for more information.</li>
+ 
+ <li>The <b><a href="#cl::ConsumeAfter"><tt>cl::ConsumeAfter</tt></a></b> modifier
+ specifies that this option is used to capture "interpreter style" arguments.  See <a href="#cl::ConsumeAfter">this section for more information</a>.</li>
+ 
+ <li><a name="cl::Prefix">The <b><tt>cl::Prefix</tt></b></a> modifier specifies
+ that this option prefixes its value.  With 'Prefix' options, the equal sign does
+ not separate the value from the option name specified. Instead, the value is
+ everything after the prefix, including any equal sign if present. This is useful
+ for processing odd arguments like <tt>-lmalloc</tt> and <tt>-L/usr/lib</tt> in a
+ linker tool or <tt>-DNAME=value</tt> in a compiler tool.   Here, the
+ '<tt>l</tt>', '<tt>D</tt>' and '<tt>L</tt>' options are normal string (or list)
+ options, that have the <b><tt><a href="#cl::Prefix">cl::Prefix</a></tt></b>
+ modifier added to allow the CommandLine library to recognize them.  Note that
+ <b><tt><a href="#cl::Prefix">cl::Prefix</a></tt></b> options must not have the
+ <b><tt><a href="#cl::ValueDisallowed">cl::ValueDisallowed</a></tt></b> modifier
+ specified.</li>
+ 
+ <li><a name="cl::Grouping">The <b><tt>cl::Grouping</tt></b></a> modifier is used
+ to implement unix style tools (like <tt>ls</tt>) that have lots of single letter
+ arguments, but only require a single dash.  For example, the '<tt>ls -labF</tt>'
+ command actually enables four different options, all of which are single
+ letters.  Note that <b><tt><a href="#cl::Grouping">cl::Grouping</a></tt></b>
+ options cannot have values.</li>
+ 
+ </ul>
+ 
+ <p>The CommandLine library does not restrict how you use the <b><tt><a
+ href="#cl::Prefix">cl::Prefix</a></tt></b> or <b><tt><a
+ href="#cl::Grouping">cl::Grouping</a></tt></b> modifiers, so it is possible to
+ specify ambiguous argument settings.  It is also possible to have multi-letter
+ options that are prefix or grouping options, and they will still work as
+ designed.</p>
+ 
+ <p>To do this, the CommandLine library uses a greedy algorithm to parse the
+ input option into (potentially multiple) prefix and grouping options.  The
+ strategy basically looks like this:</p>
+ 
+ <div class="doc_code"><tt>parse(string OrigInput) {</tt>
+ 
+ <ol>
+ <li><tt>string input = OrigInput;</tt>
+ <li><tt>if (isOption(input)) return getOption(input).parse();</tt>    <i>// Normal option</i>
+ <li><tt>while (!isOption(input) && !input.empty()) input.pop_back();</tt>    <i>// Remove the last letter</i>
+ <li><tt>if (input.empty()) return error();</tt>    <i>// No matching option</i>
+ <li><tt>if (getOption(input).isPrefix())<br>
+   return getOption(input).parse(input);</tt>
+ <li><tt>while (!input.empty()) {    <i>// Must be grouping options</i><br>
+   getOption(input).parse();<br>
+   OrigInput.erase(OrigInput.begin(), OrigInput.begin()+input.length());<br>
+   input = OrigInput;<br>
+   while (!isOption(input) && !input.empty()) input.pop_back();<br>
+ }</tt>
+ <li><tt>if (!OrigInput.empty()) error();</tt></li>
+ </ol>
+ 
+ <p><tt>}</tt></p>
+ </div>
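+ 
+ <p>The strategy above can be turned into a small standalone program.  This is
+ a sketch, not the library's implementation: <tt>OptionTable</tt>,
+ <tt>longestKnownPrefix</tt> and <tt>parseArg</tt> are hypothetical names, the
+ option table is a plain map, and a prefix option is assumed to receive the
+ remainder of the argument as its value.</p>
+ 

```cpp
#include <map>
#include <string>
#include <utility>
#include <vector>

enum OptionKind { Normal, Prefix, Grouping };

// Maps an option name (without the leading dash) to its kind.
using OptionTable = std::map<std::string, OptionKind>;
// Each match is (option name, value); the value is empty unless prefixed.
using Matches = std::vector<std::pair<std::string, std::string>>;

// Drops trailing characters until Input names a known option or is empty.
std::string longestKnownPrefix(const OptionTable &Opts, std::string Input) {
  while (!Input.empty() && !Opts.count(Input))
    Input.pop_back();
  return Input;
}

// Greedy split of one argument. Returns false if it cannot be matched;
// Out may then hold the matches found before the failure.
bool parseArg(const OptionTable &Opts, std::string OrigInput, Matches &Out) {
  // Step 2: an exact match is a normal option.
  if (Opts.count(OrigInput)) {
    Out.push_back({OrigInput, ""});
    return true;
  }
  // Steps 3 and 4: find the longest leading option name.
  std::string Input = longestKnownPrefix(Opts, OrigInput);
  if (Input.empty())
    return false;
  // Step 5: a prefix option takes the rest of the argument as its value.
  if (Opts.at(Input) == Prefix) {
    Out.push_back({Input, OrigInput.substr(Input.size())});
    return true;
  }
  // Step 6: otherwise peel off grouping options one at a time.
  while (!Input.empty()) {
    Out.push_back({Input, ""});
    OrigInput.erase(0, Input.size());
    Input = longestKnownPrefix(Opts, OrigInput);
  }
  // Step 7: leftover characters mean part of the argument was not an option.
  return OrigInput.empty();
}
```

+ 
+ <p>With single-letter grouping options <tt>l</tt>, <tt>a</tt>, <tt>b</tt> and
+ <tt>F</tt> registered, the argument <tt>labF</tt> splits into four matches.
+ With <tt>L</tt> registered as a prefix option, <tt>L/usr/lib</tt> yields one
+ match whose value is <tt>/usr/lib</tt>.</p>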
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="misc">Miscellaneous option modifiers</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The miscellaneous option modifiers are the only flags where you can specify
+ more than one flag from the set: they are not mutually exclusive.  These flags
+ specify boolean properties that modify the option.</p>
+ 
+ <ul>
+ 
+ <li><a name="cl::CommaSeparated">The <b><tt>cl::CommaSeparated</tt></b></a> modifier
+ indicates that any commas specified for an option's value should be used to
+ split the value up into multiple values for the option.  For example, these two
+ options are equivalent when <tt>cl::CommaSeparated</tt> is specified:
+ "<tt>-foo=a -foo=b -foo=c</tt>" and "<tt>-foo=a,b,c</tt>".  This option only
+ makes sense to be used in a case where the option is allowed to accept one or
+ more values (i.e. it is a <a href="#cl::list">cl::list</a> option).</li>
+ 
+ <li><a name="cl::PositionalEatsArgs">The
+ <b><tt>cl::PositionalEatsArgs</tt></b></a> modifier (which only applies to
+ positional arguments, and only makes sense for lists) indicates that the positional
+ argument should consume any strings after it (including strings that start with
+ a "-") up until another recognized positional argument.  For example, if you
+ have two "eating" positional arguments "<tt>pos1</tt>" and "<tt>pos2</tt>" the
+ string "<tt>-pos1 -foo -bar baz -pos2 -bork</tt>" would cause the "<tt>-foo -bar
+ baz</tt>" strings to be applied to the "<tt>-pos1</tt>" option and the
+ "<tt>-bork</tt>" string to be applied to the "<tt>-pos2</tt>" option.</li>
+ 
+ </ul>
+ 
+ <p>So far, these are the only two miscellaneous option modifiers.</p>
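+ 
+ <p>The effect of <tt>cl::CommaSeparated</tt> on a single value can be sketched
+ as a plain string split.  The helper name <tt>splitCommaSeparated</tt> is
+ hypothetical, and how the library treats empty pieces (as in
+ <tt>-foo=a,,b</tt>) is not specified here; this sketch simply keeps them.</p>
+ 

```cpp
#include <string>
#include <vector>

// Hypothetical helper: splits one option value at each comma.
std::vector<std::string> splitCommaSeparated(const std::string &Value) {
  std::vector<std::string> Parts;
  std::string::size_type Start = 0;
  for (;;) {
    std::string::size_type Comma = Value.find(',', Start);
    if (Comma == std::string::npos) {
      Parts.push_back(Value.substr(Start));  // last (or only) piece
      break;
    }
    Parts.push_back(Value.substr(Start, Comma - Start));
    Start = Comma + 1;
  }
  return Parts;
}
```

+ 
+ <p>Each resulting piece is then added to the list as if it had been given by
+ its own <tt>-foo=</tt> occurrence.</p>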
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="toplevel">Top-Level Classes and Functions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Despite all of the built-in flexibility, the CommandLine option library
+ really only consists of one function (<a
+ href="#cl::ParseCommandLineOptions"><tt>cl::ParseCommandLineOptions</tt></a>)
+ and three main classes: <a href="#cl::opt"><tt>cl::opt</tt></a>, <a
+ href="#cl::list"><tt>cl::list</tt></a>, and <a
+ href="#cl::alias"><tt>cl::alias</tt></a>.  This section describes these three
+ classes in detail.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::ParseCommandLineOptions">The <tt>cl::ParseCommandLineOptions</tt>
+   function</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::ParseCommandLineOptions</tt> function is designed to be called
+ directly from <tt>main</tt>, and is used to fill in the values of all of the
+ command line option variables once <tt>argc</tt> and <tt>argv</tt> are
+ available.</p>
+ 
+ <p>The <tt>cl::ParseCommandLineOptions</tt> function requires two parameters
+ (<tt>argc</tt> and <tt>argv</tt>), but may also take an optional third parameter
+ which holds <a href="#description">additional extra text</a> to emit when the
+ <tt>--help</tt> option is invoked.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::ParseEnvironmentOptions">The <tt>cl::ParseEnvironmentOptions</tt>
+   function</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::ParseEnvironmentOptions</tt> function has mostly the same effects
+ as <a
+ href="#cl::ParseCommandLineOptions"><tt>cl::ParseCommandLineOptions</tt></a>,
+ except that it is designed to take values for options from an environment
+ variable, for those cases in which reading the command line is not convenient or
+ not desired. It fills in the values of all the command line option variables
+ just like <a
+ href="#cl::ParseCommandLineOptions"><tt>cl::ParseCommandLineOptions</tt></a>
+ does.</p>
+ 
+ <p>It takes three parameters: first, the name of the program (since
+ <tt>argv</tt> may not be available, it can't just look in <tt>argv[0]</tt>),
+ second, the name of the environment variable to examine, and third, the optional
+ <a href="#description">additional extra text</a> to emit when the
+ <tt>--help</tt> option is invoked.</p>
+ 
+ <p><tt>cl::ParseEnvironmentOptions</tt> will break the environment
+ variable's value up into words and then process them using
+ <a href="#cl::ParseCommandLineOptions"><tt>cl::ParseCommandLineOptions</tt></a>.
+ <b>Note:</b> Currently <tt>cl::ParseEnvironmentOptions</tt> does not support
+ quoting, so an environment variable containing <tt>-option "foo bar"</tt> will
+ be parsed as three words, <tt>-option</tt>, <tt>"foo</tt>, and <tt>bar"</tt>,
+ which is different from what you would get from the shell with the same
+ input.</p>
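+ 
+ <p>The quoting caveat is easy to reproduce with plain whitespace splitting.
+ The sketch below uses a hypothetical <tt>splitOnWhitespace</tt> helper built
+ on <tt>std::istringstream</tt>; it illustrates the behaviour described above
+ and is not the library's own tokenizer.</p>
+ 

```cpp
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper: breaks a string into whitespace-separated words,
// with no special handling of quote characters.
std::vector<std::string> splitOnWhitespace(const std::string &EnvValue) {
  std::istringstream In(EnvValue);
  std::vector<std::string> Words;
  std::string Word;
  while (In >> Word)
    Words.push_back(Word);
  return Words;
}
```

+ 
+ <p>Given the value <tt>-option "foo bar"</tt>, this produces the three words
+ listed above, with the quote characters still attached to the second and third
+ words.</p>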
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::SetVersionPrinter">The <tt>cl::SetVersionPrinter</tt>
+   function</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::SetVersionPrinter</tt> function is designed to be called
+ directly from <tt>main</tt>, and <i>before</i>
+ <tt>cl::ParseCommandLineOptions</tt>. Its use is optional. It simply arranges
+ for a function to be called in response to the <tt>--version</tt> option instead
+ of having the <tt>CommandLine</tt> library print out the usual version string
+ for LLVM. This is useful for programs that are not part of LLVM but wish to use
+ the <tt>CommandLine</tt> facilities. Such programs should just define a small
+ function that takes no arguments and returns <tt>void</tt> and that prints out
+ whatever version information is appropriate for the program. Pass the address
+ of that function to <tt>cl::SetVersionPrinter</tt> to arrange for it to be
+ called when the <tt>--version</tt> option is given by the user.</p>
+ 
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::opt">The <tt>cl::opt</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::opt</tt> class is the class used to represent scalar command line
+ options, and is the one used most of the time.  It is a templated class which
+ can take up to three arguments (all except for the first have default values
+ though):</p>
+ 
+ <div class="doc_code"><pre>
+ <b>namespace</b> cl {
+   <b>template</b> <<b>class</b> DataType, <b>bool</b> ExternalStorage = <b>false</b>,
+             <b>class</b> ParserClass = parser<DataType> >
+   <b>class</b> opt;
+ }
+ </pre></div>
+ 
+ <p>The first template argument specifies what underlying data type the command
+ line argument is, and is used to select a default parser implementation.  The
+ second template argument is used to specify whether the option should contain
+ the storage for the option (the default) or whether external storage should be
+ used to contain the value parsed for the option (see <a href="#storage">Internal
+ vs External Storage</a> for more information).</p>
+ 
+ <p>The third template argument specifies which parser to use.  The default value
+ selects an instantiation of the <tt>parser</tt> class based on the underlying
+ data type of the option.  In general, this default works well for most
+ applications, so this option is only used when using a <a
+ href="#customparser">custom parser</a>.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::list">The <tt>cl::list</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::list</tt> class is the class used to represent a list of command
+ line options.  It too is a templated class which can take up to three
+ arguments:</p>
+ 
+ <div class="doc_code"><pre>
+ <b>namespace</b> cl {
+   <b>template</b> <<b>class</b> DataType, <b>class</b> Storage = <b>bool</b>,
+             <b>class</b> ParserClass = parser<DataType> >
+   <b>class</b> list;
+ }
+ </pre></div>
+ 
+ <p>This class works exactly the same as the <a
+ href="#cl::opt"><tt>cl::opt</tt></a> class, except that the second argument is
+ the <b>type</b> of the external storage, not a boolean value.  For this class,
+ the marker type '<tt>bool</tt>' is used to indicate that internal storage should
+ be used.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::bits">The <tt>cl::bits</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::bits</tt> class is the class used to represent a list of command
+ line options in the form of a bit vector.  It is also a templated class which
+ can take up to three arguments:</p>
+ 
+ <div class="doc_code"><pre>
+ <b>namespace</b> cl {
+   <b>template</b> <<b>class</b> DataType, <b>class</b> Storage = <b>bool</b>,
+             <b>class</b> ParserClass = parser<DataType> >
+   <b>class</b> bits;
+ }
+ </pre></div>
+ 
+ <p>This class works exactly the same as the <a
+ href="#cl::list"><tt>cl::list</tt></a> class, except that the second argument
+ must be of <b>type</b> <tt>unsigned</tt> if external storage is used.</p>
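+ 
+ <p>One way to picture the bit vector form is shown below.  This is a sketch
+ under an assumption: that each enum value selects the bit whose position
+ equals its numeric value.  The enum <tt>Cleanup</tt> and the helpers
+ <tt>setBit</tt> and <tt>isSet</tt> are hypothetical names used only for this
+ illustration.</p>
+ 

```cpp
// Hypothetical set of flags a tool might expose as a bit vector option.
enum Cleanup { StripDebug, MergeConstants, DeadCode };

// Assumed encoding: the enum's numeric value selects the bit position.
unsigned setBit(unsigned Bits, Cleanup C) { return Bits | (1u << C); }

// Tests whether the bit for C is present in Bits.
bool isSet(unsigned Bits, Cleanup C) { return (Bits & (1u << C)) != 0; }
```

+ 
+ <p>Under that assumption, an <tt>unsigned</tt> used as external storage can
+ hold up to as many distinct values as it has bits.</p>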
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::alias">The <tt>cl::alias</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::alias</tt> class is a nontemplated class that is used to form
+ aliases for other arguments.</p>
+ 
+ <div class="doc_code"><pre>
+ <b>namespace</b> cl {
+   <b>class</b> alias;
+ }
+ </pre></div>
+ 
+ <p>The <a href="#cl::aliasopt"><tt>cl::aliasopt</tt></a> attribute should be
+ used to specify which option this is an alias for.  Alias arguments default to
+ being <a href="#cl::Hidden">Hidden</a>, and use the aliased option's parser to do
+ the conversion from string to data.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="cl::extrahelp">The <tt>cl::extrahelp</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>cl::extrahelp</tt> class is a nontemplated class that allows extra
+ help text to be printed out for the <tt>--help</tt> option.</p>
+ 
+ <div class="doc_code"><pre>
+ <b>namespace</b> cl {
+   <b>struct</b> extrahelp;
+ }
+ </pre></div>
+ 
+ <p>To use the extrahelp, simply construct one with a <tt>const char*</tt> 
+ parameter to the constructor. The text passed to the constructor will be printed
+ at the bottom of the help message, verbatim. Note that multiple
+ <tt>cl::extrahelp</tt> <b>can</b> be used, but this practice is discouraged. If
+ your tool needs to print additional help information, put all that help into a
+ single <tt>cl::extrahelp</tt> instance.</p>
+ <p>For example:</p>
+ <div class="doc_code"><pre>
+   cl::extrahelp("\nADDITIONAL HELP:\n\n  This is the extra help\n");
+ </pre></div>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="builtinparsers">Builtin parsers</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Parsers control how the string value taken from the command line is
+ translated into a typed value, suitable for use in a C++ program.  By default,
+ the CommandLine library uses an instance of <tt>parser<type></tt> if the
+ command line option specifies that it uses values of type '<tt>type</tt>'.
+ Because of this, custom option processing is specified with specializations of
+ the '<tt>parser</tt>' class.</p>
+ 
+ <p>The CommandLine library provides the following builtin parser
+ specializations, which are sufficient for most applications. It can, however,
+ also be extended to work with new data types and new ways of interpreting the
+ same data.  See the <a href="#customparser">Writing a Custom Parser</a> section for more
+ details on this type of library extension.</p>
+ 
+ <ul>
+ 
+ <li><a name="genericparser">The <b>generic <tt>parser<t></tt> parser</b></a>
+ can be used to map string values to any data type, through the use of the <a
+ href="#cl::values">cl::values</a> property, which specifies the mapping
+ information.  The most common use of this parser is for parsing enum values,
+ which allows you to use the CommandLine library for all of the error checking to
+ make sure that only valid enum values are specified (as opposed to accepting
+ arbitrary strings).  Despite this, however, the generic parser class can be used
+ for any data type.</li>
+ 
+ <li><a name="boolparser">The <b><tt>parser<bool></tt> specialization</b></a>
+ is used to convert boolean strings to a boolean value.  Currently accepted
+ strings are "<tt>true</tt>", "<tt>TRUE</tt>", "<tt>True</tt>", "<tt>1</tt>",
+ "<tt>false</tt>", "<tt>FALSE</tt>", "<tt>False</tt>", and "<tt>0</tt>".</li>
+ 
+ <li><a name="stringparser">The <b><tt>parser<string></tt>
+ specialization</b></a> simply stores the parsed string into the string value
+ specified.  No conversion or modification of the data is performed.</li>
+ 
+ <li><a name="intparser">The <b><tt>parser<int></tt> specialization</b></a>
+ uses the C <tt>strtol</tt> function to parse the string input.  As such, it will
+ accept a decimal number (with an optional '+' or '-' prefix) which must start
+ with a non-zero digit.  It accepts octal numbers, which are identified with a
+ '<tt>0</tt>' prefix digit, and hexadecimal numbers with a prefix of
+ '<tt>0x</tt>' or '<tt>0X</tt>'.</li>
+ 
+ <li><a name="doubleparser">The <b><tt>parser<double></tt></b></a> and
+ <b><tt>parser<float></tt> specializations</b> use the standard C
+ <tt>strtod</tt> function to convert floating point strings into floating point
+ values.  As such, a broad range of string formats is supported, including
+ exponential notation (ex: <tt>1.7e15</tt>), and locales are properly supported.
+ </li>
+ 
+ </ul>
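+ 
+ <p>The prefix rules described for <tt>parser<int></tt> come from calling
+ <tt>strtol</tt> with a base of 0.  The following is a minimal standalone sketch
+ of that behavior; the helper name <tt>parseIntLikeStrtol</tt> is hypothetical
+ and is not part of the CommandLine library, and rejecting trailing characters
+ is an assumption made for the example.</p>

```cpp
#include <cstdlib>
#include <string>

// Hypothetical helper, not part of the CommandLine library.
// Parses Arg with strtol and base 0, which auto-detects decimal, octal
// ('0' prefix) and hexadecimal ('0x' or '0X' prefix) input.
// Follows the library's convention of returning true on error.
bool parseIntLikeStrtol(const std::string &Arg, long &Val) {
  const char *Start = Arg.c_str();
  char *End;
  Val = std::strtol(Start, &End, 0);
  // Assumption for this sketch: it is an error if nothing was consumed
  // or if unparsed characters remain after the number.
  return End == Start || *End != '\0';
}
```

+ <p>With this sketch, "<tt>017</tt>" is read as octal, "<tt>0x1F</tt>" as
+ hexadecimal, and "<tt>-42</tt>" as a negative decimal number.</p>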
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="extensionguide">Extension Guide</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Although the CommandLine library has a lot of functionality built into it
+ already (as discussed previously), one of its true strengths lies in its
+ extensibility.  This section discusses how the CommandLine library works under
+ the covers and illustrates how to do some simple, common, extensions.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="customparser">Writing a custom parser</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>One of the simplest and most common extensions is the use of a custom parser.
+ As <a href="#builtinparsers">discussed previously</a>, parsers are the portion
+ of the CommandLine library that turns string input from the user into a
+ particular parsed data type, validating the input in the process.</p>
+ 
+ <p>There are two ways to use a new parser:</p>
+ 
+ <ol>
+ 
+ <li>
+ 
+ <p>Specialize the <a href="#genericparser"><tt>cl::parser</tt></a> template for
+ your custom data type.</p>
+ 
+ <p>This approach has the advantage that users of your custom data type will
+ automatically use your custom parser whenever they define an option with a value
+ type of your data type.  The disadvantage of this approach is that it doesn't
+ work if your fundamental data type is something that is already supported.</p>
+ 
+ </li>
+ 
+ <li>
+ 
+ <p>Write an independent class, using it explicitly from options that need
+ it.</p>
+ 
+ <p>This approach works well in situations where you would like to parse an
+ option using special syntax for a not-very-special data-type.  The drawback of
+ this approach is that users of your parser have to be aware that they are using
+ your parser, instead of the builtin ones.</p>
+ 
+ </li>
+ 
+ </ol>
+ 
+ <p>To guide the discussion, we will walk through a custom parser that accepts file
+ sizes, specified with an optional unit after the numeric size.  For example, we
+ would like to parse "102kb", "41M", "1G" into the appropriate integer value.  In
+ this case, the underlying data type we want to parse into is
+ '<tt>unsigned</tt>'.  We choose approach #2 above because we don't want to make
+ this the default for all <tt>unsigned</tt> options.</p>
+ 
+ <p>To start out, we declare our new <tt>FileSizeParser</tt> class:</p>
+ 
+ <div class="doc_code"><pre>
+ <b>struct</b> FileSizeParser : <b>public</b> cl::basic_parser<<b>unsigned</b>> {
+   <i>// parse - Return true on error.</i>
+   <b>bool</b> parse(cl::Option &O, <b>const char</b> *ArgName, <b>const</b> std::string &ArgValue,
+              <b>unsigned</b> &Val);
+ };
+ </pre></div>
+ 
+ <p>Our new class inherits from the <tt>cl::basic_parser</tt> template class to
+ fill in the default, boilerplate code for us.  We give it the data type that
+ we parse into (the type of the last argument to the <tt>parse</tt> method) so
+ that clients of our custom parser know what object type to pass in to the
+ <tt>parse</tt> method (here we declare that we parse into '<tt>unsigned</tt>'
+ variables).</p>
+ 
+ <p>For most purposes, the only method that must be implemented in a custom
+ parser is the <tt>parse</tt> method.  The <tt>parse</tt> method is called
+ whenever the option is invoked, passing in the option itself, the option name,
+ the string to parse, and a reference to a return value.  If the string to parse
+ is not well formed, the parser should output an error message and return true.
+ Otherwise it should return false and set '<tt>Val</tt>' to the parsed value.  In
+ our example, we implement <tt>parse</tt> as:</p>
+ 
+ <div class="doc_code"><pre>
+ <b>bool</b> FileSizeParser::parse(cl::Option &O, <b>const char</b> *ArgName,
+                            <b>const</b> std::string &Arg, <b>unsigned</b> &Val) {
+   <b>const char</b> *ArgStart = Arg.c_str();
+   <b>char</b> *End;
+  
+   <i>// Parse integer part, leaving 'End' pointing to the first non-integer char</i>
+   Val = (unsigned)strtol(ArgStart, &End, 0);
+ 
+   <b>while</b> (1) {
+     <b>switch</b> (*End++) {
+     <b>case</b> 0: <b>return</b> false;   <i>// No error</i>
+     <b>case</b> 'i':               <i>// Ignore the 'i' in KiB if people use that</i>
+     <b>case</b> 'b': <b>case</b> 'B':     <i>// Ignore B suffix</i>
+       <b>break</b>;
+ 
+     <b>case</b> 'g': <b>case</b> 'G': Val *= 1024*1024*1024; <b>break</b>;
+     <b>case</b> 'm': <b>case</b> 'M': Val *= 1024*1024;      <b>break</b>;
+     <b>case</b> 'k': <b>case</b> 'K': Val *= 1024;           <b>break</b>;
+ 
+     <b>default</b>:
+       <i>// Print an error message if unrecognized character!</i>
+       <b>return</b> O.error(": '" + Arg + "' value invalid for file size argument!");
+     }
+   }
+ }
+ </pre></div>
+ 
+ <p>This function implements a very simple parser for the kinds of strings we are
+ interested in.  Although it has some holes (it allows "<tt>123KKK</tt>" for
+ example), it is good enough for this example.  Note that we use the option
+ itself to print out the error message (the <tt>error</tt> method always returns
+ true) in order to get a nice error message (shown below).  Now that we have our
+ parser class, we can use it like this:</p>
+ 
+ <div class="doc_code"><pre>
+ <b>static</b> <a href="#cl::opt">cl::opt</a><<b>unsigned</b>, <b>false</b>, FileSizeParser>
+ MFS(<i>"max-file-size"</i>, <a href="#cl::desc">cl::desc</a>(<i>"Maximum file size to accept"</i>),
+     <a href="#cl::value_desc">cl::value_desc</a>("<i>size</i>"));
+ </pre></div>
+ 
+ <p>Which adds this to the output of our program:</p>
+ 
+ <div class="doc_code"><pre>
+ OPTIONS:
+   -help                 - display available options (--help-hidden for more)
+   ...
+   <b>-max-file-size=<size> - Maximum file size to accept</b>
+ </pre></div>
+ 
+ <p>And we can test that our parser works correctly now (the test program just
+ prints out the max-file-size argument value):</p>
+ 
+ <div class="doc_code"><pre>
+ $ ./test
+ MFS: 0
+ $ ./test -max-file-size=123MB
+ MFS: 128974848
+ $ ./test -max-file-size=3G
+ MFS: 3221225472
+ $ ./test -max-file-size=dog
+ -max-file-size option: 'dog' value invalid for file size argument!
+ </pre></div>
+ 
+ <p>It looks like it works.  The error message that we get is nice and helpful,
+ and we seem to accept reasonable file sizes.  This wraps up the "custom parser"
+ tutorial.</p>
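+ 
+ <p>The holes mentioned earlier (such as accepting "<tt>123KKK</tt>") could be
+ closed by allowing at most one unit suffix.  The following is only a sketch
+ under stated assumptions: <tt>parseFileSizeStrict</tt> is a hypothetical
+ standalone function, not part of the CommandLine library, and it parses into
+ <tt>unsigned long long</tt> rather than <tt>unsigned</tt> to leave headroom for
+ large sizes.  It keeps the convention of returning true on error, so its body
+ could be adapted into <tt>FileSizeParser::parse</tt>.</p>

```cpp
#include <cctype>
#include <cstdlib>
#include <string>

// Hypothetical stricter variant of the file size parser (illustration only).
// Accepts <number>[k|m|g][i][b], case-insensitive for the unit and the 'b'.
// Returns true on error, matching the convention used by parse() above.
bool parseFileSizeStrict(const std::string &Arg, unsigned long long &Val) {
  // Require a leading digit so that signs and empty input are rejected.
  if (Arg.empty() || !std::isdigit(static_cast<unsigned char>(Arg[0])))
    return true;

  // Parse the integer part, leaving End at the first non-integer character.
  char *End;
  unsigned long long Num = std::strtoull(Arg.c_str(), &End, 0);

  // At most one unit letter is accepted.
  unsigned long long Scale = 1;
  bool HadUnit = true;
  switch (*End) {
  case 'k': case 'K': Scale = 1024ULL;               break;
  case 'm': case 'M': Scale = 1024ULL * 1024;        break;
  case 'g': case 'G': Scale = 1024ULL * 1024 * 1024; break;
  default:            HadUnit = false;               break;
  }
  if (HadUnit) {
    ++End;
    if (*End == 'i') ++End;            // The 'i' in KiB, only after a unit
  }
  if (*End == 'b' || *End == 'B') ++End; // Optional B suffix

  if (*End != '\0')
    return true;                       // Trailing junk such as "KKK"

  Val = Num * Scale;                   // Overflow is not checked in this sketch
  return false;
}
```

+ <p>Under these rules "<tt>123MB</tt>" and "<tt>4KiB</tt>" are accepted, while
+ "<tt>123KKK</tt>" and "<tt>dog</tt>" are rejected.</p>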
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="explotingexternal">Exploiting external storage</a>
+ </div>
+ 
+ <div class="doc_text">
+   <p>Several of the LLVM libraries define static <tt>cl::opt</tt> instances that
+   will automatically be included in any program that links with that library.
+   This is a feature. However, sometimes it is necessary to know the value of the
+   command line option outside of the library. In these cases the library does or
+   should provide an external storage location that is accessible to users of the
+   library. Examples of this include the <tt>llvm::DebugFlag</tt> exported by the
+   <tt>lib/Support/Debug.cpp</tt> file and the <tt>llvm::TimePassesIsEnabled</tt>
+   flag exported by the <tt>lib/VMCore/Pass.cpp</tt> file.</p>
+ 
+ <p>TODO: complete this section</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="dynamicopts">Dynamically adding command line options</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>TODO: fill in this section</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/CompilerDriver.html
diff -c /dev/null llvm-www/releases/1.8/docs/CompilerDriver.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/CompilerDriver.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,823 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>The LLVM Compiler Driver (llvmc)</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+   <meta name="author" content="Reid Spencer">
+   <meta name="description" 
+   content="A description of the use and design of the LLVM Compiler Driver.">
+ </head>
+ <body>
+ <div class="doc_title">The LLVM Compiler Driver (llvmc)</div>
+ <p class="doc_warning">NOTE: This document is a work in progress!</p>
+ <ol>
+   <li><a href="#abstract">Abstract</a></li>
+   <li><a href="#introduction">Introduction</a>
+     <ol>
+       <li><a href="#purpose">Purpose</a></li>
+       <li><a href="#operation">Operation</a></li>
+       <li><a href="#phases">Phases</a></li>
+       <li><a href="#actions">Actions</a></li>
+     </ol>
+   </li>
+   <li><a href="#configuration">Configuration</a>
+     <ol>
+       <li><a href="#overview">Overview</a></li>
+       <li><a href="#filetypes">Configuration Files</a></li>
+       <li><a href="#syntax">Syntax</a></li>
+       <li><a href="#substitutions">Substitutions</a></li>
+       <li><a href="#sample">Sample Config File</a></li>
+     </ol>
+   </li>
+   <li><a href="#glossary">Glossary</a></li>
+ </ol>
+ <div class="doc_author">
+ <p>Written by <a href="mailto:rspencer at x10sys.com">Reid Spencer</a>
+ </p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="abstract">Abstract</a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <p>This document describes the requirements, design, and configuration of the
+   LLVM compiler driver, <tt>llvmc</tt>.  The compiler driver knows about LLVM's 
+   tool set and can be configured to know about a variety of compilers for 
+   source languages.  It uses this knowledge to execute the tools necessary 
+   to accomplish general compilation, optimization, and linking tasks. The main 
+   purpose of <tt>llvmc</tt> is to provide a simple and consistent interface to 
+   all compilation tasks. This reduces the burden on the end user who can just 
+   learn to use <tt>llvmc</tt> instead of the entire LLVM tool set and all the
+   source language compilers compatible with LLVM.</p>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="introduction">Introduction</a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <p>The <tt>llvmc</tt> <a href="#def_tool">tool</a> is a configurable compiler 
+   <a href="#def_driver">driver</a>. As such, it isn't a compiler, optimizer, 
+   or a linker itself but it drives (invokes) other software that performs those 
+   tasks. If you are familiar with the GNU Compiler Collection's <tt>gcc</tt> 
+   tool, <tt>llvmc</tt> is very similar.</p>
+   <p>The following introductory sections will help you understand why this tool
+   is necessary and what it does.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="purpose">Purpose</a></div>
+ <div class="doc_text">
+   <p><tt>llvmc</tt> was invented to make compilation of user programs with 
+   LLVM-based tools easier. To accomplish this, <tt>llvmc</tt> strives to:</p>
+   <ul>
+     <li>Be the single point of access to most of the LLVM tool set.</li>
+     <li>Hide the complexities of the LLVM tools through a single interface.</li>
+     <li>Provide a consistent interface for compiling all languages.</li>
+   </ul>
+   <p>Additionally, <tt>llvmc</tt> makes it easier to write a compiler for use
+   with LLVM, because it:</p>
+   <ul>
+     <li>Makes integration of existing non-LLVM tools simple.</li>
+     <li>Extends the capabilities of minimal compiler tools by optimizing their
+     output.</li>
+     <li>Reduces the number of interfaces a compiler writer must know about
+     before a working compiler can be completed (essentially only the VMCore
+     interfaces need to be understood).</li>
+     <li>Supports source language translator invocation via both dynamically
+     loadable shared objects and invocation of an executable.</li>
+   </ul>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="operation">Operation</a></div>
+ <div class="doc_text">
+   <p>At a high level, <tt>llvmc</tt> operation is very simple.  The basic action
+   taken by <tt>llvmc</tt> is to simply invoke some tool or set of tools to fill 
+   the user's request for compilation. Every execution of <tt>llvmc</tt> takes the 
+   following sequence of steps:</p>
+   <dl>
+     <dt><b>Collect Command Line Options</b></dt>
+     <dd>The command line options provide the marching orders to <tt>llvmc</tt> 
+     on what actions it should perform. This is the request the user is making 
+     of <tt>llvmc</tt> and it is interpreted first. See the <tt>llvmc</tt>
+     <a href="CommandGuide/html/llvmc.html">manual page</a> for details on the
+     options.</dd>
+     <dt><b>Read Configuration Files</b></dt>
+     <dd>Based on the options and the suffixes of the filenames presented, a set 
+     of configuration files are read to configure the actions <tt>llvmc</tt> will 
+     take.  Configuration files are provided by either LLVM or the 
+     compiler tools that <tt>llvmc</tt> invokes. These files determine what 
+     actions <tt>llvmc</tt> will take in response to the user's request. See 
+     the section on <a href="#configuration">configuration</a> for more details.
+     </dd>
+     <dt><b>Determine Phases To Execute</b></dt>
+     <dd>Based on the command line options and configuration files,
+     <tt>llvmc</tt> determines the compilation <a href="#phases">phases</a> that
+     must be executed to satisfy the user's request. This is the primary work of
+     <tt>llvmc</tt>.</dd>
+     <dt><b>Determine Actions To Execute</b></dt>
+     <dd>Each <a href="#phases">phase</a> to be executed can result in the
+     invocation of one or more <a href="#actions">actions</a>. An action is
+     either a whole program or a function in a dynamically linked shared library. 
+     In this step, <tt>llvmc</tt> determines the sequence of actions that must be 
+     executed. Actions will always be executed in a deterministic order.</dd>
+     <dt><b>Execute Actions</b></dt>
+     <dd>The <a href="#actions">actions</a> necessary to support the user's
+     original request are executed sequentially and deterministically. All 
+     actions result in either the invocation of a whole program to perform the 
+     action or the loading of a dynamically linkable shared library and invocation 
+     of a standard interface function within that library.</dd> 
+     <dt><b>Termination</b></dt>
+     <dd>If any action fails (returns a non-zero result code), <tt>llvmc</tt>
+     also fails and returns the result code from the failing action. If
+     everything succeeds, <tt>llvmc</tt> will return a zero result code.</dd>
+   </dl>
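+   <p>The execution and termination steps above can be sketched as a simple
+   loop. This is an illustration only: the <tt>Action</tt> type and the
+   <tt>runActions</tt> function are hypothetical names, not <tt>llvmc</tt>'s
+   real interface, and stopping at the first failing action is an assumption of
+   the sketch.</p>

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical model of an action: a callable that returns a process-style
// result code, where 0 means success. Not llvmc's real interface.
typedef std::function<int()> Action;

// Run the actions sequentially, in order. If an action returns a non-zero
// result code, stop (an assumption of this sketch) and return that code.
// Return 0 if every action succeeds.
int runActions(const std::vector<Action> &Actions) {
  for (std::size_t i = 0; i != Actions.size(); ++i) {
    int Result = Actions[i]();
    if (Result != 0)
      return Result;
  }
  return 0;
}
```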
+   <p><tt>llvmc</tt>'s operation must be simple, regular and predictable. 
+   Developers need to be able to rely on it to take a consistent approach to
+   compilation. For example, the invocation:</p>
+   <code>
+     llvmc -O2 x.c y.c z.c -o xyz</code>
+   <p>must produce <i>exactly</i> the same results as:</p>
+   <pre><tt>
+     llvmc -O2 x.c -o x.o
+     llvmc -O2 y.c -o y.o
+     llvmc -O2 z.c -o z.o
+     llvmc -O2 x.o y.o z.o -o xyz</tt></pre>
+   <p>To accomplish this, <tt>llvmc</tt> uses a very simple goal oriented
+   procedure to do its work. The overall goal is to produce a functioning
+   executable. To accomplish this, <tt>llvmc</tt> always attempts to execute a 
+   series of compilation <a href="#def_phase">phases</a> in the same sequence. 
+   However, the user's options to <tt>llvmc</tt> can cause the sequence of phases 
+   to start in the middle or finish early.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="phases"></a>Phases </div>
+ <div class="doc_text">
+   <p><tt>llvmc</tt> breaks every compilation task into the following five 
+   distinct phases:</p>
+   <dl><dt><b>Preprocessing</b></dt><dd>Not all languages support preprocessing; 
+     but for those that do, this phase can be invoked. This phase is for 
+     languages that support combining, filtering, or otherwise altering the 
+     source language input before the translator parses it. Although C and C++ 
+     are the most common users of this phase, other languages may provide their 
+     own preprocessor (whether it's the C pre-processor or not).</dd>
+   </dl>
+   <dl><dt><b>Translation</b></dt><dd>The translation phase converts the source 
+     language input into something that LLVM can interpret and use for 
+     downstream phases. The translation is essentially from "non-LLVM form" to
+     "LLVM form".</dd>
+   </dl>
+   <dl><dt><b>Optimization</b></dt><dd>Once an LLVM Module has been obtained from 
+     the translation phase, the program enters the optimization phase. This phase 
+     attempts to optimize all of the input provided on the command line according 
+     to the options provided.</dd>
+   </dl>
+   <dl><dt><b>Linking</b></dt><dd>The inputs are combined to form a complete
+     program.</dd>
+   </dl>
+   <p>The following table shows the inputs, outputs, and command line options
+   applicable to each phase.</p>
+   <table>
+     <tr>
+       <th style="width: 10%">Phase</th>
+       <th style="width: 25%">Inputs</th>
+       <th style="width: 25%">Outputs</th>
+       <th style="width: 40%">Options</th>
+     </tr>
+     <tr><td><b>Preprocessing</b></td>
+       <td class="td_left"><ul><li>Source Language File</li></ul></td>
+       <td class="td_left"><ul><li>Source Language File</li></ul></td>
+       <td class="td_left"><dl>
+           <dt><tt>-E</tt></dt>
+           <dd>Stops the compilation after preprocessing</dd>
+       </dl></td>
+     </tr>
+     <tr>
+       <td><b>Translation</b></td>
+       <td class="td_left"><ul>
+           <li>Source Language File</li>
+       </ul></td>
+       <td class="td_left"><ul>
+           <li>LLVM Assembly</li>
+           <li>LLVM Bytecode</li>
+           <li>LLVM C++ IR</li>
+       </ul></td>
+       <td class="td_left"><dl>
+           <dt><tt>-c</tt></dt>
+           <dd>Stops the compilation after translation so that optimization and 
+           linking are not done.</dd>
+           <dt><tt>-S</tt></dt>
+           <dd>Stops the compilation before object code is written so that only
+           assembly code remains.</dd>
+       </dl></td>
+     </tr>
+     <tr>
+       <td><b>Optimization</b></td>
+       <td class="td_left"><ul>
+           <li>LLVM Assembly</li>
+           <li>LLVM Bytecode</li>
+       </ul></td>
+       <td class="td_left"><ul>
+           <li>LLVM Bytecode</li>
+       </ul></td>
+       <td class="td_left"><dl>
+           <dt><tt>-Ox</tt></dt>
+           <dd>This group of options controls the amount of optimization 
+           performed.</dd>
+       </dl></td>
+     </tr>
+     <tr>
+       <td><b>Linking</b></td>
+       <td class="td_left"><ul>
+           <li>LLVM Bytecode</li>
+           <li>Native Object Code</li>
+           <li>LLVM Library</li>
+           <li>Native Library</li>
+       </ul></td>
+       <td class="td_left"><ul>
+           <li>LLVM Bytecode Executable</li>
+           <li>Native Executable</li>
+       </ul></td>
+       <td class="td_left"><dl>
+           <dt><tt>-L</tt></dt><dd>Specifies a path for library search.</dd>
+           <dt><tt>-l</tt></dt><dd>Specifies a library to link in.</dd>
+       </dl></td>
+     </tr>
+   </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="actions"></a>Actions</div>
+ <div class="doc_text">
+   <p>An action, with regard to <tt>llvmc</tt>, is a basic operation that it takes
+   in order to fulfill the user's request. Each phase of compilation will invoke
+   zero or more actions in order to accomplish that phase.</p>
+   <p>Actions come in two forms:</p>
+   <ul>
+     <li>Invokable Executables</li>
+     <li>Functions in a shared library</li>
+   </ul>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="configuration">Configuration</a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <p>This section of the document describes the configuration files used by
+   <tt>llvmc</tt>.  Configuration information is relatively static for a 
+   given release of LLVM and a compiler tool. However, the details may 
+   change from release to release of either.  Users are encouraged to simply use 
+   the various options of the <tt>llvmc</tt> command and ignore the configuration 
+   of the tool. These configuration files are for compiler writers and LLVM 
+   developers. Those wishing to simply use <tt>llvmc</tt> don't need to understand 
+   this section but it may be instructive on how the tool works.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="overview"></a>Overview</div>
+ <div class="doc_text">
+ <p><tt>llvmc</tt> is highly configurable both on the command line and in 
+ configuration files. The options it understands are generic, consistent and 
+ simple by design.  Furthermore, the <tt>llvmc</tt> options apply to the 
+ compilation of any LLVM enabled programming language. To be enabled as a 
+ supported source language compiler, a compiler writer must provide a 
+ configuration file that tells <tt>llvmc</tt> how to invoke the compiler 
+ and what its capabilities are. The purpose of the configuration files then 
+ is to allow compiler writers to specify to <tt>llvmc</tt> how the compiler 
+ should be invoked. Users may, but are not advised to, alter the compiler's 
+ <tt>llvmc</tt> configuration.</p>
+ 
+ <p>Because <tt>llvmc</tt> just invokes other programs, it must deal with the
+ available command line options for those programs regardless of whether they
+ were written for LLVM or not. Furthermore, not all compiler tools will
+ have the same capabilities. Some compiler tools will simply generate LLVM assembly
+ code, others will be able to generate fully optimized byte code. In general,
+ <tt>llvmc</tt> doesn't make any assumptions about the capabilities or command 
+ line options of a sub-tool. It simply uses the details found in the 
+ configuration files and leaves it to the compiler writer to specify the 
+ configuration correctly.</p>
+ 
+ <p>This approach means that new compiler tools can be up and working very
+ quickly. As a first cut, a tool can simply compile its source to raw
+ (unoptimized) bytecode or LLVM assembly and <tt>llvmc</tt> can be configured 
+ to pick up the slack (translate LLVM assembly to bytecode, optimize the 
+ bytecode, generate native assembly, link, etc.).   In fact, a compiler tool 
+ need not use any LLVM libraries, and it could be written in any language 
+ (instead of C++).  The configuration data will allow the full range of 
+ optimization, assembly, and linking capabilities that LLVM provides to be added 
+ to these kinds of tools.  Enabling the rapid development of front-ends is one 
+ of the primary goals of <tt>llvmc</tt>.</p>
+ 
+ <p>As a compiler tool matures, it may utilize the LLVM libraries and tools 
+ to more efficiently produce optimized bytecode directly in a single compilation 
+ and optimization program. In these cases, multiple tools would not be needed 
+ and the configuration data for the compiler would change.</p>
+ 
+ <p>Configuring <tt>llvmc</tt> to the needs and capabilities of a source language 
+ compiler is relatively straight-forward.  A compiler writer must provide a 
+ definition of what to do for each of the five compilation phases for each of 
+ the optimization levels. The specification consists simply of prototypical 
+ command lines into which <tt>llvmc</tt> can substitute command line
+ arguments and file names. Note that any given phase can be completely blank if
+ the source language's compiler combines multiple phases into a single program.
+ For example, quite often pre-processing, translation, and optimization are
+ combined into a single program. The specification for such a compiler would have
+ blank entries for pre-processing and translation but a full command line for
+ optimization.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="filetypes">Configuration Files</a></div>
+ <div class="doc_subsubsection"><a name="filecontents">File Contents</a></div>
+ <div class="doc_text">
+   <p>Each configuration file provides the details for a single source language
+   that is to be compiled.  This configuration information tells <tt>llvmc</tt> 
+   how to invoke the language's pre-processor, translator, optimizer, assembler
+   and linker. Note that a given source language needn't provide all these tools
+   as many of them already exist in LLVM.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"><a name="dirsearch">Directory Search</a></div>
+ <div class="doc_text">
+   <p><tt>llvmc</tt> always looks for files of a specific name. It uses the
+   first file with the name it's looking for by searching directories in the
+   following order:</p>
+   <ol>
+     <li>Any directory specified by the <tt>-config-dir</tt> option will be
+     checked first.</li>
+     <li>If the environment variable LLVM_CONFIG_DIR is set, and it contains
+     the name of a valid directory, that directory will be searched next.</li>
+     <li>If the user's home directory (typically <tt>/home/user</tt>) contains 
+     a sub-directory named <tt>.llvm</tt> and that directory contains a 
+     sub-directory named <tt>etc</tt> then that directory will be tried 
+     next.</li>
+     <li>If the LLVM installation directory (typically <tt>/usr/local/llvm</tt>)
+     contains a sub-directory named <tt>etc</tt> then that directory will be
+     tried next.</li>
+     <li>A standard "system" directory will be searched next. This is typically
+     <tt>/etc/llvm</tt> on UNIX™ and <tt>C:\WINNT</tt> on Microsoft
+     Windows™.</li>
+     <li>If the configuration file sought still can't be found, <tt>llvmc</tt>
+     will print an error message and exit.</li>
+   </ol>
+   <p>The first file found in this search will be used. Other files with the 
+   same name will be ignored even if they exist in one of the subsequent search
+   locations.</p>
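+   <p>The search order above can be sketched as follows. This is a minimal
+   illustration, not <tt>llvmc</tt>'s real code: the function names
+   <tt>buildConfigSearchPath</tt> and <tt>findConfigFile</tt>, their parameters,
+   and the caller-supplied existence check are all hypothetical. The sketch
+   also does not verify that each candidate directory itself exists.</p>

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: build the ordered list of directories to search.
// An empty argument means "not set" or "not available".
std::vector<std::string> buildConfigSearchPath(const std::string &ConfigDirOpt,
                                               const std::string &EnvConfigDir,
                                               const std::string &HomeDir,
                                               const std::string &InstallDir,
                                               const std::string &SystemDir) {
  std::vector<std::string> Dirs;
  if (!ConfigDirOpt.empty()) Dirs.push_back(ConfigDirOpt);         // -config-dir
  if (!EnvConfigDir.empty()) Dirs.push_back(EnvConfigDir);         // LLVM_CONFIG_DIR
  if (!HomeDir.empty())      Dirs.push_back(HomeDir + "/.llvm/etc");
  if (!InstallDir.empty())   Dirs.push_back(InstallDir + "/etc");
  if (!SystemDir.empty())    Dirs.push_back(SystemDir);            // e.g. /etc/llvm
  return Dirs;
}

// Return the first path Dir/Name for which Exists(path) is true, or an empty
// string if no directory contains the file. Later matches are ignored.
template <typename ExistsFn>
std::string findConfigFile(const std::vector<std::string> &Dirs,
                           const std::string &Name, ExistsFn Exists) {
  for (std::size_t i = 0; i != Dirs.size(); ++i) {
    std::string Path = Dirs[i] + "/" + Name;
    if (Exists(Path))
      return Path;
  }
  return std::string();
}
```

+   <p>An empty result corresponds to the last step in the list, where
+   <tt>llvmc</tt> prints an error message and exits.</p>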
+ </div>
+ 
+ <div class="doc_subsubsection"><a name="filenames">File Names</a></div>
+ <div class="doc_text">
+   <p>In the directories searched, each configuration file is given a specific
+   name to foster faster lookup (so llvmc doesn't have to do directory searches).
+   The name of a given language specific configuration file is simply the same 
+   as the suffix used to identify files containing source in that language. 
+   For example, a configuration file for C++ source might be named 
+   <tt>cpp</tt>, <tt>C</tt>, or <tt>cxx</tt>. For languages that support multiple
+   file suffixes, multiple (probably identical) files (or symbolic links) will
+   need to be provided.</p>
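+   <p>Deriving the configuration file name from a source file name can be
+   sketched as below. The helper <tt>configNameFor</tt> is a hypothetical name
+   used only for illustration; it simply returns the text after the last period
+   in the final path component.</p>

```cpp
#include <string>

// Hypothetical helper: the configuration file name is the suffix of the
// source file name (the text after the last '.'). Returns an empty string
// if the final path component has no suffix.
std::string configNameFor(const std::string &SourceFile) {
  std::string::size_type Slash = SourceFile.find_last_of("/\\");
  std::string::size_type Dot = SourceFile.rfind('.');
  if (Dot == std::string::npos ||
      (Slash != std::string::npos && Dot < Slash))
    return std::string();
  return SourceFile.substr(Dot + 1);
}
```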
+ </div>
+ 
+ <div class="doc_subsubsection"><a name="whatgetsread">What Gets Read</a></div>
+ <div class="doc_text">
+   <p>Which configuration files are read depends on the command line options and 
+   the suffixes of the file names provided on <tt>llvmc</tt>'s command line. Note
+   that the <tt>-x LANGUAGE</tt> option alters the language that <tt>llvmc</tt>
+   uses for the subsequent files on the command line.  Only the configuration 
+   files actually needed to complete <tt>llvmc</tt>'s task are read. Other 
+   language specific files will be ignored.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="syntax"></a>Syntax</div>
+ <div class="doc_text">
+   <p>The syntax of the configuration files is very simple and somewhat
+   compatible with Java's property files. Here are the syntax rules:</p>
+   <ul>
+     <li>The file encoding is ASCII.</li>
+     <li>The file is line oriented. There should be one configuration definition 
+     per line. Lines are terminated by the newline (0x0A) and/or carriage return
+     characters (0x0D).</li>
+     <li>A backslash (<tt>\</tt>) before a newline causes the newline to be
+     ignored. This is useful for line continuation of long definitions. A
+     backslash anywhere else is recognized as a backslash.</li>
+     <li>A configuration item consists of a name, an <tt>=</tt> and a value.</li>
+     <li>A name consists of a sequence of identifiers separated by periods.</li>
+     <li>An identifier consists of specific keywords made up of only lower case
+     and upper case letters (e.g. <tt>lang</tt> and <tt>name</tt> in
+     <tt>lang.name</tt>).</li>
+     <li>Values come in four flavors: booleans, integers, commands and 
+     strings.</li>
+     <li>Valid "false" boolean values are <tt>false False FALSE no No NO
+       off Off</tt> and <tt>OFF</tt>.</li>
+     <li>Valid "true" boolean values are <tt>true True TRUE yes Yes YES
+       on On</tt> and <tt>ON</tt>.</li>
+     <li>Integers are simply sequences of digits.</li>
+     <li>Commands start with a program name and are followed by a sequence of
+     words that are passed to that program as command line arguments. Program
+     arguments that begin and end with the <tt>%</tt> sign will have their value
+     substituted. Program names beginning with <tt>/</tt> are considered to be
+     absolute. Otherwise the <tt>PATH</tt> will be applied to find the program to
+     execute.</li>
+     <li>Strings are composed of multiple sequences of characters from the
+     character class <tt>[-A-Za-z0-9_:%+/\\|,]</tt> separated by white
+     space.</li>
+     <li>White space on a line is folded. Multiple blanks or tabs will be
+     reduced to a single blank.</li>
+     <li>White space before the configuration item's name is ignored.</li>
+     <li>White space on either side of the <tt>=</tt> is ignored.</li>
+     <li>White space in a string value is used to separate the individual
+     components of the string value but otherwise ignored.</li>
+     <li>Comments are introduced by the <tt>#</tt> character. Everything after a
+     <tt>#</tt> and before the end of line is ignored.</li>
+   </ul>
+ </div>
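+ 
+ <div class="doc_text">
+   <p>As a rough illustration of the rules above, the following Python sketch
+   parses configuration text into name/value pairs. It is not <tt>llvmc</tt>'s
+   actual parser: the helper names <tt>parse_config</tt> and <tt>classify</tt>
+   are hypothetical, and commands are left as plain strings because telling a
+   command from a string depends on the item name rather than on the syntax.</p>

```python
import re

TRUE_WORDS = {"true", "True", "TRUE", "yes", "Yes", "YES", "on", "On", "ON"}
FALSE_WORDS = {"false", "False", "FALSE", "no", "No", "NO", "off", "Off", "OFF"}


def classify(value):
    """Map raw text to a boolean or integer where possible, else keep the string."""
    if value in TRUE_WORDS:
        return True
    if value in FALSE_WORDS:
        return False
    if re.fullmatch(r"[0-9]+", value):
        return int(value)
    return value


def parse_config(text):
    """Parse configuration text into a dict of name -> value (hypothetical helper)."""
    # Treat CR, LF and CRLF alike, then join lines continued with a backslash.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    text = text.replace("\\\n", "")
    items = {}
    for line in text.split("\n"):
        # Everything from '#' to the end of the line is a comment.
        line = line.split("#", 1)[0]
        # Fold runs of blanks and tabs into one blank and trim both ends.
        line = re.sub(r"[ \t]+", " ", line).strip()
        if not line:
            continue
        name, sep, value = line.partition("=")
        if not sep:
            raise ValueError("expected name=value: " + line)
        # White space on either side of '=' is ignored.
        items[name.strip()] = classify(value.strip())
    return items
```

+   <p>Given the line <tt>lang.name = Stacker</tt>, this sketch records the name
+   <tt>lang.name</tt> with the string value <tt>Stacker</tt>. A definition
+   split over two lines with a trailing backslash is joined before it is
+   parsed.</p>
+ </div>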
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="items">Configuration Items</a></div>
+ <div class="doc_text">
+   <p>The table below provides definitions of the allowed configuration items
+   that may appear in a configuration file. Every item has a default value and
+   does not need to appear in the configuration file. Missing items will have the 
+   default value. Each identifier may appear as all lower case, first letter
+   capitalized or all upper case.</p>
+   <table>
+     <tbody>
+       <tr>
+         <th>Name</th>
+         <th>Value Type</th>
+         <th>Description</th>
+         <th>Default</th>
+       </tr>
+       <tr><td colspan="4"><h4>LLVMC ITEMS</h4></td></tr>
+       <tr>
+         <td><b>version</b></td>
+         <td>string</td>
+         <td class="td_left">Provides the version string for the contents of this
+           configuration file. What is accepted as a legal configuration file
+           will change over time and this item tells <tt>llvmc</tt> which version
+           should be expected.</td>
+         <td><i>blank</i></td>
+       </tr>
+       <tr><td colspan="4"><h4>LANG ITEMS</h4></td></tr>
+       <tr>
+         <td><b>lang.name</b></td>
+         <td>string</td>
+         <td class="td_left">Provides the common name for a language definition. 
+           For example "C++", "Pascal", "FORTRAN", etc.</td>
+         <td><i>blank</i></td>
+       </tr>
+       <tr>
+         <td><b>lang.opt1</b></td>
+         <td>string</td>
+         <td class="td_left">Specifies the parameters to give the optimizer when
+           <tt>-O1</tt> is specified on the <tt>llvmc</tt> command line.</td>
+         <td><tt>-simplifycfg -instcombine -mem2reg</tt></td>
+       </tr>
+       <tr>
+         <td><b>lang.opt2</b></td>
+         <td>string</td>
+         <td class="td_left">Specifies the parameters to give the optimizer when
+           <tt>-O2</tt> is specified on the <tt>llvmc</tt> command line.</td>
+         <td><i>TBD</i></td>
+       </tr>
+       <tr>
+         <td><b>lang.opt3</b></td>
+         <td>string</td>
+         <td class="td_left">Specifies the parameters to give the optimizer when
+           <tt>-O3</tt> is specified on the <tt>llvmc</tt> command line.</td>
+         <td><i>TBD</i></td>
+       </tr>
+       <tr>
+         <td><b>lang.opt4</b></td>
+         <td>string</td>
+         <td class="td_left">Specifies the parameters to give the optimizer when
+           <tt>-O4</tt> is specified on the <tt>llvmc</tt> command line.</td>
+         <td><i>TBD</i></td>
+       </tr>
+       <tr>
+         <td><b>lang.opt5</b></td>
+         <td>string</td>
+         <td class="td_left">Specifies the parameters to give the optimizer when 
+           <tt>-O5</tt> is specified on the <tt>llvmc</tt> command line.</td>
+         <td><i>TBD</i></td>
+       </tr>
+       <tr><td colspan="4"><h4>PREPROCESSOR ITEMS</h4></td></tr>
+       <tr>
+         <td><b>preprocessor.command</b></td>
+         <td>command</td>
+         <td class="td_left">This provides the command prototype that will be used
+           to run the preprocessor.  This is generally only used with the 
+           <tt>-E</tt> option.</td>
+         <td><i>blank</i></td>
+       </tr>
+       <tr>
+         <td><b>preprocessor.required</b></td>
+         <td>boolean</td>
+         <td class="td_left">This item specifies whether the pre-processing phase
+           is required by the language. If the value is true, then the
+           <tt>preprocessor.command</tt> value must not be blank. With this option,
+           <tt>llvmc</tt> will always run the preprocessor as it assumes that the
+           translation and optimization phases don't know how to pre-process their
+           input.</td>
+         <td>false</td>
+       </tr>
+       <tr><td colspan="4"><h4>TRANSLATOR ITEMS</h4></td></tr>
+       <tr>
+         <td><b>translator.command</b></td>
+         <td>command</td>
+         <td class="td_left">This provides the command prototype that will be used 
+           to run the translator. Valid substitutions are <tt>%in%</tt> for the 
+           input file and <tt>%out%</tt> for the output file.</td>
+         <td><i>blank</i></td>
+       </tr>
+       <tr>
+         <td><b>translator.output</b></td>
+         <td><tt>bytecode</tt> or <tt>assembly</tt></td>
+         <td class="td_left">This item specifies the kind of output the language's 
+           translator generates.</td>
+         <td><tt>bytecode</tt></td>
+       </tr>
+       <tr>
+         <td><b>translator.preprocesses</b></td>
+         <td>boolean</td>
+         <td class="td_left">Indicates that the translator also preprocesses. If
+           this is true, then <tt>llvmc</tt> will skip the pre-processing phase
+           whenever the final phase is not pre-processing.</td>
+         <td><tt>false</tt></td>
+       </tr>
+       <tr><td colspan="4"><h4>OPTIMIZER ITEMS</h4></td></tr>
+       <tr>
+         <td><b>optimizer.command</b></td>
+         <td>command</td>
+         <td class="td_left">This provides the command prototype that will be used 
+           to run the optimizer. Valid substitutions are <tt>%in%</tt> for the 
+           input file and <tt>%out%</tt> for the output file.</td>
+         <td><i>blank</i></td>
+       </tr>
+       <tr>
+         <td><b>optimizer.output</b></td>
+         <td><tt>bytecode</tt> or <tt>assembly</tt></td>
+         <td class="td_left">This item specifies the kind of output the language's 
+           optimizer generates. Valid values are "assembly" and "bytecode".</td>
+         <td><tt>bytecode</tt></td>
+       </tr>
+       <tr>
+         <td><b>optimizer.preprocesses</b></td>
+         <td>boolean</td>
+         <td class="td_left">Indicates that the optimizer also preprocesses. If
+           this is true, then <tt>llvmc</tt> will skip the pre-processing phase
+           whenever the final phase is optimization or later.</td>
+         <td><tt>false</tt></td>
+       </tr>
+       <tr>
+         <td><b>optimizer.translates</b></td>
+         <td>boolean</td>
+         <td class="td_left">Indicates that the optimizer also translates. If
+           this is true, then <tt>llvmc</tt> will skip the translation phase
+           whenever the final phase is optimization or later.</td>
+         <td><tt>false</tt></td>
+       </tr>
+       <tr><td colspan="4"><h4>ASSEMBLER ITEMS</h4></td></tr>
+       <tr>
+         <td><b>assembler.command</b></td>
+         <td>command</td>
+         <td class="td_left">This provides the command prototype that will be used 
+           to run the assembler. Valid substitutions are <tt>%in%</tt> for the 
+           input file and <tt>%out%</tt> for the output file.</td>
+         <td><i>blank</i></td>
+       </tr>
+     </tbody>
+   </table>
+ </div>
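+ 
+ <div class="doc_text">
+   <p>The defaulting and capitalization rules described above can be sketched
+   as follows. This is an illustration only, not <tt>llvmc</tt>'s code: the
+   names <tt>DEFAULTS</tt>, <tt>canonical_name</tt> and <tt>lookup</tt> are
+   hypothetical, only the defaults stated in the table are listed, and items
+   whose default is blank or TBD are assumed to fall back to an empty
+   string.</p>

```python
# Defaults copied from the table; blank and TBD entries are left out.
DEFAULTS = {
    "lang.opt1": "-simplifycfg -instcombine -mem2reg",
    "preprocessor.required": False,
    "translator.output": "bytecode",
    "translator.preprocesses": False,
    "optimizer.output": "bytecode",
    "optimizer.preprocesses": False,
    "optimizer.translates": False,
}


def canonical_name(name):
    """Lower-case a dotted name, accepting only the three allowed spellings."""
    parts = []
    for ident in name.split("."):
        # Allowed: all lower case, first letter capitalized, or all upper case.
        if ident not in (ident.lower(), ident.capitalize(), ident.upper()):
            raise ValueError("unsupported capitalization: " + ident)
        parts.append(ident.lower())
    return ".".join(parts)


def lookup(items, name):
    """Return the configured value for name, or its default if it is missing."""
    canon = {canonical_name(key): value for key, value in items.items()}
    return canon.get(name, DEFAULTS.get(name, ""))
```

+   <p>With an empty configuration, <tt>lookup({}, "optimizer.output")</tt>
+   falls back to <tt>bytecode</tt>, while a file that spells the item
+   <tt>Translator.Output</tt> is still found under
+   <tt>translator.output</tt>.</p>
+ </div>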
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="substitutions">Substitutions</a></div>
+ <div class="doc_text">
+   <p>On any configuration item that ends in <tt>command</tt>, you must
+   specify substitution tokens.  Substitution tokens begin and end with a percent
+   sign (<tt>%</tt>) and are replaced by the corresponding text. Any substitution
+   token may be given on any <tt>command</tt> line but some are more useful than
+   others. In particular each command <em>should</em> have both an <tt>%in%</tt>
+   and an <tt>%out%</tt> substitution. The table below provides definitions of
+   each of the allowed substitution tokens.</p>
+   <table>
+     <tbody>
+       <tr>
+         <th>Substitution Token</th>
+         <th>Replacement Description</th>
+       </tr>
+       <tr>
+         <td><tt>%args%</tt></td>
+         <td class="td_left">Replaced with all the tool-specific arguments given
+           to <tt>llvmc</tt> via the <tt>-T</tt> set of options. This just allows
+           you to place these arguments in the correct place on the command line.
+           If the <tt>%args%</tt> option does not appear on your command line, 
+           then you are explicitly disallowing the <tt>-T</tt> option for your 
+           tool.
+         </td>
+       </tr>
+       <tr>
+         <td><tt>%force%</tt></td>
+         <td class="td_left">Replaced with the <tt>-f</tt> option if it was
+           specified on the <tt>llvmc</tt> command line. This is intended to tell
+           the compiler tool to force the overwrite of output files. 
+         </td>
+       </tr>
+       <tr>
+         <td><tt>%in%</tt></td>
+         <td class="td_left">Replaced with the full path of the input file. You
+           needn't worry about the cascading of file names. <tt>llvmc</tt> will
+           create temporary files and ensure that the output of one phase is the
+           input to the next phase.</td>
+       </tr>
+       <tr>
+         <td><tt>%opt%</tt></td>
+         <td class="td_left">Replaced with the optimization options for the
+           tool. If the tool understands the <tt>-O</tt> options then that will
+           be passed. Otherwise, the <tt>lang.optN</tt> series of configuration
+           items will specify which arguments are to be given.</td>
+       </tr>
+       <tr>
+         <td><tt>%out%</tt></td>
+         <td class="td_left">Replaced with the full path of the output file.
+           Note that this is not necessarily the output file specified with the
+           <tt>-o</tt> option on <tt>llvmc</tt>'s command line. It might be a
+           temporary file that will be passed to a subsequent phase's input.
+         </td>
+       </tr>
+       <tr>
+         <td><tt>%stats%</tt></td>
+         <td class="td_left">If your command accepts the <tt>-stats</tt> option,
+           use this substitution token. If the user requested <tt>-stats</tt> 
+           from the <tt>llvmc</tt> command line then this token will be replaced
+           with <tt>-stats</tt>, otherwise it will be ignored.
+         </td>
+       </tr>
+       <tr>
+         <td><tt>%target%</tt></td>
+         <td class="td_left">Replaced with the name of the target "machine" for 
+           which code should be generated. The value used here is taken from the
+           <tt>llvmc</tt> option <tt>-march</tt>.
+         </td>
+       </tr>
+       <tr>
+         <td><tt>%time%</tt></td>
+         <td class="td_left">If your command accepts the <tt>-time-passes</tt> 
+           option, use this substitution token. If the user requested 
+           <tt>-time-passes</tt> from the <tt>llvmc</tt> command line then this 
+           token will be replaced with <tt>-time-passes</tt>, otherwise it will 
+           be ignored.
+         </td>
+       </tr>
+     </tbody>
+   </table>
+ </div>
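+ 
+ <div class="doc_text">
+   <p>To make the substitution step concrete, here is a minimal Python sketch
+   of how a command prototype might be expanded into an argument list. It
+   assumes that a token always occupies a whole word and that a token with no
+   value simply disappears; <tt>expand_command</tt> and <tt>KNOWN_TOKENS</tt>
+   are hypothetical names, not part of <tt>llvmc</tt>.</p>

```python
import re

# The substitution tokens defined in the table above.
KNOWN_TOKENS = {"args", "force", "in", "opt", "out", "stats", "target", "time"}


def expand_command(prototype, values):
    """Expand %token% words in a command prototype into an argument list.

    values maps a token name (without the percent signs) to the list of
    arguments that should replace it; a missing token expands to nothing.
    """
    argv = []
    for word in prototype.split():
        match = re.fullmatch(r"%([a-z]+)%", word)
        if match is None:
            # An ordinary word is passed through unchanged.
            argv.append(word)
            continue
        token = match.group(1)
        if token not in KNOWN_TOKENS:
            raise ValueError("unknown substitution token: " + word)
        argv.extend(values.get(token, []))
    return argv
```

+   <p>Keeping each replacement as a list lets <tt>%opt%</tt> expand to several
+   optimizer arguments while <tt>%in%</tt> and <tt>%out%</tt> each remain a
+   single argument, even when a path contains blanks.</p>
+ </div>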
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="sample">Sample Config File</a></div>
+ <div class="doc_text">
+   <p>Since an example is always instructive, here's how the Stacker language
+   configuration file looks.</p>
+   <pre><tt>
+ # Stacker Configuration File For llvmc
+ 
+ ##########################################################
+ # Language definitions
+ ##########################################################
+   lang.name=Stacker 
+   lang.opt1=-simplifycfg -instcombine -mem2reg
+   lang.opt2=-simplifycfg -instcombine -mem2reg -load-vn \
+     -gcse -dse -scalarrepl -sccp 
+   lang.opt3=-simplifycfg -instcombine -mem2reg -load-vn \
+     -gcse -dse -scalarrepl -sccp -branch-combine -adce \
+     -globaldce -inline -licm 
+   lang.opt4=-simplifycfg -instcombine -mem2reg -load-vn \
+     -gcse -dse -scalarrepl -sccp -ipconstprop \
+     -branch-combine -adce -globaldce -inline -licm 
+   lang.opt5=-simplifycfg -instcombine -mem2reg -load-vn \
+     -gcse -dse -scalarrepl -sccp -ipconstprop \
+     -branch-combine -adce -globaldce -inline -licm \
+     -block-placement
+ 
+ ##########################################################
+ # Pre-processor definitions
+ ##########################################################
+ 
+   # Stacker doesn't have a preprocessor but the following
+   # allows the -E option to be supported
+   preprocessor.command=cp %in% %out%
+   preprocessor.required=false
+ 
+ ##########################################################
+ # Translator definitions
+ ##########################################################
+ 
+   # To compile stacker source, we just run the stacker
+   # compiler with a default stack size of 2048 entries.
+   translator.command=stkrc -s 2048 %in% -o %out% %time% \
+     %stats% %force% %args%
+ 
+   # stkrc doesn't preprocess but we set this to true so
+   # that we don't run the cp command by default.
+   translator.preprocesses=true
+ 
+   # The translator is required to run.
+   translator.required=true
+ 
+   # stkrc produces bytecode (it doesn't handle the -On options)
+   translator.output=bytecode
+ 
+ ##########################################################
+ # Optimizer definitions
+ ##########################################################
+   
+   # For optimization, we use the LLVM "opt" program
+   optimizer.command=opt %in% -o %out% %opt% %time% %stats% \
+     %force% %args%
+ 
+   optimizer.required = true
+ 
+   # opt doesn't translate
+   optimizer.translates = no
+ 
+   # opt doesn't preprocess
+   optimizer.preprocesses=no
+ 
+   # opt produces bytecode
+   optimizer.output = bytecode
+ 
+ ##########################################################
+ # Assembler definitions
+ ##########################################################
+   assembler.command=llc %in% -o %out% %target% %time% %stats%
+ </tt></pre>
+ </div> 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="glossary">Glossary</a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <p>This document uses precise terms in reference to the various artifacts and
+   concepts related to compilation. The terms used throughout this document are
+   defined below.</p>
+   <dl>
+     <dt><a name="def_assembly"><b>assembly</b></a></dt> 
+     <dd>A compilation <a href="#def_phase">phase</a> in which LLVM bytecode or 
+     LLVM assembly code is assembled to a native code format (either
+     target-specific assembly language or the platform's native object file format).
+     </dd>
+ 
+     <dt><a name="def_compiler"><b>compiler</b></a></dt>
+     <dd>Refers to any program that can be invoked by <tt>llvmc</tt> to accomplish 
+     the work of one or more compilation <a href="#def_phase">phases</a>.</dd>
+ 
+     <dt><a name="def_driver"><b>driver</b></a></dt>
+     <dd>Refers to <tt>llvmc</tt> itself.</dd>
+ 
+     <dt><a name="def_linking"><b>linking</b></a></dt>
+     <dd>A compilation <a href="#def_phase">phase</a> in which LLVM bytecode files 
+     and (optionally) native system libraries are combined to form a complete 
+     executable program.</dd>
+ 
+     <dt><a name="def_optimization"><b>optimization</b></a></dt>
+     <dd>A compilation <a href="#def_phase">phase</a> in which LLVM bytecode is 
+     optimized.</dd>
+ 
+     <dt><a name="def_phase"><b>phase</b></a></dt>
+     <dd>Refers to any one of the five compilation phases that 
+     <tt>llvmc</tt> supports. The five phases are:
+     <a href="#def_preprocessing">preprocessing</a>, 
+     <a href="#def_translation">translation</a>,
+     <a href="#def_optimization">optimization</a>,
+     <a href="#def_assembly">assembly</a>,
+     <a href="#def_linking">linking</a>.</dd>
+ 
+     <dt><a name="def_sourcelanguage"><b>source language</b></a></dt>
+     <dd>Any common programming language (e.g. C, C++, Java, Stacker, ML,
+     FORTRAN).  These languages are distinguished from any of the lower level
+     languages (such as LLVM or native assembly), by the fact that a 
+     <a href="#def_translation">translation</a> <a href="#def_phase">phase</a> 
+     is required before LLVM can be applied.</dd> 
+ 
+     <dt><a name="def_tool"><b>tool</b></a></dt>
+     <dd>Refers to any program in the LLVM tool set.</dd>
+ 
+     <dt><a name="def_translation"><b>translation</b></a></dt>
+     <dd>A compilation <a href="#def_phase">phase</a> in which 
+     <a href="#def_sourcelanguage">source language</a> code is translated into 
+     either LLVM assembly language or LLVM bytecode.</dd>
+   </dl>
+ </div>
+ <!-- *********************************************************************** -->
+ <hr>
+ <address> <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+  src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a><a
+  href="http://validator.w3.org/check/referer"><img
+  src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a><a
+  href="mailto:rspencer at x10sys.com">Reid Spencer</a><br>
+ <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+ Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ <!-- vim: sw=2
+ -->
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/CompilerWriterInfo.html
diff -c /dev/null llvm-www/releases/1.8/docs/CompilerWriterInfo.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/CompilerWriterInfo.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,260 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" 
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>Architecture/platform information for compiler writers</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ 
+ <body>
+ <div class="doc_title">
+   Architecture/platform information for compiler writers
+ </div>
+ 
+ <div class="doc_warning">
+   <p>Note: This document is a work-in-progress.  Additions and clarifications
+   are welcome.</p>
+ </div>
+ 
+ <ol>
+   <li><a href="#hw">Hardware</a>
+   <ol>
+     <li><a href="#alpha">Alpha</a></li>
+     <li><a href="#arm">ARM</a></li>
+     <li><a href="#ia64">Itanium</a></li>
+     <li><a href="#mips">MIPS</a></li>
+     <li><a href="#ppc">PowerPC</a></li>
+     <li><a href="#sparc">SPARC</a></li>
+     <li><a href="#x86">X86</a></li>
+     <li><a href="#other">Other lists</a></li>
+   </ol></li>
+   <li><a href="#abi">Application Binary Interface (ABI)</a>
+   <ol>
+     <li><a href="#linux">Linux</a></li>
+     <li><a href="#osx">OS X</a></li>
+   </ol></li>
+   <li><a href="#misc">Miscellaneous resources</a></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Compiled by <a href="http://misha.brukman.net">Misha Brukman</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="hw">Hardware</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="alpha">Alpha</a></div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a
+ href="http://ftp.digital.com/pub/Digital/info/semiconductor/literature/dsc-library.html">Alpha manuals</a> 
+ </li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="arm">ARM</a></div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a href="http://www.arm.com/documentation/">ARM documentation</a> 
+ (<a href="http://www.arm.com/documentation/ARMProcessor_Cores/">Processor
+ Cores</a>)</li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="ia64">Itanium (ia64)</a></div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a
+ href="http://developer.intel.com/design/itanium2/documentation.htm">Itanium documentation</a> 
+ </li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="mips">MIPS</a></div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a
+ href="http://mips.com/content/Documentation/MIPSDocumentation/ProcessorArchitecture/doclibrary">MIPS
+ Processor Architecture</a></li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="ppc">PowerPC</a></div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">IBM - Official manuals and docs</div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li><a
+ href="http://www-106.ibm.com/developerworks/eserver/articles/archguide.html">PowerPC
+ Architecture Book</a>
+ <ul>
+   <li>Book I: <a
+   href="http://www-106.ibm.com/developerworks/eserver/pdfs/archpub1.pdf">PowerPC
+   User Instruction Set Architecture</a></li>
+   <li>Book II: <a
+   href="http://www-106.ibm.com/developerworks/eserver/pdfs/archpub2.pdf">PowerPC
+   Virtual Environment Architecture</a></li>
+   <li>Book III: <a
+   href="http://www-106.ibm.com/developerworks/eserver/pdfs/archpub3.pdf">PowerPC
+   Operating Environment Architecture</a></li>
+ </ul></li>
+ <li><a
+ href="http://www-3.ibm.com/chips/techlib/techlib.nsf/techdocs/852569B20050FF7785256996007558C6">PowerPC
+ Compiler Writer's Guide</a></li>
+ <li><a
+ href="http://www-3.ibm.com/chips/techlib/techlib.nsf/products/PowerPC">PowerPC
+ Processor Manuals</a></li>
+ <li><a
+ href="http://www-106.ibm.com/developerworks/linux/library/l-powarch/">Intro to
+ PowerPC architecture</a></li>
+ <li><a href="http://publibn.boulder.ibm.com/doc_link/en_US/a_doc_lib/aixassem/alangref/alangreftfrm.htm">IBM AIX/5L for POWER Assembly reference</a></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Other documents, collections, notes</div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li><a href="http://penguinppc.org/dev/#library">PowerPC ABI documents</a></li>
+ <li><a href="http://gcc.gnu.org/ml/gcc-patches/2003-09/msg00997.html">PowerPC64
+ alignment of long doubles (from GCC)</a></li>
+ <li><a href="http://sources.redhat.com/ml/binutils/2002-04/msg00573.html">Long
+ branch stubs for powerpc64-linux (from binutils)</a></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="sparc">SPARC</a></div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li><a href="http://www.sparc.org/resource.htm">SPARC resources</a></li>
+ <li><a href="http://www.sparc.org/standards.html">SPARC standards</a></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="x86">X86</a></div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">AMD - Official manuals and docs</div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a
+ href="http://www.amd.com/us-en/Processors/TechnicalResources/0,,30_182_739,00.html">AMD processor manuals</a></li>
+ <li><a href="http://www.x86-64.org/documentation">X86-64 ABI</a></li>
+ </ul>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Intel - Official manuals and docs</div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a
+ href="http://developer.intel.com/design/pentium4/manuals/index_new.htm">IA-32
+ manuals</a></li>
+ <li><a
+ href="http://www.intel.com/design/itanium/documentation.htm?iid=ipp_srvr_proc_itanium2+techdocs">Intel
+ Itanium documentation</a></li>
+ </ul>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Other x86-specific information</div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li><a href="http://www.agner.org/assem/calling_conventions.pdf">Calling
+ conventions for different C++ compilers and operating systems</a></li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="other">Other relevant lists</a></div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li><a href="http://gcc.gnu.org/readings.html">GCC reading list</a></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="abi">ABI</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="linux">Linux</a></div>
+ 
+ <div class="doc_text">
+ <ol>
+ <li><a href="http://www.linuxbase.org/spec/ELF/ppc64/">PowerPC 64-bit ELF ABI
+ Supplement</a></li>
+ </ol>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="osx">OS X</a></div>
+ 
+ <div class="doc_text">
+ <ol>
+ <li><a
+ href="http://developer.apple.com/documentation/Darwin/RuntimeArchitecture-date.html">Mach-O
+ Runtime Architecture</a></li>
+ <li><a href="http://www.unsanity.org/archives/000044.php">Notes on Mach-O
+ ABI</a></li>
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="misc">Miscellaneous resources</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <ul>
+ <li><a
+ href="http://www.nondot.org/sabre/os/articles/ExecutableFileFormats/">Executable
+ File Format library</a></li>
+ <li><a href="http://gcc.gnu.org/projects/prefetch.html">GCC prefetch project</a>
+ page has a good survey of the prefetching capabilities of a variety of modern
+ processors.</li>
+ </ul>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="http://misha.brukman.net">Misha Brukman</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/ExtendingLLVM.html
diff -c /dev/null llvm-www/releases/1.8/docs/ExtendingLLVM.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/ExtendingLLVM.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,389 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Extending LLVM: Adding instructions, intrinsics, types, etc.</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ 
+ <body>
+ 
+ <div class="doc_title">
+   Extending LLVM: Adding instructions, intrinsics, types, etc.
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction and Warning</a></li>
+   <li><a href="#intrinsic">Adding a new intrinsic function</a></li>
+   <li><a href="#instruction">Adding a new instruction</a></li>
+   <li><a href="#sdnode">Adding a new SelectionDAG node</a></li>
+   <li><a href="#type">Adding a new type</a>
+   <ol>
+     <li><a href="#fund_type">Adding a new fundamental type</a></li>
+     <li><a href="#derived_type">Adding a new derived type</a></li>
+   </ol></li>
+ </ol>
+ 
+ <div class="doc_author">    
+   <p>Written by <a href="http://misha.brukman.net">Misha Brukman</a>,
+   Brad Jones, Nate Begeman,
+   and <a href="http://nondot.org/sabre">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction and Warning</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>During the course of using LLVM, you may wish to customize it for your
+ research project or for experimentation. At this point, you may realize that
+ you need to add something to LLVM, whether it be a new fundamental type, a new
+ intrinsic function, or a whole new instruction.</p>
+ 
+ <p>When you come to this realization, stop and think. Do you really need to
+ extend LLVM? Is it a new fundamental capability that LLVM does not support in
+ its current incarnation, or can it be synthesized from existing LLVM
+ elements? If you are not sure, ask on the <a
+ href="http://mail.cs.uiuc.edu/mailman/listinfo/llvmdev">LLVM-dev</a> list.
+ Extending LLVM gets involved because you need to update all the
+ different passes that you intend to use with your extension, and there are
+ <em>many</em> LLVM analyses and transformations, so it may be quite a bit of
+ work.</p>
+ 
+ <p>Adding an <a href="#intrinsic">intrinsic function</a> is far easier than
+ adding an instruction, and is transparent to optimization passes.  If your added
+ functionality can be expressed as a
+ function call, an intrinsic function is the method of choice for LLVM
+ extension.</p>
+ 
+ <p>Before you invest a significant amount of effort into a non-trivial
+ extension, <span class="doc_warning">ask on the list</span> if what you are
+ looking to do can be done with already-existing infrastructure, or if maybe
+ someone else is already working on it. You will save yourself a lot of time and
+ effort by doing so.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="intrinsic">Adding a new intrinsic function</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Adding a new intrinsic function to LLVM is much easier than adding a new
+ instruction.  Almost all extensions to LLVM should start as an intrinsic
+ function and then be turned into an instruction if warranted.</p>
+ 
+ <ol>
+ <li><tt>llvm/docs/LangRef.html</tt>:
+     Document the intrinsic.  Decide whether it is code generator specific and
+     what the restrictions are.  Talk to other people about it so that you are
+     sure it's a good idea.</li>
+ 
+ <li><tt>llvm/include/llvm/Intrinsics*.td</tt>:
+     Add an entry for your intrinsic.  Describe its memory access characteristics
+     for optimization (this controls whether it will be DCE'd, CSE'd, etc).</li>
+ 
+ <li><tt>llvm/lib/Analysis/ConstantFolding.cpp</tt>: If it is possible to 
+     constant fold your intrinsic, add support for it in the 
+     <tt>canConstantFoldCallTo</tt> and <tt>ConstantFoldCall</tt> functions.</li>
+ 
+ <li><tt>llvm/test/Regression/*</tt>: Add test cases for your intrinsic to the 
+     test suite.</li>
+ </ol>
+ 
+ <p>Once the intrinsic has been added to the system, you must add code generator
+ support for it.  Generally you must do the following steps:</p>
+ 
+ <dl>
+ <dt>Add support to the C backend in <tt>lib/Target/CBackend/</tt></dt>
+ 
+ <dd>Depending on the intrinsic, there are a few ways to implement this.
+ First, for most intrinsics, it makes sense to add code to lower your intrinsic
+ in <tt>LowerIntrinsicCall</tt> in <tt>lib/CodeGen/IntrinsicLowering.cpp</tt>.
+ Second, if it makes sense to lower the intrinsic to an expanded sequence of C 
+ code in all cases, just emit the expansion in <tt>visitCallInst</tt> in
+ <tt>Writer.cpp</tt>.  If the intrinsic can be expressed with GCC 
+ (or any other compiler) extensions, it can be conditionally supported based on 
+ the compiler compiling the CBE output (see <tt>llvm.prefetch</tt> for an 
+ example).  
+ Third, if the intrinsic really has no way to be lowered, just have the code 
+ generator emit code that prints an error message and calls abort if executed.
+ </dd>
+ 
+ <dt>Add support to the .td file for the target(s) of your choice in 
+    <tt>lib/Target/*/*.td</tt>.</dt>
+ 
+ <dd>This is usually a matter of adding a pattern to the .td file that matches
+     the intrinsic, though it may obviously require adding the instructions you
+     want to generate as well.  There are lots of examples in the PowerPC and X86
+     backends to follow.</dd>
+ </dl>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="sdnode">Adding a new SelectionDAG node</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>As with intrinsics, adding a new SelectionDAG node to LLVM is much easier
+ than adding a new instruction.  New nodes are often added to help represent
+ instructions common to many targets.  These nodes often map to an LLVM
+ instruction (add, sub) or intrinsic (byteswap, population count).  In other
+ cases, new nodes have been added to allow many targets to perform a common task
+ (converting between floating point and integer representation) or capture more
+ complicated behavior in a single node (rotate).</p>
+ 
+ <ol>
+ <li><tt>include/llvm/CodeGen/SelectionDAGNodes.h</tt>:
+     Add an enum value for the new SelectionDAG node.</li>
+ <li><tt>lib/CodeGen/SelectionDAG/SelectionDAG.cpp</tt>:
+     Add code to print the node to <tt>getOperationName</tt>.  If your new node
+     can be evaluated at compile time when given constant arguments (such as an
+     add of a constant with another constant), find the <tt>getNode</tt> method
+     that takes the appropriate number of arguments, and add a case for your node
+     to the switch statement that performs constant folding for nodes that take
+     the same number of arguments as your new node.</li>
+ <li><tt>lib/CodeGen/SelectionDAG/LegalizeDAG.cpp</tt>:
+     Add code to <a href="CodeGenerator.html#selectiondag_legalize">legalize, 
+     promote, and expand</a> the node as necessary.  At a minimum, you will need
+     to add a case statement for your node in <tt>LegalizeOp</tt> which calls
+     LegalizeOp on the node's operands, and returns a new node if any of the
+     operands changed as a result of being legalized.  It is likely that not all
+     targets supported by the SelectionDAG framework will natively support the
+     new node.  In this case, you must also add code in your node's case
+     statement in <tt>LegalizeOp</tt> to Expand your node into simpler, legal
+     operations.  The case for <tt>ISD::UREM</tt> for expanding a remainder into
+     a divide, multiply, and a subtract is a good example.</li>
+ <li><tt>lib/CodeGen/SelectionDAG/LegalizeDAG.cpp</tt>:
+     If targets may support the new node being added only at certain sizes, you 
+     will also need to add code to your node's case statement in 
+     <tt>LegalizeOp</tt> to Promote your node's operands to a larger size, and 
+     perform the correct operation.  You will also need to add code to 
+     <tt>PromoteOp</tt> to do this as well.  For a good example, see 
+     <tt>ISD::BSWAP</tt>,
+     which promotes its operand to a wider size, performs the byteswap, and then
+     shifts the correct bytes right to emulate the narrower byteswap in the
+     wider type.</li>
+ <li><tt>lib/CodeGen/SelectionDAG/LegalizeDAG.cpp</tt>:
+     Add a case for your node in <tt>ExpandOp</tt> to teach the legalizer how to
+     perform the action represented by the new node on a value that has been
+     split into high and low halves.  This case will be used to support your 
+     node with a 64-bit operand on a 32-bit target.</li>
+ <li><tt>lib/CodeGen/SelectionDAG/DAGCombiner.cpp</tt>:
+     If your node can be combined with itself, or other existing nodes in a 
+     peephole-like fashion, add a visit function for it, and call that function
+     from <tt></tt>.  There are several good examples for simple combines you
+     can do; <tt>visitFABS</tt> and <tt>visitSRL</tt> are good starting places.
+     </li>
+ <li><tt>lib/Target/PowerPC/PPCISelLowering.cpp</tt>:
+     Each target has an implementation of the <tt>TargetLowering</tt> class,
+     usually in its own file (although some targets include it in the same
+     file as the DAGToDAGISel).  The default behavior for a target is to
+     assume that your new node is legal for all types that are legal for
+     that target.  If this target does not natively support your node, then
+     tell the target to either Promote it (if it is supported at a larger
+     type) or Expand it.  This will cause the code you wrote in 
+     <tt>LegalizeOp</tt> above to decompose your new node into other legal
+     nodes for this target.</li>
+ <li><tt>lib/Target/TargetSelectionDAG.td</tt>:
+     Most current targets supported by LLVM generate code using the DAGToDAG
+     method, where SelectionDAG nodes are pattern matched to target-specific
+     nodes, which represent individual instructions.  In order for the targets
+     to match an instruction to your new node, you must add a def for that node
+     to the list in this file, with the appropriate type constraints. Look at
+     <tt>add</tt>, <tt>bswap</tt>, and <tt>fadd</tt> for examples.</li>
+ <li><tt>lib/Target/PowerPC/PPCInstrInfo.td</tt>:
+     Each target has a tablegen file that describes the target's instruction
+     set.  For targets that use the DAGToDAG instruction selection framework,
+     add a pattern for your new node that uses one or more target nodes.
+     Documentation for this is a bit sparse right now, but there are several
+     decent examples.  See the patterns for <tt>rotl</tt> in 
+     <tt>PPCInstrInfo.td</tt>.</li>
+ <li>TODO: document complex patterns.</li>
+ <li><tt>llvm/test/Regression/CodeGen/*</tt>: Add test cases for your new node
+     to the test suite.  <tt>llvm/test/Regression/CodeGen/X86/bswap.ll</tt> is
+     a good example.</li>
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="instruction">Adding a new instruction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p><span class="doc_warning">WARNING: adding instructions changes the bytecode
+ format, and it will take some effort to maintain compatibility with
+ the previous version.</span> Only add an instruction if it is absolutely
+ necessary.</p>
+ 
+ <ol>
+ 
+ <li><tt>llvm/include/llvm/Instruction.def</tt>:
+     add a number for your instruction and an enum name</li>
+ 
+ <li><tt>llvm/include/llvm/Instructions.h</tt>:
+     add a definition for the class that will represent your instruction</li>
+ 
+ <li><tt>llvm/include/llvm/Support/InstVisitor.h</tt>:
+     add a prototype for a visitor to your new instruction type</li>
+ 
+ <li><tt>llvm/lib/AsmParser/Lexer.l</tt>:
+     add a new token to parse your instruction from an assembly text file</li>
+ 
+ <li><tt>llvm/lib/AsmParser/llvmAsmParser.y</tt>:
+     add the grammar on how your instruction can be read and what it will
+     construct as a result</li>
+ 
+ <li><tt>llvm/lib/Bytecode/Reader/Reader.cpp</tt>:
+     add a case for your instruction and how it will be parsed from bytecode</li>
+ 
+ <li><tt>llvm/lib/VMCore/Instruction.cpp</tt>:
+     add a case for how your instruction will be printed out to assembly</li>
+ 
+ <li><tt>llvm/lib/VMCore/Instructions.cpp</tt>:
+     implement the class you defined in
+     <tt>llvm/include/llvm/Instructions.h</tt></li>
+ 
+ <li>Test your instruction</li>
+ 
+ <li><tt>llvm/lib/Target/*</tt>: 
+     Add support for your instruction to code generators, or add a lowering
+     pass.</li>
+ 
+ <li><tt>llvm/test/Regression/*</tt>: add your test cases to the test suite.</li>
+ 
+ </ol>
+ 
+ <p>Also, you need to implement (or modify) any analyses or passes that you want
+ to understand this new instruction.</p>
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="type">Adding a new type</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p><span class="doc_warning">WARNING: adding new types changes the bytecode
+ format, and will break compatibility with currently-existing LLVM
+ installations.</span> Only add new types if it is absolutely necessary.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="fund_type">Adding a fundamental type</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ol>
+ 
+ <li><tt>llvm/include/llvm/Type.h</tt>:
+     add enum for the new type; add static <tt>Type*</tt> for this type</li>
+ 
+ <li><tt>llvm/lib/VMCore/Type.cpp</tt>:
+     add mapping from <tt>TypeID</tt> => <tt>Type*</tt>;
+     initialize the static <tt>Type*</tt></li>
+ 
+ <li><tt>llvm/lib/AsmParser/Lexer.l</tt>:
+     add ability to parse in the type from text assembly</li>
+ 
+ <li><tt>llvm/lib/AsmParser/llvmAsmParser.y</tt>:
+     add a token for that type</li>
+ 
+ </ol>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="derived_type">Adding a derived type</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ol>
+ <li><tt>llvm/include/llvm/Type.h</tt>:
+     add enum for the new type; add a forward declaration of the type
+     also</li>
+ 
+ <li><tt>llvm/include/llvm/DerivedTypes.h</tt>:
+     add a new class to represent the new type in the hierarchy; add forward 
+     declaration to the TypeMap value type</li>
+ 
+ <li><tt>llvm/lib/VMCore/Type.cpp</tt>:
+     add support for derived type to: 
+ <div class="doc_code">
+ <pre>
+ std::string getTypeDescription(const Type &amp;Ty,
+   std::vector&lt;const Type*&gt; &amp;TypeStack)
+ bool TypesEqual(const Type *Ty, const Type *Ty2,
+   std::map&lt;const Type*, const Type*&gt; &amp; EqTypes)
+ </pre>
+ </div>
+     add necessary member functions for type, and factory methods</li>
+ 
+ <li><tt>llvm/lib/AsmParser/Lexer.l</tt>:
+     add ability to parse in the type from text assembly</li>
+ 
+ <li><tt>llvm/lib/Bytecode/Writer/Writer.cpp</tt>:
+     modify <tt>void BytecodeWriter::outputType(const Type *T)</tt> to serialize
+     your type</li>
+ 
+ <li><tt>llvm/lib/Bytecode/Reader/Reader.cpp</tt>:
+     modify <tt>const Type *BytecodeReader::ParseType()</tt> to read your data
+     type</li> 
+ 
+ <li><tt>llvm/lib/VMCore/AsmWriter.cpp</tt>:
+     modify
+ <div class="doc_code">
+ <pre>
+ void calcTypeName(const Type *Ty,
+                   std::vector&lt;const Type*&gt; &amp;TypeStack,
+                   std::map&lt;const Type*,std::string&gt; &amp;TypeNames,
+                   std::string &amp; Result)
+ </pre>
+ </div>
+     to output the new derived type
+ </li>  
+  
+ 
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a>
+   <br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/FAQ.html
diff -c /dev/null llvm-www/releases/1.8/docs/FAQ.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/FAQ.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,678 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>LLVM: Frequently Asked Questions</title>
+   <style type="text/css">
+     @import url("llvm.css");
+     .question { font-weight: bold }
+     .answer   { margin-left: 2em  }
+   </style>
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   LLVM: Frequently Asked Questions
+ </div>
+ 
+ <ol>
+   <li><a href="#license">License</a>
+   <ol>
+   <li>Why are the LLVM source code and the front-end distributed under different
+   licenses?</li>
+   <li>Does the University of Illinois Open Source License really qualify as an
+   "open source" license?</li>
+   <li>Can I modify LLVM source code and redistribute the modified source?</li>
+   <li>Can I modify LLVM source code and redistribute binaries or other tools
+   based on it, without redistributing the source?</li>
+   </ol></li>
+ 
+   <li><a href="#source">Source code</a>
+   <ol>
+   <li>In what language is LLVM written?</li>
+   <li>How portable is the LLVM source code?</li>
+   </ol></li>
+ 
+   <li><a href="#build">Build Problems</a>
+   <ol>
+   <li>When I run configure, it finds the wrong C compiler.</li>
+   <li>I compile the code, and I get some error about <tt>/localhome</tt>.</li>
+   <li>The <tt>configure</tt> script finds the right C compiler, but it uses the
+   LLVM linker from a previous build.  What do I do?</li>
+   <li>When creating a dynamic library, I get a strange GLIBC error.</li>
+   <li>I've updated my source tree from CVS, and now my build is trying to use a
+   file/directory that doesn't exist.</li>
+   <li>I've modified a Makefile in my source tree, but my build tree keeps using
+   the old version.  What do I do?</li>
+   <li>I've upgraded to a new version of LLVM, and I get strange build
+   errors.</li>
+   <li>I've built LLVM and am testing it, but the tests freeze.</li>
+   <li>Why do test results differ when I perform different types of builds?</li>
+   <li>Compiling LLVM with GCC 3.3.2 fails, what should I do?</li>
+   <li>When I use the test suite, all of the C Backend tests fail.  What is
+       wrong?</li>
+   <li>After CVS update, rebuilding gives the error "No rule to make
+   target".</li>
+   </ol></li>
+ 
+   <li><a href="#felangs">Source Languages</a>
+   <ol>
+     <li><a href="#langs">What source languages are supported?</a></li>
+     <li><a href="#langhlsupp">What support is there for higher level source
+       language constructs for building a compiler?</a></li>
+   </ol></li>
+ 
+   <li><a href="#cfe">Using the GCC Front End</a>
+   <ol>
+     <li>
+     When I compile software that uses a configure script, the configure script
+     thinks my system has all of the header files and libraries it is testing
+     for.  How do I get configure to work correctly?
+     </li>
+ 
+     <li>
+     When I compile code using the LLVM GCC front end, it complains that it
+     cannot find libcrtend.a.
+     </li>
+ 
+     <li>
+     How can I disable all optimizations when compiling code using the LLVM GCC front end?
+     </li>
+ 
+     <li><a href="#translatec++">Can I use LLVM to convert C++ code to C code?</a></li>
+ 
+   </ol>
+   </li>
+ 
+   <li><a href="#cfe_code">Questions about code generated by the GCC front-end</a>
+   <ol>
+      <li><a href="#__main">What is this <tt>__main()</tt> call that gets inserted into
+          <tt>main()</tt>?</a></li>
+      <li><a href="#iosinit">What is this <tt>llvm.global_ctors</tt> and
+           <tt>_GLOBAL__I__tmp_webcompile...</tt> stuff that happens when I
+           #include &lt;iostream&gt;?</a></li>
+      <li><a href="#codedce">Where did all of my code go??</a></li>
+      <li><a href="#undef">What is this "<tt>undef</tt>" thing that shows up in my code?</a></li>
+   </ol>
+   </li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="http://llvm.org">The LLVM Team</a></p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="license">License</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="question">
+ <p>Why are the LLVM source code and the front-end distributed under different
+ licenses?</p>
+ </div>
+ 	
+ <div class="answer">
+ <p>The C/C++ front-ends are based on GCC and must be distributed under the GPL.
+ Our aim is to distribute LLVM source code under a <em>much less restrictive</em>
+ license, in particular one that does not compel users who distribute tools based
+ on modifying the source to redistribute the modified source code as well.</p>
+ </div>
+ 
+ <div class="question">
+ <p>Does the University of Illinois Open Source License really qualify as an
+ "open source" license?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>Yes, the license is <a
+ href="http://www.opensource.org/licenses/UoI-NCSA.php">certified</a> by the Open
+ Source Initiative (OSI).</p>
+ </div>
+ 
+ <div class="question">
+ <p>Can I modify LLVM source code and redistribute the modified source?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>Yes.  The modified source distribution must retain the copyright notice and
+ follow the three bulleted conditions listed in the <a
+ href="http://llvm.org/releases/1.3/LICENSE.TXT">LLVM license</a>.</p>
+ </div>
+ 
+ <div class="question">
+ <p>Can I modify LLVM source code and redistribute binaries or other tools based
+ on it, without redistributing the source?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>Yes, this is why we distribute LLVM under a less restrictive license than
+ GPL, as explained in the first question above.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="source">Source Code</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="question">
+ <p>In what language is LLVM written?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>All of the LLVM tools and libraries are written in C++ with extensive use of
+ the STL.</p>
+ </div>
+ 
+ <div class="question">
+ <p>How portable is the LLVM source code?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>The LLVM source code should be portable to most modern UNIX-like operating
+ systems.  Most of the code is written in standard C++ with operating system
+ services abstracted to a support library.  The tools required to build and test
+ LLVM have been ported to a plethora of platforms.</p>
+ 
+ <p>Some porting problems may exist in the following areas:</p>
+ 
+ <ul>
+ 
+   <li>The GCC front end code is not as portable as the LLVM suite, so it may not
+   compile as well on unsupported platforms.</li>
+ 
+   <li>The LLVM build system relies heavily on UNIX shell tools, like the Bourne
+   Shell and sed.  Porting to systems without these tools (MacOS 9, Plan 9) will
+   require more effort.</li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="build">Build Problems</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="question">
+ <p>When I run configure, it finds the wrong C compiler.</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>The <tt>configure</tt> script attempts to locate first <tt>gcc</tt> and then
+ <tt>cc</tt>, unless it finds compiler paths set in <tt>CC</tt> and <tt>CXX</tt>
+ for the C and C++ compiler, respectively.</p>
+ 
+ <p>If <tt>configure</tt> finds the wrong compiler, either adjust your
+ <tt>PATH</tt> environment variable or set <tt>CC</tt> and <tt>CXX</tt>
+ explicitly.</p>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>I compile the code, and I get some error about <tt>/localhome</tt>.</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>There are several possible causes for this.  The first is that you didn't set
+ a pathname properly when using <tt>configure</tt>, and it defaulted to a
+ pathname that we use on our research machines.</p>
+ 
+ <p>Another possibility is that we hardcoded a path in our Makefiles.  If you see
+ this, please email the LLVM bug mailing list with the name of the offending
+ Makefile and a description of what is wrong with it.</p>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>The <tt>configure</tt> script finds the right C compiler, but it uses the
+ LLVM linker from a previous build.  What do I do?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>The <tt>configure</tt> script uses the <tt>PATH</tt> to find executables, so
+ if it's grabbing the wrong linker/assembler/etc, there are two ways to fix
+ it:</p>
+ 
+ <ol>
+ 		
+   <li><p>Adjust your <tt>PATH</tt> environment variable so that the correct
+   program appears first in the <tt>PATH</tt>.  This may work, but it may be
+   inconvenient if you want other programs <i>first</i> in your path for other
+   work.</p></li>
+ 
+   <li><p>Run <tt>configure</tt> with an alternative <tt>PATH</tt> that is
+   correct. In a Bourne-compatible shell, the syntax would be:</p>
+ 		
+       <p><tt>PATH=[the path without the bad program] ./configure ...</tt></p>
+ 
+       <p>This is still somewhat inconvenient, but it allows <tt>configure</tt>
+       to do its work without having to adjust your <tt>PATH</tt>
+       permanently.</p></li>
+ 	
+ </ol>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>When creating a dynamic library, I get a strange GLIBC error.</p>
+ </div>
+ 
+ <div class="answer">
+ <p>Under some operating systems (e.g. Linux), libtool does not work correctly if
+ GCC was compiled with the --disable-shared option.  To work around this, install
+ your own version of GCC that has shared libraries enabled by default.</p>
+ </div>
+ 
+ <div class="question">
+ <p>I've updated my source tree from CVS, and now my build is trying to use a
+ file/directory that doesn't exist.</p>
+ </div>
+ 
+ <div class="answer">
+ <p>You need to re-run configure in your object directory.  When new Makefiles
+ are added to the source tree, they have to be copied over to the object tree in
+ order to be used by the build.</p>
+ </div>
+ 
+ <div class="question">
+ <p>I've modified a Makefile in my source tree, but my build tree keeps using the
+ old version.  What do I do?</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>If the Makefile already exists in your object tree, you
+ can just run the following command in the top level directory of your object
+ tree:</p>
+ 
+ <p><tt>./config.status &lt;relative path to Makefile&gt;</tt></p>
+ 
+ <p>If the Makefile is new, you will have to modify the configure script to copy
+ it over.</p>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>I've upgraded to a new version of LLVM, and I get strange build errors.</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>Sometimes, changes to the LLVM source code alter how the build system works.
+ Changes in libtool, autoconf, or header file dependencies are especially prone
+ to this sort of problem.</p>
+ 
+ <p>The best thing to try is to remove the old files and re-build.  In most
+ cases, this takes care of the problem.  To do this, just type <tt>make
+ clean</tt> and then <tt>make</tt> in the directory that fails to build.</p>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>I've built LLVM and am testing it, but the tests freeze.</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>This is most likely occurring because you built a profile or release
+ (optimized) build of LLVM and have not specified the same information on the
+ <tt>gmake</tt> command line.</p>
+ 
+ <p>For example, if you built LLVM with the command:</p>
+ 
+ <p><tt>gmake ENABLE_PROFILING=1</tt></p>
+ 
+ <p>...then you must run the tests with the following commands:</p>
+ 
+ <p><tt>cd llvm/test<br>gmake ENABLE_PROFILING=1</tt></p>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>Why do test results differ when I perform different types of builds?</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>The LLVM test suite is dependent upon several features of the LLVM tools and
+ libraries.</p>
+ 
+ <p>First, the debugging assertions in code are not enabled in optimized or
+ profiling builds.  Hence, tests that used to fail may pass.</p>
+ 	
+ <p>Second, some tests may rely upon debugging options or behavior that is only
+ available in the debug build.  These tests will fail in an optimized or profile
+ build.</p>
+ 
+ </div>
+ 
+ <div class="question">
+ <p>Compiling LLVM with GCC 3.3.2 fails, what should I do?</p>
+ </div>
+ 
+ <div class="answer">
+ <p>This is <a href="http://gcc.gnu.org/PR?13392">a bug in GCC</a>, and 
+    affects projects other than LLVM.  Try upgrading or downgrading your GCC.</p>
+ </div>
+ 
+ <div class="question">
+ <p>After CVS update, rebuilding gives the error "No rule to make target".</p>
+ </div>
+ 
+ <div class="answer">
+ <p>The error is usually of the form:</p>
+ 
+ <div class="doc_code">
+ <tt>
+ gmake[2]: *** No rule to make target `/path/to/somefile', needed by
+ `/path/to/another/file.d'.<br>
+ Stop.
+ </tt>
+ </div>
+ 
+ <p>This may occur anytime files are moved within the CVS repository or removed
+ entirely.  In this case, the best solution is to erase all <tt>.d</tt> files,
+ which list dependencies for source files, and rebuild:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ % cd $LLVM_OBJ_DIR
+ % rm -f `find . -name \*\.d` 
+ % gmake 
+ </pre>
+ </div>
+ 
+ <p>In other cases, it may be necessary to run <tt>make clean</tt> before
+ rebuilding.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="felangs">Source Languages</a></div>
+ 
+ <div class="question"><p>
+   <a name="langs">What source languages are supported?</a></p>
+ </div>
+ <div class="answer">
+   <p>LLVM currently has full support for C and C++ source languages. These are
+   available through a special version of GCC that LLVM calls the 
+   <a href="#cfe">C Front End</a>.</p>
+   <p>There is an incomplete version of a Java front end available in the
+   <tt>llvm-java</tt> CVS repository. There is no documentation on this yet so
+   you'll need to download the code, compile it, and try it.</p>
+   <p>In the <tt>examples/BFtoLLVM</tt> directory is a translator for the 
+   BrainF*** language (2002 Language Specification).</p>
+   <p>In the <tt>projects/Stacker</tt> directory is a compiler and runtime
+   library for the Stacker language, a "toy" language loosely based on Forth.</p>
+   <p>The PyPy developers are working on integrating LLVM into the PyPy backend
+   so that the PyPy language can be translated to LLVM.</p>
+ </div>
+ <div class="question"><p>
+   <a name="langhlsupp">What support is there for higher level source language
+   constructs for building a compiler?</a></p>
+ </div>
+ <div class="answer">
+   <p>Currently, there isn't much. LLVM supports an intermediate representation
+   which is useful for code representation but does not support the high level
+   (abstract syntax tree) representation needed by most compilers. There are no
+   facilities for lexical or semantic analysis. There is, however, a <i>mostly
+     implemented</i> configuration-driven 
+   <a href="CompilerDriver.html">compiler driver</a> which simplifies the task
+   of running optimizations, linking, and executable generation.</p>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="cfe">Using the GCC Front End</a>
+ </div>
+ 
+ <div class="question">
+ <p>
+ When I compile software that uses a configure script, the configure script
+ thinks my system has all of the header files and libraries it is testing for.
+ How do I get configure to work correctly?
+ </p>
+ </div>
+ 
+ <div class="answer">
+ <p>
+ The configure script is getting things wrong because the LLVM linker allows
+ symbols to be undefined at link time (so that they can be resolved during JIT
+ or translation to the C back end).  That is why configure thinks your system
+ "has everything."
+ </p>
+ <p>
+ To work around this, perform the following steps:
+ </p>
+ 
+ <ol>
+   <li>
+   Make sure the CC and CXX environment variables contain the full path to the
+   LLVM GCC front end.
+   </li>
+ 
+   <li>
+   Make sure that the regular C compiler is first in your PATH.
+   </li>
+ 
+   <li>
+   Add the string "-Wl,-native" to your CFLAGS environment variable.
+   </li>
+ </ol>
+ 
+ <p>
+ This will allow the gccld linker to create a native code executable instead of
+ a shell script that runs the JIT.  Creating native code requires standard
+ linkage, which in turn lets the configure script detect when code fails to
+ link because a feature is not available on your system.
+ </p>
+ </div>
+ 
+ <div class="question">
+ <p>
+ When I compile code using the LLVM GCC front end, it complains that it cannot
+ find libcrtend.a.
+ </p>
+ </div>
+ 
+ <div class="answer">
+ <p>
+ The only way this can happen is if you haven't installed the runtime library. To
+ correct this, do:</p>
+ <pre>
+   % cd llvm/runtime
+   % make clean ; make install-bytecode
+ </pre>
+ </div>
+ 
+ <div class="question">
+ <p>
+ How can I disable all optimizations when compiling code using the LLVM GCC front end?
+ </p>
+ </div>
+ 
+ <div class="answer">
+ <p>
+ Passing "-Wa,-disable-opt -Wl,-disable-opt" will disable <em>all</em> cleanup and
+ optimizations done at the LLVM level, leaving you with the truly horrible
+ code that you desire.
+ </p>
+ </div>
+ 
+ 
+ <div class="question">
+ <p>
+ <a name="translatec++">Can I use LLVM to convert C++ code to C code?</a>
+ </p>
+ </div>
+ 
+ <div class="answer">
+ <p>Yes, you can use LLVM to convert code from any language LLVM supports to C.
+ Note that the generated C code will be very low level (all loops are lowered
+ to gotos, etc) and not very pretty (comments are stripped, original source
+ formatting is totally lost, variables are renamed, expressions are regrouped), 
+ so this may not be what you're looking for.  However, this is a good way to add
+ C++ support for a processor that does not otherwise have a C++ compiler.
+ </p>
+ 
+ <p>Use commands like this:</p>
+ 
+ <ol>
+ <li><p>Compile your program as normal with llvm-g++:</p>
+ 
+ <div class="doc_code"><pre>$ llvm-g++ x.cpp -o program</pre></div>
+ 
+ <p>or:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ $ llvm-g++ a.cpp -c
+ $ llvm-g++ b.cpp -c
+ $ llvm-g++ a.o b.o -o program
+ </pre>
+ </div>
+ 
+ <p>With llvm-gcc3, this will generate program and program.bc.  The .bc file is 
+ the LLVM version of the program all linked together.</p></li>
+ 
+ <li><p>Convert the LLVM code to C code, using the LLC tool with the C
+ backend:</p>
+ 
+ <div class="doc_code"><pre>$ llc -march=c program.bc -o program.c</pre></div></li>
+ 
+ <li><p>Finally, compile the C file:</p>
+ 
+ <div class="doc_code"><pre>$ cc program.c</pre></div></li>
+ 
+ </ol>
+ 
+ <p>Note that, by default, the C backend does not support exception handling.
+ If you want/need it for a certain program, you can enable it by passing
+ "-enable-correct-eh-support" to the llc program.  The resultant code will
+ use setjmp/longjmp to implement exception support that is correct but
+ relatively slow.
+ </p>
+ </div>
+ 
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="cfe_code">Questions about code generated by the GCC front-end</a>
+ </div>
+ 
+ <div class="question"><p>
+ <a name="__main"></a>
+ What is this <tt>__main()</tt> call that gets inserted into <tt>main()</tt>?
+ </p></div>
+ 
+ <div class="answer">
+ <p>
+ The <tt>__main</tt> call is inserted by the C/C++ compiler in order to guarantee
+ that static constructors and destructors are called when the program starts up
+ and shuts down.  In C, you can create static constructors and destructors by
+ using GCC extensions, and in C++ you can do so by creating a global variable
+ whose class has a ctor or dtor.
+ </p>
+ 
+ <p>
+ The actual implementation of <tt>__main</tt> lives in the
+ <tt>llvm/runtime/GCCLibraries/crtend/</tt> directory in the source-base, and is
+ linked in automatically when you link the program.
+ </p>
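+ <p>As a minimal sketch of the GCC extensions mentioned above (this is not code
+ from the LLVM runtime), the following C fragment uses the <tt>constructor</tt>
+ and <tt>destructor</tt> function attributes.  The names <tt>on_startup</tt>,
+ <tt>on_shutdown</tt> and <tt>constructor_has_run</tt> are hypothetical and
+ exist only for illustration.</p>

```c
#include <assert.h>

/* Hypothetical flag, used only to observe that the constructor ran. */
static int startup_ran = 0;

/* GCC extension: this function runs before main() is entered. */
__attribute__((constructor))
static void on_startup(void) {
    startup_ran = 1;
}

/* GCC extension: this function runs after main() returns or exit() is
 * called.  It only checks that startup happened first. */
__attribute__((destructor))
static void on_shutdown(void) {
    assert(startup_ran == 1);
}

/* Returns nonzero once the static constructor has executed. */
int constructor_has_run(void) {
    return startup_ran;
}
```

+ <p>A program that calls <tt>constructor_has_run()</tt> from <tt>main</tt>
+ should see a nonzero result, because the attribute asks the compiler to
+ arrange for <tt>on_startup</tt> to run first.</p>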
+ </div>
+ 
+ <!--=========================================================================-->
+ 
+ <div class="question">
+ <a name="iosinit"></a>
+ <p> What is this <tt>llvm.global_ctors</tt> and
+ <tt>_GLOBAL__I__tmp_webcompile...</tt> stuff that happens when I #include
+ &lt;iostream&gt;?</p>
+ </div>
+ 
+ <div class="answer">
+ 
+ <p>If you #include the &lt;iostream&gt; header into a C++ translation unit, the
+ file will probably use the <tt>std::cin</tt>/<tt>std::cout</tt>/... global
+ objects.  However, C++ does not guarantee an order of initialization between
+ static objects in different translation units, so if a static ctor/dtor in your
+ .cpp file used <tt>std::cout</tt>, for example, the object would not necessarily
+ be automatically initialized before your use.</p>
+ 
+ <p>To make <tt>std::cout</tt> and friends work correctly in these scenarios, the
+ C++ standard library that we use declares a static object that gets created in
+ every translation unit that includes <tt>&lt;iostream&gt;</tt>.  This object has
+ a static constructor and destructor that initialize and destroy the global
+ iostream objects before they could possibly be used in the file.  The code that
+ you see in the .ll file corresponds to the constructor and destructor
+ registration code.
+ </p>
+ 
+ <p>If you would like to make it easier to <b>understand</b> the LLVM code
+ generated by the compiler in the demo page, consider using <tt>printf()</tt>
+ instead of <tt>iostream</tt>s to print values.</p>
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ 
+ <div class="question"><p>
+ <a name="codedce"></a>
+ Where did all of my code go??
+ </p></div>
+ 
+ <div class="answer">
+ <p>
+ If you are using the LLVM demo page, you may often wonder what happened to all
+ of the code that you typed in.  Remember that the demo script is running the
+ code through the LLVM optimizers, so if your code doesn't actually do anything
+ useful, it might all be deleted.
+ </p>
+ 
+ <p>
+ To prevent this, make sure that the code is actually needed.  For example, if
+ you are computing some expression, return the value from the function instead of
+ leaving it in a local variable.  If you really want to constrain the optimizer,
+ you can read from and assign to <tt>volatile</tt> global variables.
+ </p>
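+ <p>The three situations described above can be sketched in C as follows.  This
+ is an illustrative fragment, not output from the demo page, and the function
+ names and the <tt>sink</tt> variable are hypothetical.</p>

```c
#include <assert.h>

/* The result is never observed, so an optimizer is free to delete the
 * whole loop and leave an empty function. */
void discarded_work(void) {
    int i;
    int sum = 0;
    for (i = 1; i <= 10; ++i)
        sum += i * i;
}

/* Returning the value makes the computation observable to the caller,
 * so it must be preserved (or folded to an equivalent result). */
int sum_of_squares(int n) {
    int i;
    int sum = 0;
    assert(n >= 0);
    for (i = 1; i <= n; ++i)
        sum += i * i;
    return sum;
}

/* Each access to a volatile object has to be performed as written. */
volatile int sink;

void store_each_square(int n) {
    int i;
    for (i = 1; i <= n; ++i)
        sink = i * i;
}
```

+ <p>Of the three functions, only <tt>discarded_work</tt> is likely to come out
+ of the optimizer with an empty body.</p>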
+ </div>
+ 
+ <!--=========================================================================-->
+ 
+ <div class="question"><p>
+ <a name="undef"></a>
+ What is this "<tt>undef</tt>" thing that shows up in my code?
+ </p></div>
+ 
+ <div class="answer">
+ <p>
+ <a href="LangRef.html#undef"><tt>undef</tt></a> is the LLVM way of representing
+ a value that is not defined.  You can get these if you do not initialize a 
+ variable before you use it.  For example, the C function:</p>
+ 
+ <div class="doc_code">
+   <tt>int X() { int i; return i; }</tt>
+ </div>
+ 
+ <p>is compiled to "<tt>ret int undef</tt>" because "i" never has a value 
+ specified for it.
+ </p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/GarbageCollection.html
diff -c /dev/null llvm-www/releases/1.8/docs/GarbageCollection.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/GarbageCollection.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,533 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Accurate Garbage Collection with LLVM</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   Accurate Garbage Collection with LLVM
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a>
+     <ul>
+     <li><a href="#feature">GC features provided and algorithms supported</a></li>
+     </ul>
+   </li>
+ 
+   <li><a href="#interfaces">Interfaces for user programs</a>
+     <ul>
+     <li><a href="#roots">Identifying GC roots on the stack: <tt>llvm.gcroot</tt></a></li>
+     <li><a href="#allocate">Allocating memory from the GC</a></li>
+     <li><a href="#barriers">Reading and writing references to the heap</a></li>
+     <li><a href="#initialize">Garbage collector startup and initialization</a></li>
+     <li><a href="#explicit">Explicit invocation of the garbage collector</a></li>
+     </ul>
+   </li>
+ 
+   <li><a href="#gcimpl">Implementing a garbage collector</a>
+     <ul>
+     <li><a href="#llvm_gc_readwrite">Implementing <tt>llvm_gc_read</tt> and <tt>llvm_gc_write</tt></a></li>
+     <li><a href="#callbacks">Callback functions used to implement the garbage collector</a></li>
+     </ul>
+   </li>
+   <li><a href="#gcimpls">GC implementations available</a>
+     <ul>
+     <li><a href="#semispace">SemiSpace - A simple copying garbage collector</a></li>
+     </ul>
+   </li>
+ 
+ <!--
+   <li><a href="#codegen">Implementing GC support in a code generator</a></li>
+ -->
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Garbage collection is a widely used technique that frees the programmer from
+ having to know the life-times of heap objects, making software easier to produce
+ and maintain.  Many programming languages rely on garbage collection for
+ automatic memory management.  There are two primary forms of garbage collection:
+ conservative and accurate.</p>
+ 
+ <p>Conservative garbage collection often does not require any special support
+ from either the language or the compiler: it can handle non-type-safe
+ programming languages (such as C/C++) and does not require any special
+ information from the compiler.  The Boehm collector is an example of a
+ state-of-the-art conservative collector.</p>
+ 
+ <p>Accurate garbage collection requires the ability to identify all pointers in
+ the program at run-time (which requires that the source-language be type-safe in
+ most cases).  Identifying pointers at run-time requires compiler support to
+ locate all places that hold live pointer variables at run-time, including the
+ <a href="#roots">processor stack and registers</a>.</p>
+ 
+ <p>
+ Conservative garbage collection is attractive because it does not require any
+ special compiler support, but it does have problems.  In particular, because the
+ conservative garbage collector cannot <i>know</i> that a particular word in the
+ machine is a pointer, it cannot move live objects in the heap (preventing the
+ use of compacting and generational GC algorithms) and it can occasionally suffer
+ from memory leaks due to integer values that happen to point to objects in the
+ program.  In addition, some aggressive compiler transformations can break
+ conservative garbage collectors (though these seem rare in practice).
+ </p>
+ 
+ <p>
+ Accurate garbage collectors do not suffer from any of these problems, but they
+ can suffer from degraded scalar optimization of the program.  In particular,
+ because the runtime must be able to identify and update all pointers active in
+ the program, some optimizations are less effective.  In practice, however, the
+ locality and performance benefits of using aggressive garbage collection
+ techniques dominate any low-level losses.
+ </p>
+ 
+ <p>
+ This document describes the mechanisms and interfaces provided by LLVM to
+ support accurate garbage collection.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="feature">GC features provided and algorithms supported</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ LLVM provides support for a broad class of garbage collection algorithms,
+ including compacting semi-space collectors, mark-sweep collectors, generational
+ collectors, and even reference counting implementations.  It includes support
+ for <a href="#barriers">read and write barriers</a>, and associating <a
+ href="#roots">meta-data with stack objects</a> (used for tagless garbage
+ collection).  All LLVM code generators support garbage collection, including the
+ C backend.
+ </p>
+ 
+ <p>
+ We hope that the primitive support built into LLVM is sufficient to support a
+ broad class of garbage collected languages, including Scheme, ML, scripting
+ languages, Java, C#, etc.  That said, the implemented garbage collectors may
+ need to be extended to support language-specific features such as finalization,
+ weak references, or other features.  As these needs are identified and
+ implemented, they should be added to this specification.
+ </p>
+ 
+ <p>
+ LLVM does not currently support garbage collection of multi-threaded programs or
+ GC-safe points other than function calls, but these will be added in the future
+ as there is interest.
+ </p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="interfaces">Interfaces for user programs</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section describes the interfaces provided by LLVM and by the garbage
+ collector run-time that should be used by user programs.  As such, this is the
+ interface that front-end authors should generate code for.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="roots">Identifying GC roots on the stack: <tt>llvm.gcroot</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <div class="doc_code"><tt>
+   void %llvm.gcroot(&lt;ty&gt;** %ptrloc, &lt;ty2&gt;* %metadata)
+ </tt></div>
+ 
+ <p>
+ The <tt>llvm.gcroot</tt> intrinsic is used to inform LLVM of a pointer variable
+ on the stack.  The first argument contains the address of the variable on the
+ stack, and the second contains a pointer to metadata that should be associated
+ with the pointer (which <b>must</b> be a constant or global value address).  At
+ runtime, the <tt>llvm.gcroot</tt> intrinsic stores a null pointer into the
+ specified location to initialize the pointer.</p>
+ 
+ <p>
+ Consider the following fragment of Java code:
+ </p>
+ 
+ <pre>
+        {
+          Object X;   // A null-initialized reference to an object
+          ...
+        }
+ </pre>
+ 
+ <p>
+ This block (which may be located in the middle of a function or in a loop nest)
+ could be compiled to this LLVM code:
+ </p>
+ 
+ <pre>
+ Entry:
+    ;; In the entry block for the function, allocate the
+    ;; stack space for X, which is an LLVM pointer.
+    %X = alloca %Object*
+    ...
+ 
+    ;; "CodeBlock" is the block corresponding to the start
+    ;;  of the scope above.
+ CodeBlock:
+    ;; Initialize the object, telling LLVM that it is now live.
+    ;; Java has type-tags on objects, so it doesn't need any
+    ;; metadata.
+    call void %llvm.gcroot(%Object** %X, sbyte* null)
+    ...
+ 
+    ;; As the pointer goes out of scope, store a null value into
+    ;; it, to indicate that the value is no longer live.
+    store %Object* null, %Object** %X
+    ...
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="allocate">Allocating memory from the GC</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <div class="doc_code"><tt>
+   sbyte *%llvm_gc_allocate(unsigned %Size)
+ </tt></div>
+ 
+ <p>The <tt>llvm_gc_allocate</tt> function is a global function defined by the
+ garbage collector implementation to allocate memory.  It returns a
+ zeroed-out block of memory of the appropriate size.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="barriers">Reading and writing references to the heap</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <div class="doc_code"><tt>
+   sbyte *%llvm.gcread(sbyte *, sbyte **)<br>
+   void %llvm.gcwrite(sbyte*, sbyte*, sbyte**)
+ </tt></div>
+ 
+ <p>Several of the more interesting garbage collectors (e.g., generational
+ collectors) need to be informed when the mutator (the program that needs garbage
+ collection) reads or writes object references into the heap.  In the case of a
+ generational collector, it needs to keep track of which "old" generation objects
+ have references stored into them.  The amount of code that needs to be
+ executed is usually quite small (and not on the critical path of any 
+ computation), so the overall performance impact of the inserted code is 
+ tolerable.</p>
+ 
+ <p>To support garbage collectors that use read or write barriers, LLVM provides
+ the <tt>llvm.gcread</tt> and <tt>llvm.gcwrite</tt> intrinsics.  The first
+ intrinsic has exactly the same semantics as a non-volatile LLVM load and the
+ second has the same semantics as a non-volatile LLVM store, with the
+ addition that each also takes a pointer to the start of the memory
+ object as an argument.  At code generation
+ time, these intrinsics are replaced with calls into the garbage collector
+ (<tt><a href="#llvm_gc_readwrite">llvm_gc_read</a></tt> and <tt><a
+ href="#llvm_gc_readwrite">llvm_gc_write</a></tt> respectively), which are then
+ inlined into the code.
+ </p>
+ 
+ <p>
+ If you are writing a front-end for a garbage collected language, every load or
+ store of a reference from or to the heap should use these intrinsics instead of
+ normal LLVM loads/stores.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="initialize">Garbage collector startup and initialization</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <div class="doc_code"><tt>
+   void %llvm_gc_initialize(unsigned %InitialHeapSize)
+ </tt></div>
+ 
+ <p>
+ The <tt>llvm_gc_initialize</tt> function should be called once before any other
+ garbage collection functions are called.  This gives the garbage collector the
+ chance to initialize itself and allocate the heap spaces.  The initial heap size
+ to allocate should be specified as an argument.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="explicit">Explicit invocation of the garbage collector</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <div class="doc_code"><tt>
+   void %llvm_gc_collect()
+ </tt></div>
+ 
+ <p>
+ The <tt>llvm_gc_collect</tt> function is exported by the garbage collector
+ implementations to provide a full collection, even when the heap is not
+ exhausted.  This can be used by end-user code as a hint, and may be ignored by
+ the garbage collector.
+ </p>
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="gcimpl">Implementing a garbage collector</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>
+ Implementing a garbage collector for LLVM is fairly straight-forward.  The LLVM
+ garbage collectors are provided in a form that makes them easy to link into the
+ language-specific runtime that a language front-end would use.  They require
+ functionality from the language-specific runtime to get information about <a
+ href="#gcdescriptors">where pointers are located in heap objects</a>.
+ </p>
+ 
+ <p>The
+ implementation must include the <a
+ href="#allocate"><tt>llvm_gc_allocate</tt></a> and <a
+ href="#explicit"><tt>llvm_gc_collect</tt></a> functions, and it must implement
+ the <a href="#llvm_gc_readwrite">read/write barrier</a> functions as well.  To
+ do this, it will probably have to <a href="#traceroots">trace through the roots
+ from the stack</a> and understand the <a href="#gcdescriptors">GC descriptors
+ for heap objects</a>.  Luckily, there are some <a href="#gcimpls">example
+ implementations</a> available.
+ </p>
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="llvm_gc_readwrite">Implementing <tt>llvm_gc_read</tt> and <tt>llvm_gc_write</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+   <div class="doc_code"><tt>
+     void *llvm_gc_read(void*, void **)<br>
+     void llvm_gc_write(void*, void *, void**)
+  </tt></div>
+ 
+ <p>
+ These functions <i>must</i> be implemented in every garbage collector, even if
+ they do not need read/write barriers.  In this case, just load or store the
+ pointer, then return.
+ </p>
+ 
+ <p>
+ If an actual read or write barrier is needed, it should be straight-forward to
+ implement it.
+ </p>
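+ <p>For a collector that needs no barriers, the "just load or store the pointer"
+ case might look like the sketch below.  The document gives only the argument
+ types, so the parameter names and their order (value to store, start of the
+ object, address of the field) are assumptions.</p>

```c
#include <assert.h>
#include <stddef.h>

/* Read a reference out of a heap object.  ObjPtr (assumed to be the start
 * of the object) is ignored because no read barrier is needed. */
void *llvm_gc_read(void *ObjPtr, void **FieldPtr) {
    (void)ObjPtr;
    assert(FieldPtr != NULL);
    return *FieldPtr;
}

/* Store a reference into a heap object.  Value is assumed to be the
 * reference being stored and FieldPtr the location receiving it. */
void llvm_gc_write(void *Value, void *ObjPtr, void **FieldPtr) {
    (void)ObjPtr;
    assert(FieldPtr != NULL);
    *FieldPtr = Value;
}
```

+ <p>A generational collector would presumably put its bookkeeping in
+ <tt>llvm_gc_write</tt>, before or after the store, using the object pointer
+ that this version ignores.</p>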
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="callbacks">Callback functions used to implement the garbage collector</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ Garbage collector implementations make use of call-back functions that are
+ implemented by other parts of the LLVM system.
+ </p>
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="traceroots">Tracing GC pointers from the program stack</a>
+ </div>
+ 
+ <div class="doc_text">
+   <div class="doc_code"><tt>
+      void llvm_cg_walk_gcroots(void (*FP)(void **Root, void *Meta));
+   </tt></div>
+ 
+ <p>
+ The <tt>llvm_cg_walk_gcroots</tt> function is a function provided by the code
+ generator that iterates through all of the GC roots on the stack, calling the
+ specified function pointer with each record.  For each GC root, the address of
+ the pointer and the meta-data (from the <a
+ href="#roots"><tt>llvm.gcroot</tt></a> intrinsic) are provided.
+ </p>
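+ <p>As a rough illustration of the callback style, the sketch below counts the
+ non-null roots.  Because the real <tt>llvm_cg_walk_gcroots</tt> is supplied by
+ the code generator, the example substitutes a hypothetical
+ <tt>mock_walk_gcroots</tt> with the same signature that walks a fixed array
+ (<tt>mock_stack_slots</tt>, also hypothetical).</p>

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical collector state: number of non-null roots seen so far. */
static size_t live_root_count;

/* Callback with the signature that the root walker expects. */
static void count_live_root(void **Root, void *Meta) {
    (void)Meta;               /* unused here: objects are assumed tagged */
    assert(Root != NULL);
    if (*Root != NULL)
        ++live_root_count;
}

/* Stand-in for the stack: three pointer slots that act as GC roots. */
void *mock_stack_slots[3];

/* Stand-in for llvm_cg_walk_gcroots: visits each slot once and passes
 * null metadata. */
static void mock_walk_gcroots(void (*FP)(void **Root, void *Meta)) {
    size_t i;
    for (i = 0; i < 3; ++i)
        FP(&mock_stack_slots[i], NULL);
}

/* Walks all roots and returns how many currently hold a reference. */
size_t count_live_roots(void) {
    live_root_count = 0;
    mock_walk_gcroots(count_live_root);
    return live_root_count;
}
```

+ <p>A real collector would do its marking or copying inside the callback
+ instead of counting, and could update <tt>*Root</tt> when an object moves.</p>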
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="staticroots">Tracing GC pointers from static roots</a>
+ </div>
+ 
+ <div class="doc_text">
+ TODO
+ </div>
+ 
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="gcdescriptors">Tracing GC pointers from heap objects</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ The three most common ways to keep track of where pointers live in heap objects
+ are (listed in order of space overhead required):</p>
+ 
+ <ol>
+ <li>In languages with polymorphic objects, pointers from an object header are
+ usually used to identify the GC pointers in the heap object.  This is common for
+ object-oriented languages like Self, Smalltalk, Java, or C#.</li>
+ 
+ <li>If heap objects are not polymorphic, often the "shape" of the heap can be
+ determined from the roots of the heap or from some other meta-data [<a
+ href="#appel89">Appel89</a>, <a href="#goldberg91">Goldberg91</a>, <a
+ href="#tolmach94">Tolmach94</a>].  In this case, the garbage collector can
+ propagate the information around from meta data stored with the roots.  This
+ often eliminates the need to have a header on objects in the heap.  This is
+ common in the ML family.</li>
+ 
+ <li>If all heap objects have pointers in the same locations, or pointers can be
+ distinguished just by looking at them (e.g., the low order bit is clear), no
+ book-keeping is needed at all.  This is common for Lisp-like languages.</li>
+ </ol>
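+ <p>The third style can be sketched with a one-bit tag.  The encoding below
+ (low bit clear for pointers, low bit set for small integers) follows the
+ example in the list; the type and function names are hypothetical, and the
+ sketch assumes heap objects are aligned to at least two bytes.</p>

```c
#include <assert.h>
#include <stdint.h>

/* One machine word that holds either a pointer or a small integer. */
typedef uintptr_t word_t;

/* Pointers are assumed to have a clear low-order bit. */
int word_is_pointer(word_t w) {
    return (w & 1u) == 0;
}

/* Encode a non-negative integer by shifting it left and setting the tag. */
word_t box_integer(uintptr_t n) {
    assert(n <= (UINTPTR_MAX >> 1));   /* must fit in the remaining bits */
    return (n << 1) | 1u;
}

/* Decode a word previously produced by box_integer. */
uintptr_t unbox_integer(word_t w) {
    assert(!word_is_pointer(w));
    return w >> 1;
}
```

+ <p>With this encoding a collector can decide whether to trace a word by looking
+ at the word alone, without consulting a header or any root meta-data.</p>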
+ 
+ <p>The LLVM garbage collectors are capable of supporting all of these styles of
+ language, including ones that mix various implementations.  To do this, LLVM
+ allows the source-language to associate meta-data with the <a
+ href="#roots">stack roots</a>, and the heap tracing routines can propagate the
+ information.  In addition, LLVM allows the front-end to extract GC information
+ in any form from a specific object pointer (this supports situations #1 and
+ #3).
+ </p>
+ 
+ <p><b>Making this efficient</b></p>
+ 
+ 
+ 
+ </div>
+ 
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="gcimpls">GC implementations available</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>
+ To make this more concrete, the currently implemented LLVM garbage collectors
+ all live in the <tt>llvm/runtime/GC/*</tt> directories in the LLVM source-base.
+ If you are interested in implementing an algorithm, there are many interesting
+ possibilities (mark/sweep, a generational collector, a reference counting
+ collector, etc), or you could choose to improve one of the existing algorithms.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="semispace">SemiSpace - A simple copying garbage collector</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ SemiSpace is a very simple copying collector.  When it starts up, it allocates
+ two blocks of memory for the heap.  It uses a simple bump-pointer allocator to
+ allocate memory from the first block until it runs out of space.  When it runs
+ out of space, it traces through all of the roots of the program, copying blocks
+ to the other half of the memory space.
+ </p>
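+ <p>The bump-pointer allocation step can be sketched as follows.  This is not
+ the SemiSpace source: the heap size, the 8-byte rounding and the names
+ <tt>SPACE_SIZE</tt>, <tt>space</tt> and <tt>bump_allocate</tt> are assumptions,
+ and the copying phase is reduced to a comment.</p>

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define SPACE_SIZE 1024   /* assumed size of one half of the heap, in bytes */

/* One half of the heap.  The extra union members only force an alignment
 * that is suitable for ordinary objects. */
static union {
    unsigned char bytes[SPACE_SIZE];
    double align_double;
    void *align_pointer;
    long align_long;
} space;

/* Offset of the first unused byte in the space. */
static size_t next_free;

/* Returns a zeroed block of at least `size` bytes, or NULL when the space
 * is exhausted.  A real collector would copy the live objects into the
 * other half at that point and retry the allocation. */
void *bump_allocate(size_t size) {
    size_t rounded = (size + 7u) & ~(size_t)7u;   /* round up to 8 bytes */
    void *block;

    assert(next_free <= SPACE_SIZE);
    if (rounded < size || rounded > SPACE_SIZE - next_free)
        return NULL;

    block = &space.bytes[next_free];
    next_free += rounded;                         /* bump the pointer */
    memset(block, 0, rounded);
    return block;
}
```

+ <p>Allocation is a bounds check and an addition, which is the main attraction
+ of this design; its cost is that half of the reserved memory is idle between
+ collections.</p>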
+ 
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection">
+   Possible Improvements
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ If a collection cycle happens and the heap is not compacted very much (say less
+ than 25% of the allocated memory was freed), the memory regions should be
+ doubled in size.</p>
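+ <p>One way to express that heuristic is sketched below.  It reads "allocated
+ memory" as the number of bytes in use just before the collection, which is an
+ assumption, and <tt>should_double_heap</tt> is a hypothetical name.</p>

```c
#include <assert.h>
#include <stddef.h>

/* Returns 1 when a collection freed less than 25% of the bytes that were
 * in use before it started, and 0 otherwise. */
int should_double_heap(size_t used_before, size_t used_after) {
    size_t freed;
    size_t quarter_rounded_up;

    assert(used_after <= used_before);
    freed = used_before - used_after;

    /* freed < used_before / 4 in exact arithmetic, computed without
     * floating point and without risking overflow in 4 * freed. */
    quarter_rounded_up = used_before / 4 + (used_before % 4 != 0);
    return freed < quarter_rounded_up;
}
```

+ <p>A collector could call this at the end of each cycle and allocate larger
+ regions before resuming the program when it returns 1.</p>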
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="references">References</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p><a name="appel89">[Appel89]</a> Runtime Tags Aren't Necessary. Andrew
+ W. Appel. Lisp and Symbolic Computation 19(7):703-705, July 1989.</p>
+ 
+ <p><a name="goldberg91">[Goldberg91]</a> Tag-free garbage collection for
+ strongly typed programming languages.  Benjamin Goldberg. ACM SIGPLAN
+ PLDI'91.</p>
+ 
+ <p><a name="tolmach94">[Tolmach94]</a> Tag-free garbage collection using
+ explicit type parameters.  Andrew Tolmach.  Proceedings of the 1994 ACM
+ conference on LISP and functional programming.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/GettingStarted.html
diff -c /dev/null llvm-www/releases/1.8/docs/GettingStarted.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/GettingStarted.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1601 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>Getting Started with the LLVM System</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   Getting Started with the LLVM System  
+ </div>
+ 
+ <ul>
+   <li><a href="#overview">Overview</a>
+   <li><a href="#quickstart">Getting Started Quickly (A Summary)</a>
+   <li><a href="#requirements">Requirements</a>
+     <ol>
+       <li><a href="#hardware">Hardware</a>
+       <li><a href="#software">Software</a>
+       <li><a href="#brokengcc">Broken versions of GCC</a>
+     </ol></li>
+ 
+   <li><a href="#starting">Getting Started with LLVM</a>
+     <ol>
+       <li><a href="#terminology">Terminology and Notation</a>
+       <li><a href="#environment">Setting Up Your Environment</a>
+       <li><a href="#unpack">Unpacking the LLVM Archives</a>
+       <li><a href="#checkout">Checkout LLVM from CVS</a>
+       <li><a href="#installcf">Install the GCC Front End</a>
+       <li><a href="#config">Local LLVM Configuration</a>
+       <li><a href="#compile">Compiling the LLVM Suite Source Code</a>
+       <li><a href="#cross-compile">Cross-Compiling LLVM</a>
+       <li><a href="#objfiles">The Location of LLVM Object Files</a>
+       <li><a href="#optionalconfig">Optional Configuration Items</a>
+     </ol></li>
+ 
+   <li><a href="#layout">Program layout</a>
+     <ol>
+       <li><a href="#cvsdir"><tt>CVS</tt> directories</a>
+       <li><a href="#examples"><tt>llvm/examples</tt></a>
+       <li><a href="#include"><tt>llvm/include</tt></a>
+       <li><a href="#lib"><tt>llvm/lib</tt></a>
+       <li><a href="#projects"><tt>llvm/projects</tt></a>
+       <li><a href="#runtime"><tt>llvm/runtime</tt></a>  
+       <li><a href="#test"><tt>llvm/test</tt></a>
+       <li><a href="#llvmtest"><tt>llvm-test</tt></a>
+       <li><a href="#tools"><tt>llvm/tools</tt></a>  
+       <li><a href="#utils"><tt>llvm/utils</tt></a>
+       <li><a href="#win32"><tt>llvm/win32</tt></a>
+     </ol></li>
+ 
+   <li><a href="#tutorial">An Example Using the LLVM Tool Chain</a>
+   <li><a href="#problems">Common Problems</a>
+   <li><a href="#links">Links</a>
+ </ul>
+ 
+ <div class="doc_author">
+   <p>Written by: 
+     <a href="mailto:criswell at uiuc.edu">John Criswell</a>, 
+     <a href="mailto:sabre at nondot.org">Chris Lattner</a>,
+     <a href="http://misha.brukman.net">Misha Brukman</a>, 
+     <a href="http://www.cs.uiuc.edu/~vadve">Vikram Adve</a>, and
+     <a href="mailto:gshi1 at uiuc.edu">Guochun Shi</a>.
+   </p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="overview"><b>Overview</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Welcome to LLVM! In order to get started, you first need to know some
+ basic information.</p>
+ 
+ <p>First, LLVM comes in two pieces. The first piece is the LLVM suite. This
+ contains all of the tools, libraries, and header files needed to use the low
+ level virtual machine.  It contains an assembler, disassembler, bytecode
+ analyzer, and bytecode optimizer.  It also contains a test suite that can be
+ used to test the LLVM tools and the GCC front end.</p>
+ 
+ <p>The second piece is the GCC front end.  This component provides a version of
+ GCC that compiles C and C++ code into LLVM bytecode.  Currently, the GCC front
+ end is a modified version of GCC 3.4 (we track the GCC 3.4 development).  Once
+ compiled into LLVM bytecode, a program can be manipulated with the LLVM tools
+ from the LLVM suite.</p>
+ 
+ <p>
+ There is a third, optional piece called llvm-test.  It is a suite of programs
+ with a testing harness that can be used to further test LLVM's functionality
+ and performance.
+ </p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="quickstart"><b>Getting Started Quickly (A Summary)</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Here's the short story for getting up and running quickly with LLVM:</p>
+ 
+ <ol>
+   <li>Read the documentation.</li>
+   <li>Read the documentation.</li>
+   <li>Remember that you were warned twice about reading the documentation.</li>
+   <li>Install the GCC front end if you intend to compile C or C++:
+     <ol>
+       <li><tt>cd <i>where-you-want-the-C-front-end-to-live</i></tt></li>
+       <li><tt>gunzip --stdout cfrontend.<i>platform</i>.tar.gz | tar -xvf -</tt>
+       </li>
+       <li><tt>cd cfrontend/<i>platform</i><br>
+         ./fixheaders</tt></li>
+       <li>Add the cfrontend's "bin" directory to your PATH variable.</li>
+     </ol></li>
+ 
+   <li>Get the LLVM Source Code
+   <ul>
+     <li>With the distributed files (or use <a href="#checkout">CVS</a>):
+     <ol>
+       <li><tt>cd <i>where-you-want-llvm-to-live</i></tt>
+       <li><tt>gunzip --stdout llvm-<i>version</i>.tar.gz | tar -xvf -</tt>
+     </ol></li>
+ 
+   </ul></li>
+ 
+   <li><b>[Optional]</b> Get the Test Suite Source Code 
+   <ul>
+     <li>With the distributed files (or use <a href="#checkout">CVS</a>):
+     <ol>
+       <li><tt>cd <i>where-you-want-llvm-to-live</i></tt>
+       <li><tt>cd llvm/projects</tt>
+       <li><tt>gunzip --stdout llvm-test-<i>version</i>.tar.gz | tar -xvf -</tt>
+     </ol></li>
+ 
+   </ul></li>
+ 
+ 
+   <li>Configure the LLVM Build Environment
+   <ol>
+     <li><tt>cd <i>where-you-want-to-build-llvm</i></tt></li>
+     <li><tt><i>/path/to/llvm/</i>configure [options]</tt><br>
+     Some common options:
+ 
+       <ul>
+         <li><tt>--prefix=<i>directory</i></tt>
+         <p>Specify for <i>directory</i> the full pathname of where you
+         want the LLVM tools and libraries to be installed (default
+         <tt>/usr/local</tt>).</p></li>
+         <li><tt>--with-llvmgccdir=<i>directory</i></tt>
+         <p>Optionally, specify for <i>directory</i> the full pathname of the 
+         C/C++ front end installation to use with this LLVM configuration. If
+         not specified, the PATH will be searched.</p></li>
+         <li><tt>--enable-spec2000=<i>directory</i></tt>
+             <p>Enable the SPEC2000 benchmarks for testing.  The SPEC2000
+             benchmarks should be available in
+             <tt><i>directory</i></tt>.</p></li>
+       </ul>
+   </ol></li>
+ 
+   <li>Build the LLVM Suite:
+   <ol>
+       <li><tt>gmake -k |&amp; tee gnumake.out
+          # this is csh or tcsh syntax</tt></li>
+       <li>If you get an "internal compiler error (ICE)" see <a href="#brokengcc">below</a>.</li>
+   </ol>
+ 
+ </ol>
+ 
+ <p>Consult the <a href="#starting">Getting Started with LLVM</a> section for
+ detailed information on configuring and compiling LLVM.  See <a
+ href="#environment">Setting Up Your Environment</a> for tips that simplify
+ working with the GCC front end and LLVM tools.  Go to <a href="#layout">Program
+ Layout</a> to learn about the layout of the source code tree.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="requirements"><b>Requirements</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Before you begin to use the LLVM system, review the requirements given below.
+ Knowing ahead of time what hardware and software you will need may save you
+ some trouble.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="hardware"><b>Hardware</b></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM is known to work on the following platforms:</p>
+ 
+ <table cellpadding="3">
+ <tr>
+   <th>OS</th>
+   <th>Arch</th>
+   <th>Compilers</th>
+ </tr>
+ <tr>
+   <td>Linux</td>
+   <td>x86<sup><a href="#pf_1">1</a></sup></td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>Solaris</td>
+   <td>V9 (Ultrasparc)</td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>FreeBSD</td>
+   <td>x86<sup><a href="#pf_1">1</a></sup></td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>MacOS X<sup><a href="#pf_2">2</a></sup></td>
+   <td>PowerPC</td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>MacOS X<sup><a href="#pf_2">2</a></sup></td>
+   <td>x86</td>
+   <td>GCC</td>
+ 
+ </tr>
+ <tr>
+   <td>Cygwin/Win32</td>
+   <td>x86<sup><a href="#pf_1">1</a>,<a href="#pf_8">8</a></sup></td>
+   <td>GCC 3.4.X, binutils 2.15</td>
+ </tr>
+ <tr>
+   <td>MinGW/Win32</td>
+   <td>x86<sup><a href="#pf_1">1</a>,<a href="#pf_6">6</a>,<a href="#pf_8">8</a></sup></td>
+   <td>GCC 3.4.X, binutils 2.15</td>
+ </tr>
+ <tr>
+   <td>Linux</td>
+   <td>amd64<sup><a href="#pf_3">3</a></sup></td>
+   <td>GCC</td>
+ </tr>
+ </table>
+ 
+ <p>LLVM has partial support for the following platforms:</p>
+ 
+ <table>
+ <tr>
+   <th>OS</th>
+   <th>Arch</th>
+   <th>Compilers</th>
+ </tr>
+ <tr>
+   <td>Windows</td>
+   <td>x86<sup><a href="#pf_1">1</a></sup></td>
+   <td>Visual Studio .NET<sup><a href="#pf_4">4</a>,<a href="#pf_5">5</a></sup></td>
+ </tr>
+ <tr>
+   <td>AIX<sup><a href="#pf_3">3</a>,<a href="#pf_4">4</a></sup></td>
+   <td>PowerPC</td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>Linux<sup><a href="#pf_3">3</a>,<a href="#pf_5">5</a></sup></td>
+   <td>PowerPC</td>
+   <td>GCC</td>
+ </tr>
+ 
+ <tr>
+   <td>Linux<sup><a href="#pf_7">7</a></sup></td>
+   <td>Alpha</td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>Linux<sup><a href="#pf_7">7</a></sup></td>
+   <td>Itanium (IA-64)</td>
+   <td>GCC</td>
+ </tr>
+ <tr>
+   <td>HP-UX<sup><a href="#pf_7">7</a></sup></td>
+   <td>Itanium (IA-64)</td>
+   <td>HP aCC</td>
+ </tr>
+ </table>
+ 
+ <p><b>Notes:</b></p>
+ 
+ <div class="doc_notes">
+ <ol>
+ <li><a name="pf_1">Code generation supported for Pentium processors and
+ up</a></li>
+ <li><a name="pf_2">Code generation supported for 32-bit ABI only</a></li>
+ <li><a name="pf_3">No native code generation</a></li>
+ <li><a name="pf_4">Build is not complete: one or more tools don't link</a></li>
+ <li><a name="pf_5">The GCC-based C/C++ frontend does not build</a></li>
+ <li><a name="pf_6">The port is done using the MSYS shell.</a>
+ <a href="http://www.mingw.org/MinGWiki/">Download</a> and install 
+ bison (excl. M4.exe) and flex in that order. Build binutils-2.15 from source,
+ if necessary. Bison and flex can also be obtained from the GNUWin32 project on
+ sf.net.</li>
+ <li><a name="pf_7">Native code generation exists but is not complete.</a></li>
+ <li><a name="pf_8">Binutils versions up to post-2.17 have a bug in
+     bfd/cofflink.c</a> that prevents LLVM from building correctly. Several
+     workarounds have been introduced into the LLVM build system, but the bug
+     may still occur. It is highly recommended that you rebuild your current
+     binutils with the patch from the
+     <a href="http://sourceware.org/bugzilla/show_bug.cgi?id=2659">Binutils
+     bugzilla</a>, if it has not already been applied.</li>
+ </ol>
+ </div>
+ 
+ <p>Note that you will need about 1-3 GB of space for a full LLVM build in Debug
+ mode, depending on the system (it is so large because of all the debugging
+ information and the fact that the libraries are statically linked into multiple
+ tools).  If you do not need many of the tools and you are space-conscious,
+ you can disable them individually in <tt>llvm/tools/Makefile</tt>.  The Release
+ build requires considerably less space.</p>
+ 
+ <p>The LLVM suite <i>may</i> compile on other platforms, but it is not
+ guaranteed to do so.  If compilation is successful, the LLVM utilities should be
+ able to assemble, disassemble, analyze, and optimize LLVM bytecode.  Code
+ generation should work as well, although the generated native code may not work
+ on your platform.</p>
+ 
+ <p>The GCC front end is not very portable at the moment.  If you want to get it
+ to work on another platform, you can download a copy of the source and <a
+ href="CFEBuildInstrs.html">try to compile it</a> on your platform.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="software"><b>Software</b></a></div>
+ <div class="doc_text">
+   <p>Compiling LLVM requires that you have several software packages 
+   installed. The table below lists those required packages. The Package column
+   is the usual name for the software package that LLVM depends on. The Version
+   column provides "known to work" versions of the package. The Notes column
+   describes how LLVM uses the package and provides other details.</p>
+   <table>
+     <tr><th>Package</th><th>Version</th><th>Notes</th></tr>
+ 
+     <tr>
+       <td><a href="http://savannah.gnu.org/projects/make">GNU Make</a></td>
+       <td>3.79, 3.79.1</td>
+       <td>Makefile/build processor</td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://gcc.gnu.org">GCC</a></td>
+       <td>3.4.2</td>
+       <td>C/C++ compiler<sup><a href="#sf1">1</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.gnu.org/software/texinfo">TeXinfo</a></td>
+       <td>4.5</td>
+       <td>For building the CFE</td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.gnu.org/software/flex">Flex</a></td>
+       <td>2.5.4</td>
+       <td>LEX compiler</td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.gnu.org/software/bison/bison.html">Bison</a></td>
+       <td>1.28, 1.35, 1.75, 1.875d, 2.0, or 2.1<br>(not 1.85 or 1.875)</td>
+       <td>YACC compiler</td>
+     </tr>
+ 
+     <tr>
+       <td><a href="https://www.cvshome.org/downloads.html">CVS</a></td>
+       <td>≥1.11</td>
+       <td>CVS access to LLVM<sup><a href="#sf2">2</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://savannah.gnu.org/projects/dejagnu">DejaGnu</a></td>
+       <td>1.4.2</td>
+       <td>Automated test suite<sup><a href="#sf3">3</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.tcl.tk/software/tcltk/">tcl</a></td>
+       <td>8.3, 8.4</td>
+       <td>Automated test suite<sup><a href="#sf3">3</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://expect.nist.gov/">expect</a></td>
+       <td>5.38.0</td>
+       <td>Automated test suite<sup><a href="#sf3">3</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.perl.com/download.csp">perl</a></td>
+       <td>≥5.6.0</td>
+       <td>Nightly tester, utilities</td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://savannah.gnu.org/projects/m4">GNU M4</a></td>
+       <td>1.4</td>
+       <td>Macro processor for configuration<sup><a href="#sf4">4</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.gnu.org/software/autoconf">GNU Autoconf</a></td>
+       <td>2.59</td>
+       <td>Configuration script builder<sup><a href="#sf4">4</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://www.gnu.org/software/automake">GNU Automake</a></td>
+       <td>1.9.2</td>
+       <td>aclocal macro generator<sup><a href="#sf4">4</a></sup></td>
+     </tr>
+ 
+     <tr>
+       <td><a href="http://savannah.gnu.org/projects/libtool">libtool</a></td>
+       <td>1.5.10</td>
+       <td>Shared library manager<sup><a href="#sf4">4</a></sup></td>
+     </tr>
+ 
+   </table>
+ 
+   <p><b>Notes:</b></p>
+   <div class="doc_notes">
+   <ol>
+     <li><a name="sf1">Only the C and C++ languages are needed so there's no
+       need to build the other languages for LLVM's purposes.</a> See 
+       <a href="#brokengcc">below</a> for specific version info.</li>
+     <li><a name="sf2">You only need CVS if you intend to build from the 
+       latest LLVM sources. If you're working from a release distribution, you
+       don't need CVS.</a></li>
+     <li><a name="sf3">Only needed if you want to run the automated test 
+       suite in the <tt>llvm/test</tt> directory.</a></li>
+     <li><a name="sf4">If you want to make changes to the configure scripts, 
+       you will need GNU autoconf (2.59), and consequently, GNU M4 (version 1.4 
+       or higher). You will also need automake (1.9.2). We only use aclocal 
+       from that package.</a></li>
+   </ol>
+   </div>
+   
+   <p>Additionally, your compilation host is expected to have the usual 
+   plethora of Unix utilities. Specifically:</p>
+   <ul>
+     <li><b>ar</b> - archive library builder</li>
+     <li><b>bzip2*</b> - bzip2 command for distribution generation</li>
+     <li><b>bunzip2*</b> - bunzip2 command for distribution checking</li>
+     <li><b>chmod</b> - change permissions on a file</li>
+     <li><b>cat</b> - output concatenation utility</li>
+     <li><b>cp</b> - copy files</li>
+     <li><b>date</b> - print the current date/time </li>
+     <li><b>echo</b> - print to standard output</li>
+     <li><b>egrep</b> - extended regular expression search utility</li>
+     <li><b>etags</b> - C/C++ tag file creator for vim/emacs</li>
+     <li><b>find</b> - find files/dirs in a file system</li>
+     <li><b>grep</b> - regular expression search utility</li>
+     <li><b>gzip*</b> - gzip command for distribution generation</li>
+     <li><b>gunzip*</b> - gunzip command for distribution checking</li>
+     <li><b>install</b> - install directories/files </li>
+     <li><b>mkdir</b> - create a directory</li>
+     <li><b>mv</b> - move (rename) files</li>
+     <li><b>ranlib</b> - symbol table builder for archive libraries</li>
+     <li><b>rm</b> - remove (delete) files and directories</li>
+     <li><b>sed</b> - stream editor for transforming output</li>
+     <li><b>sh</b> - Bourne shell for make build scripts</li>
+     <li><b>tar</b> - tape archive for distribution generation</li>
+     <li><b>test</b> - test things in file system</li>
+     <li><b>unzip*</b> - unzip command for distribution checking</li>
+     <li><b>zip*</b> - zip command for distribution generation</li>
+   </ul>
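+ <p>The list above can be checked mechanically before starting a build. The
+ following is a minimal sketch, not part of the LLVM build system: the function
+ name <tt>missing_tools</tt> is made up for illustration, and it relies only on
+ the shell's standard <tt>command -v</tt> lookup.</p>

```shell
# Print each named utility that cannot be found on PATH.
# The function name "missing_tools" is made up for this sketch.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Check a subset of the utilities listed above; nothing is printed
# when all of them are available.
missing_tools ar cat chmod cp date echo find grep install mkdir mv ranlib rm sed sh tar test
```

+ <p>Any name it prints is a utility you should install before configuring
+ LLVM.</p>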
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="brokengcc">Broken versions of GCC</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM is very demanding of the host C++ compiler, and as such tends to expose
+ bugs in the compiler.  In particular, several versions of GCC crash when trying
+ to compile LLVM.  We routinely build LLVM successfully with GCC 3.3.3, 3.4.0,
+ and Apple 4.0.1 (however, see below).  Other versions of GCC will 
+ probably work as well.  The GCC versions listed
+ here are known not to work.  If you are using one of these versions, please try
+ to upgrade your GCC to something more recent.  If you run into a problem with a
+ version of GCC not listed here, please <a href="mailto:llvmdev@cs.uiuc.edu">let
+ us know</a>.  Please use the "<tt>gcc -v</tt>" command to find out which version
+ of GCC you are using.
+ </p>
+ 
+ <p><b>GCC versions prior to 3.0</b>: GCC 2.96.x and before had several
+ problems in the STL that effectively prevent it from compiling LLVM.
+ </p>
+ 
+ <p><b>GCC 3.2.2</b>: This version of GCC fails to compile LLVM.</p>
+ 
+ <p><b>GCC 3.3.2</b>: This version of GCC suffered from a <a 
+ href="http://gcc.gnu.org/PR13392">serious bug</a> which causes it to crash in
+ the "<tt>convert_from_eh_region_ranges_1</tt>" GCC function.</p>
+ 
+ <p><b>Cygwin GCC 3.3.3</b>: The version of GCC 3.3.3 commonly shipped with 
+    Cygwin does not work.  Please <a href="CFEBuildInstrs.html#cygwin">upgrade 
+    to a newer version</a> if possible.</p>
+ <p><b>SuSE GCC 3.3.3</b>: The version of GCC 3.3.3 shipped with SuSE 9.1 (and 
+    possibly others) does not compile LLVM correctly (it appears that exception 
+    handling is broken in some cases).  Please download the FSF 3.3.3 or upgrade
+    to a newer version of GCC.</p>
+ <p><b>IA-64 GCC 4.0.0</b>: The IA-64 version of GCC 4.0.0 is known to
+    miscompile LLVM.</p>
+ <p><b>Apple Xcode 2.3</b>: GCC crashes when compiling LLVM at -O3 (which is the
+    default with ENABLE_OPTIMIZED=1).  To work around this, build with 
+    "ENABLE_OPTIMIZED=1 OPTIMIZE_OPTION=-O2".</p>
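+ <p>As a rough aid, the plain version numbers named in this section can be
+ checked from a script. This is only a sketch: the function name
+ <tt>is_known_broken_gcc</tt> is hypothetical, and it cannot detect the
+ vendor-specific cases (Cygwin, SuSE, IA-64, Xcode), which share a version
+ number with working compilers. It uses <tt>gcc -dumpversion</tt>, which prints
+ only the version number.</p>

```shell
# Return success (0) if the given GCC version is on the known-broken list.
# Covers only the plain version numbers named above: anything before 3.0,
# plus 3.2.2 and 3.3.2. Vendor-specific cases (Cygwin, SuSE, IA-64, Xcode)
# cannot be detected from the version number alone.
is_known_broken_gcc() {
  case "$1" in
    [12].*|3.2.2|3.3.2) return 0 ;;
    *) return 1 ;;
  esac
}

# Usage: warn before starting a long build.
if command -v gcc >/dev/null 2>&1; then
  gcc_version=$(gcc -dumpversion)
  if is_known_broken_gcc "$gcc_version"; then
    echo "warning: GCC $gcc_version is known not to compile LLVM" >&2
  fi
fi
```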
+ </div>
+ 
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="starting"><b>Getting Started with LLVM</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The remainder of this guide is meant to get you up and running with
+ LLVM and to give you some basic information about the LLVM environment.</p>
+ 
+ <p>The later sections of this guide describe the <a
+ href="#layout">general layout</a> of the LLVM source tree, a <a
+ href="#tutorial">simple example</a> using the LLVM tool chain, and <a
+ href="#links">links</a> to find more information about LLVM or to get
+ help via e-mail.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="terminology">Terminology and Notation</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Throughout this manual, the following names are used to denote paths
+ specific to the local system and working environment.  <i>These are not
+ environment variables you need to set but just strings used in the rest
+ of this document below</i>.  In any of the examples below, simply replace
+ each of these names with the appropriate pathname on your local system.
+ All these paths are absolute:</p>
+ 
+ <dl>
+     <dt>SRC_ROOT
+     <dd>
+     This is the top level directory of the LLVM source tree.
+     <p>
+ 
+     <dt>OBJ_ROOT
+     <dd>
+     This is the top level directory of the LLVM object tree (i.e. the
+     tree where object files and compiled programs will be placed).  It
+     can be the same as SRC_ROOT.
+     <p>
+ 
+     <dt>LLVMGCCDIR
+     <dd>
+     This is where the LLVM GCC Front End is installed.
+     <p>
+     For the pre-built GCC front end binaries, the LLVMGCCDIR is
+     <tt>cfrontend/<i>platform</i>/llvm-gcc</tt>.
+ </dl>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="environment">Setting Up Your Environment</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ In order to compile and use LLVM, you may need to set some environment
+ variables.
+ 
+ <dl>
+   <dt><tt>LLVM_LIB_SEARCH_PATH</tt>=<tt>/path/to/your/bytecode/libs</tt></dt>
+   <dd>[Optional] This environment variable helps LLVM linking tools find the
+   locations of your bytecode libraries. It is provided only as a
+   convenience since you can specify the paths using the -L options of the
+   tools and the C/C++ front-end will automatically use the bytecode files
+   installed in its
+   <tt>lib</tt> directory.</dd>
+ </dl>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="unpack">Unpacking the LLVM Archives</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ If you have the LLVM distribution, you will need to unpack it before you
+ can begin to compile it.  LLVM is distributed as a set of two files: the LLVM
+ suite and the LLVM GCC front end compiled for your platform.  There is an
+ additional test suite that is optional.  Each file is a TAR archive that is
+ compressed with the gzip program.
+ </p>
+ 
+ <p>The files are as follows, with <em>x.y</em> marking the version number:
+ <dl>
+   <dt><tt>llvm-x.y.tar.gz</tt></dt>
+   <dd>Source release for the LLVM libraries and tools.<br/></dd>
+ 
+   <dt><tt>llvm-test-x.y.tar.gz</tt></dt>
+   <dd>Source release for the LLVM test suite.</dd>
+ 
+   <dt><tt>cfrontend-x.y.source.tar.gz</tt></dt>
+   <dd>Source release of the GCC front end.<br/></dd>
+ 
+   <dt><tt>cfrontend-x.y.i686-redhat-linux-gnu.tar.gz</tt></dt>
+   <dd>Binary release of the GCC front end for Linux/x86.<br/></dd>
+ 
+   <dt><tt>llvm-gcc4-x.y.source.tar.gz</tt></dt>
+   <dd>Source release of the llvm-gcc4 front end.  See README.LLVM in the root
+       directory for build instructions.<br/></dd>
+ 
+   <dt><tt>llvm-gcc4-x.y.powerpc-apple-darwin8.6.0.tar.gz</tt></dt>
+   <dd>Binary release of the llvm-gcc4 front end for MacOS X/PowerPC.<br/></dd>
+ 
+   <dt><tt>llvm-gcc4-x.y.i686-apple-darwin8.6.1.tar.gz</tt></dt>
+   <dd>Binary release of the llvm-gcc4 front end for MacOS X/X86.<br/></dd>
+ </dl>
+ 
+ <p>It is also possible to download the sources of the llvm-gcc4 front end from a
+ read-only subversion mirror at
+ svn://anonsvn.opensource.apple.com/svn/llvm/trunk.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="checkout">Checkout LLVM from CVS</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If you have access to our CVS repository, you can get a fresh copy of
+ the entire source code.  All you need to do is check it out from CVS as
+ follows:</p>
+ 
+ <ul>
+ <li><tt>cd <i>where-you-want-llvm-to-live</i></tt>
+   <li><tt>cvs -d :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm login</tt>
+   <li>Hit the return key when prompted for the password.
+   <li><tt>cvs -z3 -d :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm co
+       llvm</tt>
+ </ul>
+ 
+ <p>This will create an '<tt>llvm</tt>' directory in the current
+ directory and fully populate it with the LLVM source code, Makefiles,
+ test directories, and local copies of documentation files.</p>
+ 
+ <p>If you want to get a specific release (as opposed to the most recent
+ revision), you can specify a label.  The releases have the following
+ labels:</p>
+ 
+ <ul>
+ <li>Release 1.7: <b>RELEASE_17</b></li>
+ <li>Release 1.6: <b>RELEASE_16</b></li>
+ <li>Release 1.5: <b>RELEASE_15</b></li>
+ <li>Release 1.4: <b>RELEASE_14</b></li>
+ <li>Release 1.3: <b>RELEASE_13</b></li>
+ <li>Release 1.2: <b>RELEASE_12</b></li>
+ <li>Release 1.1: <b>RELEASE_11</b></li>
+ <li>Release 1.0: <b>RELEASE_1</b></li>
+ </ul>
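+ <p>This guide does not show how to pass a label to CVS. One common way is the
+ <tt>-r</tt> option of <tt>cvs checkout</tt>; treat its use here as an
+ assumption rather than the documented LLVM procedure. The sketch below maps a
+ release number to its label from the list above and prints the resulting
+ command instead of running it. The function names are made up for
+ illustration.</p>

```shell
# Map a release number such as 1.7 to its CVS label, following the list above.
# "release_label" and "checkout_command" are names made up for this sketch.
release_label() {
  case "$1" in
    1.0) echo "RELEASE_1" ;;
    1.[1-7]) echo "RELEASE_1${1#1.}" ;;
    *) echo "unknown release: $1" >&2; return 1 ;;
  esac
}

# Print (rather than run) the checkout command for one release.
# The -r option is standard CVS usage, assumed here and not taken from this guide.
checkout_command() {
  label=$(release_label "$1") || return 1
  echo "cvs -z3 -d :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm co -r $label llvm"
}

checkout_command 1.7
```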
+ 
+ <p>If you would like to get the LLVM test suite (a separate package as of 1.4),
+ you can get it from the CVS repository:</p>
+ <pre>
+   cd llvm/projects
+   cvs -z3 -d :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm co llvm-test
+ </pre>
+ <p>By placing it in the <tt>llvm/projects</tt> directory, it will be automatically
+ configured by the LLVM configure script as well as automatically updated when
+ you run <tt>cvs update</tt>.</p>
+ 
+ <p>If you would like to get the GCC 3.4 front end source code, you can also get it from the CVS repository:</p>
+ 
+ <pre>
+   cvs -z3 -d :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm co llvm-gcc
+ </pre>
+ 
+ <p>Please note that you must follow <a href="CFEBuildInstrs.html">these 
+ instructions</a> to successfully build the LLVM GCC front-end.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="installcf">Install the GCC Front End</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Before configuring and compiling the LLVM suite, you need to extract the LLVM
+ GCC front end from the binary distribution.  It is used for building the
+ bytecode libraries later used by the GCC front end for linking programs, and its
+ location must be specified when the LLVM suite is configured.</p>
+ 
+ <p>To install the GCC front end, do the following:</p>
+ 
+ <ol>
+   <li><tt>cd <i>where-you-want-the-front-end-to-live</i></tt></li>
+   <li><tt>gunzip --stdout cfrontend-<i>version</i>.<i>platform</i>.tar.gz | tar -xvf
+       -</tt></li>
+ </ol>
+ 
+ <p>Next, you will need to fix your system header files:</p>
+ 
+ <p><tt>cd cfrontend/<i>platform</i><br>
+    ./fixheaders</tt></p>
+ 
+ <p>The binary versions of the GCC front end may not suit all of your needs.  For
+ example, the binary distribution may include an old version of a system header
+ file, not "fix" a header file that needs to be fixed for GCC, or it may be
+ linked with libraries not available on your system.</p>
+ 
+ <p>In cases like these, you may want to try <a
+ href="CFEBuildInstrs.html">building the GCC front end from source.</a> This is
+ not for the faint of heart, so be forewarned.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="config">Local LLVM Configuration</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Once checked out from the CVS repository, the LLVM suite source code must be
+ configured via the <tt>configure</tt> script.  This script sets variables in the
+ various <tt>*.in</tt> files, most notably <tt>llvm/Makefile.config</tt> and 
+ <tt>llvm/include/Config/config.h</tt>.  It also populates <i>OBJ_ROOT</i> with 
+ the Makefiles needed to begin building LLVM.</p>
+ 
+ <p>The following environment variables are used by the <tt>configure</tt>
+ script to configure the build system:</p>
+ 
+ <table>
+   <tr><th>Variable</th><th>Purpose</th></tr>
+   <tr>
+     <td>CC</td>
+     <td>Tells <tt>configure</tt> which C compiler to use.  By default,
+         <tt>configure</tt> will look for the first GCC C compiler in
+         <tt>PATH</tt>.  Use this variable to override
+         <tt>configure</tt>'s default behavior.</td>
+   </tr>
+   <tr>
+     <td>CXX</td>
+     <td>Tells <tt>configure</tt> which C++ compiler to use.  By default,
+        <tt>configure</tt> will look for the first GCC C++ compiler in
+        <tt>PATH</tt>.  Use this variable to override
+        <tt>configure</tt>'s default behavior.</td>
+   </tr>
+ </table>
+ 
+ <p>The following options can be used to set or enable LLVM specific options:</p>
+ 
+ <dl>
+   <dt><i>--with-llvmgccdir</i></dt>
+   <dd>Path to the LLVM C/C++ FrontEnd to be used with this LLVM configuration. 
+   The value of this option should specify the full pathname of the C/C++ Front
+   End to be used. If this option is not provided, the PATH will be searched for
+   a program named <i>llvm-gcc</i> and the C/C++ FrontEnd install directory will
+   be inferred from the path found. If the option is not given, and no llvm-gcc
+   can be found in the path then a warning will be produced by 
+   <tt>configure</tt> indicating this situation. LLVM may still be built with 
+   the <tt>tools-only</tt> target but attempting to build the runtime libraries
+   will fail as these libraries require llvm-gcc and llvm-g++. See 
+   <a href="#installcf">Install the GCC Front End</a> for details on installing
+   the C/C++ Front End. See
+   <a href="CFEBuildInstrs.html">Bootstrapping the LLVM C/C++ Front-End</a>
+   for details on building the C/C++ Front End.</dd>
+   <dt><i>--with-tclinclude</i></dt>
+   <dd>Path to the tcl include directory under which <tt>tclsh</tt> can be
+   found. Use this if you have multiple tcl installations on your machine and you
+   want to use a specific one (8.x) for LLVM. LLVM only uses tcl for running the
+   dejagnu based test suite in <tt>llvm/test</tt>. If you don't specify this
+   option, the LLVM configure script will search for the tcl 8.4 and 8.3
+   releases.
+   <p></p>
+   </dd>
+   <dt><i>--enable-optimized</i></dt>
+   <dd>
+     Enables optimized compilation by default (debugging symbols are removed
+     and GCC optimization flags are enabled).  The default is to use an
+     unoptimized build (also known as a debug build).
+     <p></p>
+   </dd>
+   <dt><i>--enable-debug-runtime</i></dt>
+   <dd>
+     Enables debug symbols in the runtime libraries. The default is to strip
+     debug symbols from the runtime libraries. 
+   </dd>
+   <dt><i>--enable-jit</i></dt>
+   <dd>
+     Compile the Just In Time (JIT) compiler functionality.  This is not
+     available
+     on all platforms.  The default is dependent on platform, so it is best
+     to explicitly enable it if you want it.
+     <p></p>
+   </dd>
+   <dt><i>--enable-targets=</i><tt>target-option</tt></dt>
+   <dd>Controls which targets will be built and linked into llc. The default 
+   value for <tt>target-option</tt> is "all" which builds and links all 
+   available targets.  The value "host-only" can be specified to build only a 
+   native compiler (no cross-compiler targets available). In that case, the
+   native target is the target of the build host. You can also specify a comma 
+   separated list of target names that you want available in llc. The target 
+   names use all lower case. The current set of targets is: <br/>
+   <tt>alpha, ia64, powerpc, skeleton, sparc, x86</tt>.
+   <p></p></dd>
+   <dt><i>--enable-doxygen</i></dt>
+   <dd>Look for the doxygen program and enable construction of doxygen based
+   documentation from the source code. This is disabled by default because 
+   generating the documentation can take a long time and produces hundreds of 
+   megabytes of output.</dd>
+ </dl>
+ 
+ <p>To configure LLVM, follow these steps:</p>
+ 
+ <ol>
+     <li>Change directory into the object root directory:
+     <br>
+     <tt>cd <i>OBJ_ROOT</i></tt>
+     <p>
+ 
+     <li>Run the <tt>configure</tt> script located in the LLVM source tree:
+     <br>
+     <tt><i>SRC_ROOT</i>/configure --prefix=/install/path [other options]</tt>
+     <p>
+ </ol>
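+ <p>The two steps above, combined with the options described earlier in this
+ section, can be sketched as a small helper that assembles the
+ <tt>configure</tt> command line. The function name
+ <tt>configure_command</tt> and the example paths are hypothetical; only the
+ options themselves come from this guide. It prints the command rather than
+ running it.</p>

```shell
# Print the configure command line for an out-of-tree build.
# Arguments: SRC_ROOT, install prefix, then any extra configure options.
# "configure_command" is a name made up for this sketch.
configure_command() {
  src_root=$1
  prefix=$2
  shift 2
  # ${*:+ $*} appends the extra options only when some were given.
  echo "$src_root/configure --prefix=$prefix${*:+ $*}"
}

# Example: an optimized build with only the x86 and PowerPC targets.
configure_command /home/me/llvm /opt/llvm --enable-optimized --enable-targets=x86,powerpc
```

+ <p>Run the printed command from inside <i>OBJ_ROOT</i> so that the Makefiles
+ are generated there.</p>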
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="compile">Compiling the LLVM Suite Source Code</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Once you have configured LLVM, you can build it.  There are three types of
+ builds:</p>
+ 
+ <dl>
+     <dt>Debug Builds
+     <dd>
+     These builds are the default when one types <tt>gmake</tt> (unless the
+     <tt>--enable-optimized</tt> option was used during configuration).  The
+     build system will compile the tools and libraries with debugging
+     information.
+     <p>
+ 
+     <dt>Release (Optimized) Builds
+     <dd>
+     These builds are enabled with the <tt>--enable-optimized</tt> option to
+     <tt>configure</tt> or by specifying <tt>ENABLE_OPTIMIZED=1</tt> on the
+     <tt>gmake</tt> command line.  For these builds, the build system will
+     compile the tools and libraries with GCC optimizations enabled and strip
+     debugging information from the libraries and executables it generates. 
+     <p>
+ 
+     <dt>Profile Builds
+     <dd>
+     These builds are for use with profiling.  They compile profiling
+     information into the code for use with programs like <tt>gprof</tt>.
+     Profile builds must be started by specifying <tt>ENABLE_PROFILING=1</tt>
+     on the <tt>gmake</tt> command line.
+ </dl>
+ 
+ <p>Once you have LLVM configured, you can build it by entering the
+ <i>OBJ_ROOT</i> directory and issuing the following command:</p>
+ 
+ <p><tt>gmake</tt></p>
+ 
+ <p>If the build fails, please <a href="#brokengcc">check here</a> to see if you
+ are using a version of GCC that is known not to compile LLVM.</p>
+ 
+ <p>
+ If you have multiple processors in your machine, you may wish to use some of
+ the parallel build options provided by GNU Make.  For example, you could use the
+ command:</p>
+ 
+ <p><tt>gmake -j2</tt></p>
+ 
+ <p>There are several special targets which are useful when working with the LLVM
+ source code:</p>
+ 
+ <dl>
+   <dt><tt>gmake clean</tt>
+   <dd>
+   Removes all files generated by the build.  This includes object files,
+   generated C/C++ files, libraries, and executables.
+   <p>
+ 
+   <dt><tt>gmake dist-clean</tt>
+   <dd>
+   Removes everything that <tt>gmake clean</tt> does, but also removes files
+   generated by <tt>configure</tt>.  It attempts to return the source tree to the
+   original state in which it was shipped.
+   <p>
+ 
+   <dt><tt>gmake install</tt>
+   <dd>
+   Installs LLVM header files, libraries, tools, and documentation in a
+   hierarchy 
+   under $PREFIX, specified with <tt>./configure --prefix=[dir]</tt>, which 
+   defaults to <tt>/usr/local</tt>.
+   <p>
+   
+   <dt><tt>gmake -C runtime install-bytecode</tt>
+   <dd>
+   Assuming you built LLVM into <i>OBJ_ROOT</i>, this command installs the
+   bytecode libraries into the GCC front end's bytecode library 
+   directory.  If you need to update your bytecode libraries,
+   this is the target to use once you've built them.
+   <p>
+ </dl>
+ 
+ <p>Please see the <a href="MakefileGuide.html">Makefile Guide</a> for further
+ details on these <tt>make</tt> targets and descriptions of other targets
+ available.</p>
+ 
+ <p>It is also possible to override default values from <tt>configure</tt> by
+ declaring variables on the command line.  The following are some examples:</p>
+ 
+ <dl>
+   <dt><tt>gmake ENABLE_OPTIMIZED=1</tt>
+   <dd>
+   Perform a Release (Optimized) build.
+   <p>
+ 
+   <dt><tt>gmake ENABLE_OPTIMIZED=1 DISABLE_ASSERTIONS=1</tt>
+   <dd>
+   Perform a Release (Optimized) build without assertions enabled.
+   <p>
+ 
+   <dt><tt>gmake ENABLE_PROFILING=1</tt>
+   <dd>
+   Perform a Profiling build.
+   <p>
+ 
+   <dt><tt>gmake VERBOSE=1</tt>
+   <dd>
+   Print what <tt>gmake</tt> is doing on standard output.
+   <p>
+ 
+   <dt><tt>gmake TOOL_VERBOSE=1</tt></dt>
+   <dd>Ask each tool invoked by the makefiles to print out what it is doing on 
+   the standard output. This also implies <tt>VERBOSE=1</tt>.
+   <p></dd>
+ </dl>
+ 
+ <p>Every directory in the LLVM object tree includes a <tt>Makefile</tt> to build
+ it and any subdirectories that it contains.  Entering any directory inside the
+ LLVM object tree and typing <tt>gmake</tt> should rebuild anything in or below
+ that directory that is out of date.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="cross-compile">Cross-Compiling LLVM</a>
+ </div>
+ 
+ <div class="doc_text">
+   <p>It is possible to cross-compile LLVM. That is, you can create LLVM
+   executables and libraries for a platform different from the one on which you
+   are compiling.  To do this, a few additional steps are 
+   required. <sup><a href="#ccn_1">1</a></sup> To cross-compile LLVM, use
+   these instructions:</p>
+   <ol>
+     <li>Configure and build LLVM as a native compiler. You will need
+     just <tt>TableGen</tt> from that build.
+       <ul>
+         <li>If you have <tt>$LLVM_OBJ_ROOT=$LLVM_SRC_ROOT</tt> just execute 
+           <tt>make -C utils/TableGen</tt> after configuring.</li>
+         <li>Otherwise you will need to monitor the build process and terminate 
+           it just after <tt>TableGen</tt> is built.</li>
+       </ul>
+     </li>
+     <li>Copy the TableGen binary to somewhere safe (out of your build tree).
+     </li>
+     <li>Configure LLVM to build with a cross-compiler. To do this, supply the
+     configure script with <tt>--build</tt> and <tt>--host</tt> options that
+     are different. The values of these options must be legal target triples 
+     that your GCC compiler supports.</li>
+     <li>Put the saved <tt>TableGen</tt> executable into the
+     <tt>$LLVM_OBJ_ROOT/{BUILD_TYPE}/bin</tt> directory (e.g. into 
+     <tt>.../Release/bin</tt> for a Release build).</li>
+     <li>Build LLVM  as usual.</li>
+   </ol>
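+   <p>The configure step for the cross build can be sketched as follows. This
+   is an illustration under stated assumptions: the function name
+   <tt>cross_configure_command</tt> and the example triples are hypothetical,
+   and the command is printed rather than run. The only rule it enforces is the
+   one given above, that <tt>--build</tt> and <tt>--host</tt> must differ.</p>

```shell
# Print the configure command for a cross-compiling build, refusing
# identical --build and --host values since the two must differ.
# "cross_configure_command" and the triples below are illustrative only.
cross_configure_command() {
  src_root=$1
  build_triple=$2
  host_triple=$3
  if [ "$build_triple" = "$host_triple" ]; then
    echo "error: --build and --host must differ for a cross build" >&2
    return 1
  fi
  echo "$src_root/configure --build=$build_triple --host=$host_triple"
}

# Example triples for a Linux build machine producing Windows (mingw32) binaries.
cross_configure_command /home/me/llvm i686-pc-linux-gnu i686-pc-mingw32
```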
+   <p>Such a build produces executables that cannot be run on your build host
+   (<tt>--build</tt> option) but can be run on your compile host
+   (<tt>--host</tt> option).</p>
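+   <p>As a rough illustration, the steps above can be laid out as a dry-run
+   shell sketch that only prints the commands it would run. It assumes an
+   in-tree build (<tt>$LLVM_OBJ_ROOT=$LLVM_SRC_ROOT</tt>). The triples, the
+   directory names, and the <tt>tblgen</tt> binary name are assumptions made
+   for this example; substitute the values that match your own setup.</p>

```shell
#!/bin/sh
# Dry-run sketch: prints the cross-compile steps instead of executing them.
# Every value below is an illustrative assumption, not taken from the LLVM docs.
SRC_ROOT=/path/to/llvm            # assumed: in-tree build, OBJ_ROOT = SRC_ROOT
SAFE_DIR=/path/to/safe            # assumed: a directory outside the build tree
BUILD_TRIPLE=i686-pc-linux-gnu    # assumed build platform triple
HOST_TRIPLE=i586-mingw32msvc      # assumed host platform triple
BUILD_TYPE=Debug                  # assumed build type
TBLGEN=tblgen                     # assumed name of the TableGen binary

cross_plan() {
  # Step 1: native configure, then build only TableGen.
  echo "cd $SRC_ROOT && ./configure"
  echo "make -C utils/TableGen"
  # Step 2: copy the native TableGen binary out of the build tree.
  echo "cp $SRC_ROOT/$BUILD_TYPE/bin/$TBLGEN $SAFE_DIR/"
  # Step 3: reconfigure with different --build and --host triples.
  echo "./configure --build=$BUILD_TRIPLE --host=$HOST_TRIPLE"
  # Step 4: put the saved binary back where the build expects it.
  echo "cp $SAFE_DIR/$TBLGEN $SRC_ROOT/$BUILD_TYPE/bin/"
  # Step 5: build as usual.
  echo "make"
}

cross_plan
```

+   <p>Review the printed commands and adapt them before running anything; the
+   sketch does not check that your GCC cross-compiler supports the chosen
+   triples.</p>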
+   <p><b>Notes:</b></p>
+   <div class="doc_notes">
+     <ol>
+       <li><a name="ccn_1">Cross-compiling</a> was tested only with Linux as 
+       build platform and Windows as host using mingw32 cross-compiler. Other
+       combinations have not been tested.</li>
+     </ol>
+   </div>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="objfiles">The Location of LLVM Object Files</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM build system is capable of sharing a single LLVM source tree among
+ several LLVM builds.  Hence, it is possible to build LLVM for several different
+ platforms or configurations using the same source tree.</p>
+ 
+ <p>This is accomplished in the typical autoconf manner:</p>
+ 
+ <ul>
+   <li><p>Change directory to where the LLVM object files should live:</p>
+ 
+       <p><tt>cd <i>OBJ_ROOT</i></tt></p></li>
+ 
+   <li><p>Run the <tt>configure</tt> script found in the LLVM source
+       directory:</p>
+ 
+       <p><tt><i>SRC_ROOT</i>/configure</tt></p></li>
+ </ul>
+ 
+ <p>The LLVM build will place files underneath <i>OBJ_ROOT</i> in directories
+ named after the build type:</p>
+ 
+ <dl>
+   <dt>Debug Builds
+   <dd>
+   <dl>
+     <dt>Tools
+     <dd><tt><i>OBJ_ROOT</i>/Debug/bin</tt>
+     <dt>Libraries
+     <dd><tt><i>OBJ_ROOT</i>/Debug/lib</tt>
+   </dl>
+   <p>
+ 
+   <dt>Release Builds
+   <dd>
+   <dl>
+     <dt>Tools
+     <dd><tt><i>OBJ_ROOT</i>/Release/bin</tt>
+     <dt>Libraries
+     <dd><tt><i>OBJ_ROOT</i>/Release/lib</tt>
+   </dl>
+   <p>
+ 
+   <dt>Profile Builds
+   <dd>
+   <dl>
+     <dt>Tools
+     <dd><tt><i>OBJ_ROOT</i>/Profile/bin</tt>
+     <dt>Libraries
+     <dd><tt><i>OBJ_ROOT</i>/Profile/lib</tt>
+   </dl>
+ </dl>
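+ <p>The layout above is regular enough to compute. The following sketch uses
+ two hypothetical helper functions (the names are made up for this example) to
+ derive the tool and library directories from an <i>OBJ_ROOT</i> and a build
+ type. They only format paths and do not check that the directories exist.</p>

```shell
#!/bin/sh
# Hypothetical helpers for illustration; they do not touch the filesystem.

# Print the directory that holds the tools for a given build type.
llvm_tool_dir() {
  obj_root=$1
  build_type=$2
  case $build_type in
    Debug|Release|Profile) printf '%s/%s/bin\n' "$obj_root" "$build_type" ;;
    *) echo "unknown build type: $build_type" >&2; return 1 ;;
  esac
}

# Print the directory that holds the libraries for a given build type.
llvm_lib_dir() {
  obj_root=$1
  build_type=$2
  case $build_type in
    Debug|Release|Profile) printf '%s/%s/lib\n' "$obj_root" "$build_type" ;;
    *) echo "unknown build type: $build_type" >&2; return 1 ;;
  esac
}

# Example with an assumed object root.
llvm_tool_dir /path/to/obj Release
llvm_lib_dir /path/to/obj Debug
```

+ <p>One possible use is to put the freshly built tools on your path, for
+ example <tt>PATH=`llvm_tool_dir /path/to/obj Debug`:$PATH</tt>.</p>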
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="optionalconfig">Optional Configuration Items</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ If you're running on a Linux system that supports the "<a
+   href="http://www.tat.physik.uni-tuebingen.de/~rguenth/linux/binfmt_misc.html">
+   binfmt_misc</a>"
+ module, and you have root access on the system, you can set your system up to
+ execute LLVM bytecode files directly.  To do this, use commands like this (the
+ first command may not be required if you are already using the module):</p>
+ 
+ <div class="doc_code">
+ <pre>
+    $ mount -t binfmt_misc none /proc/sys/fs/binfmt_misc
+    $ echo ':llvm:M::llvm::/path/to/lli:' > /proc/sys/fs/binfmt_misc/register
+    $ chmod u+x hello.bc                (if needed)
+    $ ./hello.bc
+ </pre>
+ </div>
+ 
+ <p>
+ This allows you to execute LLVM bytecode files directly.  Thanks to Jack
+ Cummings for pointing this out!
+ </p>
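+ <p>For reference, a binfmt_misc registration string has the form
+ <tt>:name:type:offset:magic:mask:interpreter:flags</tt>. The sketch below
+ uses two hypothetical helpers (the names are invented for this example): one
+ assembles the string shown above for a given <tt>lli</tt> path, and the other
+ checks whether a file begins with the <tt>llvm</tt> magic that the rule
+ matches on. Neither helper writes to <tt>/proc</tt>.</p>

```shell
#!/bin/sh
# Hypothetical helpers for illustration; neither writes to /proc.

# Build the registration string: name=llvm, type=M (match by magic bytes),
# empty offset, magic="llvm", empty mask, interpreter=path to lli, no flags.
binfmt_llvm_entry() {
  printf ':llvm:M::llvm::%s:\n' "$1"
}

# Succeed if the file starts with the four bytes "llvm".
starts_with_llvm_magic() {
  [ "$(head -c 4 "$1" 2>/dev/null)" = "llvm" ]
}

# Example with a placeholder interpreter path.
binfmt_llvm_entry /path/to/lli
```

+ <p>The printed string is what would be written to
+ <tt>/proc/sys/fs/binfmt_misc/register</tt> by the <tt>echo</tt> command
+ above.</p>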
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="layout"><b>Program Layout</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>One useful source of information about the LLVM source base is the LLVM <a
+ href="http://www.doxygen.org">doxygen</a> documentation available at <tt><a
+ href="http://llvm.org/doxygen/">http://llvm.org/doxygen/</a></tt>.
+ The following is a brief introduction to code layout:</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="cvsdir"><tt>CVS</tt> directories</a></div>
+ <div class="doc_text">
+ <p>Every directory checked out of CVS will contain a <tt>CVS</tt> directory; for
+ the most part these can just be ignored.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="examples"><tt>llvm/examples</tt></a></div>
+ <div class="doc_text">
+   <p>This directory contains some simple examples of how to use the LLVM IR and
+   JIT.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="include"><tt>llvm/include</tt></a></div>
+ <div class="doc_text">
+ 
+ <p>This directory contains public header files exported from the LLVM
+ library. The three main subdirectories of this directory are:</p>
+ 
+ <dl>
+   <dt><tt><b>llvm/include/llvm</b></tt></dt>
+   <dd>This directory contains all of the LLVM specific header files.  This 
+   directory also has subdirectories for different portions of LLVM: 
+   <tt>Analysis</tt>, <tt>CodeGen</tt>, <tt>Target</tt>, <tt>Transforms</tt>, 
+   etc...</dd>
+ 
+   <dt><tt><b>llvm/include/llvm/Support</b></tt></dt>
+   <dd>This directory contains generic support libraries that are provided with 
+   LLVM but not necessarily specific to LLVM. For example, some C++ STL utilities 
+   and a Command Line option processing library store their header files here.
+   </dd>
+ 
+   <dt><tt><b>llvm/include/llvm/Config</b></tt></dt>
+   <dd>This directory contains header files configured by the <tt>configure</tt> 
+   script.  They wrap "standard" UNIX and C header files.  Source code can 
+   include these header files which automatically take care of the conditional 
+   #includes that the <tt>configure</tt> script generates.</dd>
+ </dl>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="lib"><tt>llvm/lib</tt></a></div>
+ <div class="doc_text">
+ 
+ <p>This directory contains most of the source files of the LLVM system. In LLVM,
+ almost all code exists in libraries, making it very easy to share code among the
+ different <a href="#tools">tools</a>.</p>
+ 
+ <dl>
+   <dt><tt><b>llvm/lib/VMCore/</b></tt></dt>
+   <dd> This directory holds the core LLVM source files that implement core 
+   classes like Instruction and BasicBlock.</dd>
+ 
+   <dt><tt><b>llvm/lib/AsmParser/</b></tt></dt>
+   <dd>This directory holds the source code for the LLVM assembly language parser 
+   library.</dd>
+ 
+   <dt><tt><b>llvm/lib/ByteCode/</b></tt></dt>
+   <dd>This directory holds code for reading and writing LLVM bytecode.</dd>
+ 
+   <dt><tt><b>llvm/lib/Analysis/</b></tt><dd>This directory contains a variety of
+   different program analyses, such as Dominator Information, Call Graphs,
+   Induction Variables, Interval Identification, Natural Loop Identification,
+   etc.</dd>
+ 
+   <dt><tt><b>llvm/lib/Transforms/</b></tt></dt>
+   <dd> This directory contains the source code for the LLVM to LLVM program 
+   transformations, such as Aggressive Dead Code Elimination, Sparse Conditional 
+   Constant Propagation, Inlining, Loop Invariant Code Motion, Dead Global 
+   Elimination, and many others.</dd>
+ 
+   <dt><tt><b>llvm/lib/Target/</b></tt></dt>
+   <dd> This directory contains files that describe various target architectures
+   for code generation.  For example, the <tt>llvm/lib/Target/X86</tt> 
+   directory holds the X86 machine description while
+   <tt>llvm/lib/Target/CBackend</tt> implements the LLVM-to-C converter.</dd>
+     
+   <dt><tt><b>llvm/lib/CodeGen/</b></tt></dt>
+   <dd> This directory contains the major parts of the code generator: Instruction 
+   Selector, Instruction Scheduling, and Register Allocation.</dd>
+ 
+   <dt><tt><b>llvm/lib/Debugger/</b></tt></dt>
+   <dd> This directory contains the source level debugger library that makes 
+   it possible to instrument LLVM programs so that a debugger could identify 
+   source code locations at which the program is executing.</dd>
+ 
+   <dt><tt><b>llvm/lib/ExecutionEngine/</b></tt></dt>
+   <dd> This directory contains libraries for executing LLVM bytecode directly 
+   at runtime in both interpreted and JIT compiled fashions.</dd>
+ 
+   <dt><tt><b>llvm/lib/Support/</b></tt></dt>
+   <dd> This directory contains the source code that corresponds to the header 
+   files located in <tt>llvm/include/llvm/Support/</tt>.</dd>
+ 
+   <dt><tt><b>llvm/lib/System/</b></tt></dt>
+   <dd>This directory contains the operating system abstraction layer that
+   shields LLVM from platform-specific coding.</dd>
+ </dl>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="projects"><tt>llvm/projects</tt></a></div>
+ <div class="doc_text">
+   <p>This directory contains projects that are not strictly part of LLVM but are
+   shipped with LLVM. This is also the directory where you should create your own
+   LLVM-based projects. See <tt>llvm/projects/sample</tt> for an example of how
+   to set up your own project. See <tt>llvm/projects/Stacker</tt> for a fully 
+   functional example of a compiler front end.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="runtime"><tt>llvm/runtime</tt></a></div>
+ <div class="doc_text">
+ 
+ <p>This directory contains libraries which are compiled into LLVM bytecode and
+ used when linking programs with the GCC front end.  Most of these libraries are
+ skeleton versions of real libraries; for example, libc is a stripped down
+ version of glibc.</p>
+ 
+ <p>Unlike the rest of the LLVM suite, this directory needs the LLVM GCC front
+ end to compile.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="test"><tt>llvm/test</tt></a></div>
+ <div class="doc_text">
+   <p>This directory contains feature and regression tests and other basic sanity
+   checks on the LLVM infrastructure. These are intended to run quickly and cover
+   a lot of territory without being exhaustive.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="llvmtest"><tt>llvm-test</tt></a></div>
+ <div class="doc_text">
+   <p>This is not a directory in the normal llvm module; it is a separate CVS
+   module that must be checked out (usually to <tt>projects/llvm-test</tt>). This
+   module contains a comprehensive correctness, performance, and benchmarking
+   test
+   suite for LLVM. It is a separate CVS module because not every LLVM user is
+   interested in downloading or building such a comprehensive test suite. For
+   further details on this test suite, please see the 
+   <a href="TestingGuide.html">Testing Guide</a> document.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="tools"><tt>llvm/tools</tt></a></div>
+ <div class="doc_text">
+ 
+ <p>The <b>tools</b> directory contains the executables built out of the
+ libraries above, which form the main part of the user interface.  You can
+ always get help for a tool by typing <tt>tool_name --help</tt>.  The
+ following is a brief introduction to the most important tools.  More detailed
+ information is in the <a href="CommandGuide/index.html">Command Guide</a>.</p>
+ 
+ <dl>
+   <dt><tt><b>analyze</b></tt></dt>
+   <dd><tt>analyze</tt> is used to run a specific
+   analysis on an input LLVM bytecode file and print out the results.  It is
+   primarily useful for debugging analyses, or familiarizing yourself with
+   what an analysis does.</dd>
+ 
+   <dt><tt><b>bugpoint</b></tt></dt>
+   <dd><tt>bugpoint</tt> is used to debug
+   optimization passes or code generation backends by narrowing down the
+   given test case to the minimum number of passes and/or instructions that
+   still cause a problem, whether it is a crash or miscompilation. See <a
+   href="HowToSubmitABug.html">HowToSubmitABug.html</a> for more information
+   on using <tt>bugpoint</tt>.</dd>
+ 
+   <dt><tt><b>llvmc</b></tt></dt>
+   <dd>The LLVM Compiler Driver. This program can
+   be configured to utilize both LLVM and non-LLVM compilation tools to enable
+   pre-processing, translation, optimization, assembly, and linking of programs
+   all from one command line. <tt>llvmc</tt> also takes care of processing the
+   dependent libraries found in bytecode. This reduces the need to get the
+   traditional <tt>-l&lt;name&gt;</tt> options right on the command line. Please
+   note that this tool, while functional, is still experimental and not feature
+   complete.</dd>
+ 
+   <dt><tt><b>llvm-ar</b></tt></dt>
+   <dd>The archiver produces an archive containing
+   the given LLVM bytecode files, optionally with an index for faster
+   lookup.</dd>
+   
+   <dt><tt><b>llvm-as</b></tt></dt>
+   <dd>The assembler transforms the human readable LLVM assembly to LLVM 
+   bytecode.</dd>
+ 
+   <dt><tt><b>llvm-dis</b></tt></dt>
+   <dd>The disassembler transforms the LLVM bytecode to human readable 
+   LLVM assembly.</dd>
+ 
+   <dt><tt><b>llvm-ld</b></tt></dt>
+   <dd><tt>llvm-ld</tt> is very similar to gccld and provides a general purpose
+   and extensible linker for LLVM. This is the linker invoked by <tt>llvmc</tt>.
+   It allows optimization modules to be loaded so that language specific
+   optimizations can be applied at link time. This tool is considered
+   experimental.</dd>
+ 
+   <dt><tt><b>llvm-link</b></tt></dt>
+   <dd><tt>llvm-link</tt>, not surprisingly, links multiple LLVM modules into 
+   a single program.</dd>
+   
+   <dt><tt><b>lli</b></tt></dt>
+   <dd><tt>lli</tt> is the LLVM interpreter, which
+   can directly execute LLVM bytecode (although very slowly...). In addition
+   to a simple interpreter, <tt>lli</tt> also has a tracing mode (entered by
+   specifying <tt>-trace</tt> on the command line). Finally, for
+   architectures that support it (currently x86, Sparc, and PowerPC), by default,
+   <tt>lli</tt> will function as a Just-In-Time compiler (if the
+   functionality was compiled in), and will execute the code <i>much</i>
+   faster than the interpreter.</dd>
+ 
+   <dt><tt><b>llc</b></tt></dt>
+   <dd> <tt>llc</tt> is the LLVM backend compiler, which
+   translates LLVM bytecode to a native code assembly file or to C code (with
+   the -march=c option).</dd>
+ 
+   <dt><tt><b>llvm-gcc</b></tt></dt>
+   <dd><tt>llvm-gcc</tt> is a GCC-based C frontend
+   that has been retargeted to emit LLVM code as the machine code output.  It
+   works just like any other GCC compiler, taking the usual <tt>-c, -S, -E,
+   -o</tt> options.  The source code for the
+   <tt>llvm-gcc</tt> tool is available as a separate CVS module.
+   <blockquote>
+     <dl>
+       <dt><tt><b>gccas</b></tt></dt>
+       <dd>This tool is invoked by the <tt>llvm-gcc</tt> frontend as the 
+       "assembler" part of the compiler.  This tool actually assembles LLVM 
+       assembly to LLVM bytecode, performs a variety of optimizations, and 
+       outputs LLVM bytecode.  Thus when you invoke 
+       <tt>llvm-gcc -c x.c -o x.o</tt>, you are causing <tt>gccas</tt> to be 
+       run, which writes the <tt>x.o</tt> file (which is an LLVM bytecode file 
+       that can be disassembled or manipulated just like any other bytecode 
+       file).  The command line interface to <tt>gccas</tt> is designed to be 
+       as close as possible to the <b>system</b> `<tt>as</tt>' utility so that 
+       the gcc frontend itself did not have to be modified to interface to 
+       a "weird" assembler.</dd>
+ 
+       <dt><tt><b>gccld</b></tt></dt>
+       <dd><tt>gccld</tt> links together several LLVM bytecode files into one 
+       bytecode file and does some optimization.  It is the linker invoked by 
+       the GCC frontend when multiple .o files need to be linked together.  
+       Like <tt>gccas</tt>, the command line interface of <tt>gccld</tt> is 
+       designed to match the system linker, to aid interfacing with the GCC 
+       frontend.</dd>
+     </dl>
+   </blockquote>
+   </dd>
+ 
+   <dt><tt><b>opt</b></tt></dt>
+   <dd><tt>opt</tt> reads LLVM bytecode, applies a
+   series of LLVM to LLVM transformations (which are specified on the command
+   line), and then outputs the resultant bytecode.  The '<tt>opt --help</tt>'
+   command is a good way to get a list of the program transformations
+   available in LLVM.</dd>
+ </dl>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="utils"><tt>llvm/utils</tt></a></div>
+ <div class="doc_text">
+ 
+ <p>This directory contains utilities for working with LLVM source code, and some
+ of the utilities are actually required as part of the build process because they
+ are code generators for parts of LLVM infrastructure.</p>
+ 
+ <dl>
+   <dt><tt><b>codegen-diff</b></tt> <dd><tt>codegen-diff</tt> is a script
+   that finds differences between code that LLC generates and code that LLI
+   generates. This is a useful tool if you are debugging one of them,
+   assuming that the other generates correct output. For the full user
+   manual, run <tt>`perldoc codegen-diff'</tt>.<p>
+ 
+   <dt><tt><b>cvsupdate</b></tt> <dd><tt>cvsupdate</tt> is a script that will
+   update your CVS tree, but produce a much cleaner and more organized output
+   than simply running <tt>`cvs -z3 up -dP'</tt> will. For example, it will group
+   together all the new and updated files and modified files in separate
+   sections, so you can see at a glance what has changed. If you are at the
+   top of your LLVM CVS tree, running <tt>utils/cvsupdate</tt> is the
+   preferred way of updating the tree.<p>
+ 
+   <dt><tt><b>emacs/</b></tt> <dd>The <tt>emacs</tt> directory contains
+   syntax-highlighting files which will work with Emacs and XEmacs editors,
+   providing syntax highlighting support for LLVM assembly files and TableGen
+   description files. For information on how to use the syntax files, consult
+   the <tt>README</tt> file in that directory.<p>
+ 
+   <dt><tt><b>getsrcs.sh</b></tt> <dd>The <tt>getsrcs.sh</tt> script finds
+   and outputs all non-generated source files, which is useful if one wishes
+   to do a lot of development across directories and does not want to
+   individually find each file. One way to use it is to run, for example:
+   <tt>xemacs `utils/getsrcs.sh`</tt> from the top of your LLVM source
+   tree.<p>
+   
+   <dt><tt><b>llvmgrep</b></tt></dt>
+   <dd>This little tool performs an "egrep -H -n" on each source file in LLVM and
+   passes to it a regular expression provided on <tt>llvmgrep</tt>'s command
+   line. This is a very efficient way of searching the source base for a
+   particular regular expression.</dd>
+ 
+   <dt><tt><b>makellvm</b></tt> <dd>The <tt>makellvm</tt> script compiles all
+   files in the current directory and then compiles and links the tool that
+   is the first argument. For example, assuming you are in the directory
+   <tt>llvm/lib/Target/Sparc</tt>, if <tt>makellvm</tt> is in your path,
+   simply running <tt>makellvm llc</tt> will make a build of the current
+   directory, switch to directory <tt>llvm/tools/llc</tt> and build it,
+   causing a re-linking of LLC.<p>
+ 
+   <dt><tt><b>NightlyTest.pl</b></tt> and
+   <tt><b>NightlyTestTemplate.html</b></tt> <dd>These files are used in a
+   cron script to generate nightly status reports of the functionality of
+   tools, and the results can be seen by following the appropriate link on
+   the <a href="http://llvm.org/">LLVM homepage</a>.<p>
+ 
+   <dt><tt><b>TableGen/</b></tt> <dd>The <tt>TableGen</tt> directory contains
+   the tool used to generate register descriptions, instruction set
+   descriptions, and even assemblers from common TableGen description
+   files.<p>
+ 
+   <dt><tt><b>vim/</b></tt> <dd>The <tt>vim</tt> directory contains
+   syntax-highlighting files which will work with the VIM editor, providing
+   syntax highlighting support for LLVM assembly files and TableGen
+   description files. For information on how to use the syntax files, consult
+   the <tt>README</tt> file in that directory.<p>
+ 
+ </dl>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="win32"><tt>llvm/win32</tt></a></div>
+ <div class="doc_text">
+   <p>This directory contains build scripts and project files for use with 
+   Visual C++. This allows developers on Windows to build LLVM without the need
+   for Cygwin. The contents of this directory should be considered experimental
+   at this time.
+   </p>
+ </div>
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="tutorial">An Example Using the LLVM Tool Chain</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <ol>
+   <li>First, create a simple C file, name it 'hello.c':
+        <pre>
+    #include &lt;stdio.h&gt;
+    int main() {
+      printf("hello world\n");
+      return 0;
+    }
+        </pre></li>
+ 
+   <li><p>Next, compile the C file into an LLVM bytecode file:</p>
+       <p><tt>% llvm-gcc hello.c -o hello</tt></p>
+ 
+       <p>Note that you should have already built the tools and they have to be
+       in your path, at least <tt>gccas</tt> and <tt>gccld</tt>.</p>
+ 
+       <p>This will create two result files: <tt>hello</tt> and
+       <tt>hello.bc</tt>. The <tt>hello.bc</tt> is the LLVM bytecode that
+       corresponds to the compiled program and the library facilities that it
+       required.  <tt>hello</tt> is a simple shell script that runs the bytecode
+       file with <tt>lli</tt>, making the result directly executable.  Note that
+       all LLVM optimizations are enabled by default, so there is no need for a 
+       "-O3" switch.</p></li>
+ 
+   <li><p>Run the program. To make sure the program ran, execute one of the
+       following commands:</p>
+       
+       <p><tt>% ./hello</tt></p>
+  
+       <p>or</p>
+ 
+       <p><tt>% lli hello.bc</tt></p></li>
+ 
+   <li><p>Use the <tt>llvm-dis</tt> utility to take a look at the LLVM assembly
+       code:</p>
+ 
+       <p><tt>% llvm-dis &lt; hello.bc | less</tt></p></li>
+ 
+   <li><p>Compile the program to native assembly using the LLC code
+       generator:</p>
+ 
+       <p><tt>% llc hello.bc -o hello.s</tt></p>
+ 
+   <li><p>Assemble the native assembly language file into a program:</p>
+ 
+       <p><b>Solaris:</b><tt>% /opt/SUNWspro/bin/cc -xarch=v9 hello.s -o hello.native</tt></p>
+       <p><b>Others:</b><tt>% gcc hello.s -o hello.native</tt></p>
+ 
+   <li><p>Execute the native code program:</p>
+ 
+       <p><tt>% ./hello.native</tt></p></li>
+ 
+ </ol>
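+ <p>The tutorial steps can also be collected into one dry-run sketch. The
+ helper name below is hypothetical; the function only prints the commands from
+ the list above for a given base name, and it uses the generic <tt>gcc</tt>
+ command for the final assembly step.</p>

```shell
#!/bin/sh
# Hypothetical helper: prints the tutorial's commands for a C source file
# named <base>.c without running any of them.
llvm_tutorial_plan() {
  base=$1
  echo "llvm-gcc $base.c -o $base"       # produces $base and $base.bc
  echo "lli $base.bc"                    # run the bytecode with lli
  echo "llvm-dis < $base.bc | less"      # inspect the LLVM assembly
  echo "llc $base.bc -o $base.s"         # compile to native assembly
  echo "gcc $base.s -o $base.native"     # assemble and link (non-Solaris form)
  echo "./$base.native"                  # run the native program
}

llvm_tutorial_plan hello
```

+ <p>Printing the commands rather than running them keeps the sketch usable on
+ a machine where the LLVM tools are not yet on the path.</p>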
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="problems">Common Problems</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>If you are having problems building or using LLVM, or if you have any other
+ general questions about LLVM, please consult the <a href="FAQ.html">Frequently
+ Asked Questions</a> page.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="links">Links</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document is just an <b>introduction</b> to how to use LLVM to do
+ some simple things... there are many more interesting and complicated things
+ that you can do that aren't documented here (but we'll gladly accept a patch
+ if you want to write something up!).  For more information about LLVM, check
+ out:</p>
+ 
+ <ul>
+   <li><a href="http://llvm.org/">LLVM homepage</a></li>
+   <li><a href="http://llvm.org/doxygen/">LLVM doxygen tree</a></li>
+   <li><a href="http://llvm.org/docs/Projects.html">Starting a Project
+   that Uses LLVM</a></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.x10sys.com/rspencer/">Reid Spencer</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/GettingStartedVS.html
diff -c /dev/null llvm-www/releases/1.8/docs/GettingStartedVS.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/GettingStartedVS.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,353 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>Getting Started with the LLVM System for Microsoft Visual Studio</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   Getting Started with the LLVM System using Microsoft Visual Studio
+ </div>
+ 
+ <ul>
+   <li><a href="#overview">Overview</a>
+   <li><a href="#quickstart">Getting Started Quickly (A Summary)</a>
+   <li><a href="#requirements">Requirements</a>
+     <ol>
+       <li><a href="#hardware">Hardware</a>
+       <li><a href="#software">Software</a>
+     </ol></li>
+ 
+   <li><a href="#starting">Getting Started with LLVM</a>
+     <ol>
+       <li><a href="#terminology">Terminology and Notation</a>
+       <li><a href="#objfiles">The Location of LLVM Object Files</a>
+     </ol></li>
+ 
+   <li><a href="#tutorial">An Example Using the LLVM Tool Chain</a>
+   <li><a href="#problems">Common Problems</a>
+   <li><a href="#links">Links</a>
+ </ul>
+ 
+ <div class="doc_author">
+   <p>Written by: 
+     <a href="mailto:jeffc at jolt-lang.org">Jeff Cohen</a>
+   </p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="overview"><b>Overview</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+   <p>The Visual Studio port at this time is experimental.  It is suitable for
+   use only if you are writing your own compiler front end or otherwise have a
+   need to dynamically generate machine code.  The JIT and interpreter are
+   functional, but it is currently not possible to generate assembly code which
+   is then assembled into an executable.  You can indirectly create executables
+   by using the C back end.</p>
+ 
+   <p>To emphasize, there is no C/C++ front end currently available.
+   <tt>llvm-gcc</tt> is based on GCC, which cannot be bootstrapped using VC++.
+   Eventually there should be a <tt>llvm-gcc</tt> based on Cygwin or MinGW that
+   is usable.  There is also the option of generating bytecode files on Unix and
+   copying them over to Windows.  But be aware the odds of linking C++ code
+   compiled with <tt>llvm-gcc</tt> with code compiled with VC++ is essentially
+   zero.</p>
+ 
+   <p>The LLVM test suite cannot be run on the Visual Studio port at this
+   time.</p>
+ 
+   <p>Most of the tools build and work.  <tt>llvm-db</tt> does not build at this
+   time.  <tt>bugpoint</tt> does build, but does not work.</p>
+ 
+   <p>Additional information about the LLVM directory structure and tool chain
+   can be found on the main <a href="GettingStarted.html">Getting Started</a>
+   page.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="quickstart"><b>Getting Started Quickly (A Summary)</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Here's the short story for getting up and running quickly with LLVM:</p>
+ 
+ <ol>
+   <li>Read the documentation.</li>
+   <li>Read the documentation.</li>
+   <li>Remember that you were warned twice about reading the documentation.</li>
+ 
+   <li>Get the Source Code
+   <ul>
+     <li>With the distributed files:
+     <ol>
+       <li><tt>cd <i>where-you-want-llvm-to-live</i></tt>
+       <li><tt>gunzip --stdout llvm-<i>version</i>.tar.gz | tar -xvf -</tt>
+       <i>      or use WinZip</i>
+       <li><tt>cd llvm</tt></li>
+     </ol></li>
+ 
+     <li>With anonymous CVS access:
+     <ol>
+       <li><tt>cd <i>where-you-want-llvm-to-live</i></tt></li>
+       <li><tt>cvs -d
+           :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm login</tt></li>
+       <li>Hit the return key when prompted for the password.
+       <li><tt>cvs -z3 -d :pserver:anon@llvm-cvs.cs.uiuc.edu:/var/cvs/llvm 
+           co llvm</tt></li>
+       <li><tt>cd llvm</tt></li>
+       <li><tt>cvs up -P -d</tt></li>
+     </ol></li>
+   </ul></li>
+ 
+   <li>Start Visual Studio
+   <ol>
+     <li>Simply double click on the solution file <tt>llvm/win32/llvm.sln</tt>.
+     </li>
+   </ol></li>
+ 
+   <li>Build the LLVM Suite:
+   <ol>
+     <li>Simply build the solution.</li>
+     <li>The Fibonacci project is a sample program that uses the JIT.  Modify
+     the project's debugging properties to provide a numeric command line
+     argument.  The program will print the corresponding Fibonacci value.</li>
+   </ol></li>
+ 
+ </ol>
+ 
+ <p>It is strongly encouraged that you get the latest version from CVS.  Much
+ progress has been made since the 1.4 release.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="requirements"><b>Requirements</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+   <p>Before you begin to use the LLVM system, review the requirements given
+   below.  This may save you some trouble by knowing ahead of time what hardware
+   and software you will need.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="hardware"><b>Hardware</b></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+   <p>Any system that can adequately run Visual Studio .NET 2003 is fine.  The
+   LLVM source tree and object files, libraries and executables will consume
+   approximately 3GB.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="software"><b>Software</b></a></div>
+ <div class="doc_text">
+ 
+   <p>You will need Visual Studio .NET 2003.  Earlier versions cannot open the
+   solution/project files.  The VS 2005 beta can, but will migrate these files
+   to its own format in the process.  While it should work with the VS 2005
+   beta, there are no guarantees and there is no support for it at this time.
+   It has been reported that VC++ Express also works.</p>
+ 
+   <p>If you plan to modify any .y or .l files, you will need to have bison
+   and/or flex installed where Visual Studio can find them.  Otherwise, you do
+   not need them and the pre-generated files that come with the source tree
+   will be used.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="starting"><b>Getting Started with LLVM</b></a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The remainder of this guide is meant to get you up and running with
+ LLVM using Visual Studio and to give you some basic information about the LLVM
+ environment.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="terminology">Terminology and Notation</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Throughout this manual, the following names are used to denote paths
+ specific to the local system and working environment.  <i>These are not
+ environment variables you need to set but just strings used in the rest
+ of this document below</i>.  In any of the examples below, simply replace
+ each of these names with the appropriate pathname on your local system.
+ All these paths are absolute:</p>
+ 
+ <dl>
+     <dt>SRC_ROOT
+     <dd>
+     This is the top level directory of the LLVM source tree.
+     <p>
+ 
+     <dt>OBJ_ROOT
+     <dd>
+     This is the top level directory of the LLVM object tree (i.e. the
+     tree where object files and compiled programs will be placed).  It
+     is fixed at SRC_ROOT/win32.
+     <p>
+ </dl>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="objfiles">The Location of LLVM Object Files</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+   <p>The object files are placed under <tt>OBJ_ROOT/Debug</tt> for debug builds
+   and <tt>OBJ_ROOT/Release</tt> for release (optimized) builds.  These include
+   both executables and libraries that your application can link against.
+ 
+   <p>The files that <tt>configure</tt> would create when building on Unix are
+   created by the <tt>Configure</tt> project and placed in
+   <tt>OBJ_ROOT/llvm</tt>.  Your application must have OBJ_ROOT in its include
+   search path just before <tt>SRC_ROOT/include</tt>.
+ 
+ </div>
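+ <p>As a rough illustration of the layout described above, the sketch below
+ derives <tt>OBJ_ROOT</tt>, the per-configuration output directory, and the
+ include search order from <tt>SRC_ROOT</tt>.  The function name
+ <tt>llvm_vs_paths</tt> and the example path are assumptions made for this
+ sketch; they are not part of LLVM.</p>

```python
from pathlib import PureWindowsPath


def llvm_vs_paths(src_root, config="Debug"):
    """Derive the documented paths from SRC_ROOT (hypothetical helper)."""
    if config not in ("Debug", "Release"):
        raise ValueError("config must be 'Debug' or 'Release'")
    src = PureWindowsPath(src_root)
    obj = src / "win32"  # OBJ_ROOT is fixed at SRC_ROOT/win32
    return {
        "obj_root": obj,
        # Executables and libraries land under OBJ_ROOT/Debug or OBJ_ROOT/Release.
        "output_dir": obj / config,
        # OBJ_ROOT must appear just before SRC_ROOT/include in the search path.
        "include_path": [obj, src / "include"],
    }


if __name__ == "__main__":
    for key, value in llvm_vs_paths(r"C:\llvm", "Release").items():
        print(key, value)
```

+ <p>The returned <tt>include_path</tt> list is already in the order an
+ application's include settings would need.</p>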
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="tutorial">An Example Using the LLVM Tool Chain</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <ol>
+   <li>First, create a simple C file, name it 'hello.c':
+        <pre>
+    #include <stdio.h>
+    int main() {
+      printf("hello world\n");
+      return 0;
+    }
+        </pre></li>
+ 
+   <li><p>Next, compile the C file into an LLVM bytecode file:</p>
+       <p><tt>% llvm-gcc hello.c -o hello</tt></p>
+ 
+       <p>Note that you should have already built the tools and they have to be
+       in your path, at least <tt>gccas</tt> and <tt>gccld</tt>.</p>
+ 
+       <p>This will create two result files: <tt>hello</tt> and
+       <tt>hello.bc</tt>. The <tt>hello.bc</tt> is the LLVM bytecode that
+       corresponds to the compiled program and the library facilities that it
+       required.  <tt>hello</tt> is a simple shell script that runs the bytecode
+       file with <tt>lli</tt>, making the result directly executable.  Note that
+       all LLVM optimizations are enabled by default, so there is no need for a 
+       "-O3" switch.</p>
+       
+       <p><b>Note: while you cannot do this step on Windows, you can do it on a
+         Unix system and transfer <tt>hello.bc</tt> to Windows.</b></p></li>
+ 
+   <li><p>Run the program using the just-in-time compiler:</p>
+       
+       <p><tt>% lli hello.bc</tt></p></li>
+ 
+   <li><p>Use the <tt>llvm-dis</tt> utility to take a look at the LLVM assembly
+       code:</p>
+ 
+       <p><tt>% llvm-dis < hello.bc | more</tt></p></li>
+ 
+   <li><p>Compile the program to C using the LLC code generator:</p>
+ 
+       <p><tt>% llc -march=c hello.bc</tt></p></li>
+ 
+   <li><p>Compile to binary using Microsoft C:</p>
+ 
+       <p><tt>% cl hello.cbe.c</tt></p></li>
+ 
+   <li><p>Execute the native code program:</p>
+ 
+       <p><tt>% hello.cbe.exe</tt></p></li>
+ 
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="problems">Common Problems</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>If you are having problems building or using LLVM, or if you have any other
+ general questions about LLVM, please consult the <a href="FAQ.html">Frequently
+ Asked Questions</a> page.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="links">Links</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document is just an <b>introduction</b> to how to use LLVM to do
+ some simple things... there are many more interesting and complicated things
+ that you can do that aren't documented here (but we'll gladly accept a patch
+ if you want to write something up!).  For more information about LLVM, check
+ out:</p>
+ 
+ <ul>
+   <li><a href="http://llvm.org/">LLVM homepage</a></li>
+   <li><a href="http://llvm.org/doxygen/">LLVM doxygen tree</a></li>
+   <li><a href="http://llvm.org/docs/Projects.html">Starting a Project
+   that Uses LLVM</a></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:jeffc at jolt-lang.org">Jeff Cohen</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/HowToReleaseLLVM.html
diff -c /dev/null llvm-www/releases/1.8/docs/HowToReleaseLLVM.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/HowToReleaseLLVM.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,474 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>How To Release LLVM To The Public</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">How To Release LLVM To The Public</div>
+ <p class="doc_warning">NOTE: THIS DOCUMENT IS A WORK IN PROGRESS!</p>
+ <ol>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#process">Release Process</a></li>
+   <li><a href="#dist_targets">Distribution Targets</a></li>
+ </ol>
+ <div class="doc_author">
+   <p>Written by <a href="mailto:rspencer at x10sys.com">Reid Spencer</a>,
+   <a href="mailto:criswell at cs.uiuc.edu">John Criswell</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="introduction">Introduction</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>This document collects information about successfully releasing LLVM to the
+ public. It is the release manager's guide to ensuring that a high quality build
+ of LLVM is released. Mostly, it's just a bunch of reminders of things to do at
+ release time so we don't inadvertently ship something that is utility 
+ deficient.</p>
+ 
+ <p>
+ There are three main tasks for building a release of LLVM:
+ <ol>
+   <li>Create the LLVM source distribution.</li>
+   <li>Create the LLVM GCC source distribution.</li>
+   <li>Create a set of LLVM GCC binary distributions for each supported
+       platform.  These binary distributions must include compiled versions
+       of the libraries found in <tt>llvm/runtime</tt> from the LLVM
+       source distribution created in Step 1.</li>
+ </ol>
+ </p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="process">Release Process</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="overview">Process Overview</a></div>
+ <div class="doc_text">
+   <ol>
+     <li><a href="#updocs">Update Documentation</a></li>
+     <li><a href="#merge">Merge Branches</a></li>
+     <li><a href="#deps">Make LibDeps.txt</a></li>
+     <li><a href="#settle">Settle LLVM HEAD</a></li>
+     <li><a href="#tag">Tag LLVM and Create the Release Branch</a></li>
+     <li><a href="#build">Build LLVM</a></li>
+     <li><a href="#check">Run 'make check'</a></li>
+     <li><a href="#test">Run LLVM Test Suite</a></li>
+     <li><a href="#dist">Build the LLVM Source Distributions</a></li>
+     <li><a href="#llvmgccbin">Build the LLVM GCC Binary Distribution</a></li>
+   </ol>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="updocs">Update Documentation</a></div>
+ <div class="doc_text">
+   <p>
+   Review the documentation and ensure that it is up to date.  The Release Notes
+   must be updated to reflect bug fixes, new known issues, and changes in the
+   list of supported platforms.  The Getting Started Guide should be updated to
+   reflect the new release version number tag available from CVS and changes in
+   basic system requirements.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="merge">Merge Branches</a></div>
+ <div class="doc_text">
+ <p>
+ Merge any work done on branches intended for release into mainline. Finish and
+ commit all new features or bug fixes that are scheduled to go into the release.
+ Work that is not to be incorporated into the release should not be merged from
+ branches or committed from developers' working directories.
+ </p>
+ 
+ <p>
+ From this point until the release branch is created, developers should
+ <em>not</em>
+ commit changes to the llvm and llvm-gcc CVS repositories unless it is a bug
+ fix <em>for the release</em>.
+ </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="deps">Make LibDeps.txt</a></div>
+ <div class="doc_text">
+   <p>Rebuild the <tt>LibDeps.txt</tt> target in <tt>utils/llvm-config</tt>. This
+   makes sure that the <tt>llvm-config</tt> utility remains relevant for the
+   release, reflecting any changes in the library dependencies.</p>
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="settle">Settle CVS HEAD</a></div>
+ <div class="doc_text">
+   <p>
+   Use the nightly test reports and 'make check' (deja-gnu based tests) to 
+   ensure that recent changes and merged branches have not destabilized LLVM.
+   Platforms which are used less often should be given special attention as they
+   are the most likely to break from commits from the previous step.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="tag">CVS Tag And Branch</a></div>
+ <div class="doc_text">
+   <p>Tag and branch the CVS HEAD using the following procedure:</p>
+   <ol>
+     <li>
+     Request all developers to refrain from committing. Offenders get commit
+     rights taken away (temporarily).
+     </li>
+ 
+     <li>
+     The Release Manager updates his/her llvm, llvm-test, and llvm-gcc source
+     trees with the
+     latest sources from mainline CVS.  The Release Manager may want to consider
+     using a new working directory for this to keep current uncommitted work
+     separate from release work.
+     </li>
+ 
+     <li>
+     The Release Manager tags his/her llvm, llvm-test, and llvm-gcc working
+     directories with
+     "ROOT_RELEASE_XX" where XX is the major and minor
+     release numbers (you can't have . in a cvs tag name). So, for Release 1.2,
+     XX=12 and for Release 1.10, XX=110.
+     </li>
+ 
+     <li>
+     Immediately create cvs branches based on the ROOT_RELEASE_XX tag. The tag
+     should be "release_XX" (where XX matches that used for the ROOT_RELEASE_XX
+     tag).  This is where the release distribution will be created.
+     </li>
+ 
+     <li>
+     Advise developers they can work on CVS HEAD again.
+     </li>
+ 
+     <li>
+     The Release Manager and any developers working on the release should switch
+     to the release branch (as all changes to the release will now be done in
+     the branch).  The easiest way to do this is to grab another working copy
+     using the following commands:
+ 
+     <p>
+     <tt>cvs -d <CVS Repository> co -r release_XX llvm</tt><br>
+     <tt>cvs -d <CVS Repository> co -r release_XX llvm-test</tt><br>
+     <tt>cvs -d <CVS Repository> co -r release_XX llvm-gcc</tt><br>
+     </p>
+     </li>
+   </ol>
+ </div>
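+ <p>The tag-naming rule above (drop the dot from the version number) can be
+ sketched as follows.  The helper name <tt>release_tags</tt> is an assumption
+ made for illustration and is not an existing LLVM script.</p>

```python
def release_tags(version):
    """Return (root_tag, branch_tag) for a 'major.minor' release string.

    CVS tag names cannot contain '.', so the major and minor numbers are
    concatenated: 1.2 -> 12, 1.10 -> 110.
    """
    parts = version.split(".")
    if len(parts) != 2 or not all(p.isdigit() for p in parts):
        raise ValueError("expected a version of the form MAJOR.MINOR")
    xx = parts[0] + parts[1]
    return "ROOT_RELEASE_" + xx, "release_" + xx


if __name__ == "__main__":
    for v in ("1.2", "1.10"):
        print(v, release_tags(v))
```

+ <p>Note that this scheme is not reversible in general (1.10 and 11.0 both give
+ XX=110), so the version string should stay the source of truth.</p>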
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="build">Build LLVM</a></div>
+ <div class="doc_text">
+   <p>
+   Build both debug and release (optimized) versions of LLVM on all
+   platforms. Ensure the build is warning and error free on each platform.
+   </p>
+ 
+   <p>
+   Build a new version of the LLVM GCC front-end after building the LLVM tools.
+   Once that is complete, go back to the LLVM source tree and build and install
+   the <tt>llvm/runtime</tt> libraries.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="check">Run 'make check'</a></div>
+ <div class="doc_text">
+   <p>Run <tt>make check</tt> and ensure there are no unexpected failures. If
+   there are, resolve the failures, commit the fixes back into the release branch,
+   and restart testing by <a href="#build">re-building LLVM</a>.
+   </p>
+ 
+   <p>
+   Ensure that 'make check' passes on all platforms for all targets. If certain
+   failures cannot be resolved before release time, determine if marking them
+   XFAIL is appropriate. If not, fix the bug and go back. The test suite must
+   complete with "0 unexpected failures" for release.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="test">LLVM Test Suite</a></div>
+ <div class="doc_text">
+   <p>Run the llvm-test suite and ensure there are no unacceptable failures.
+   If there are, resolve the failures and go back to
+   <a href="#build">re-building LLVM</a>. The test suite
+   should be run in Nightly Test mode. All tests must pass.
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="dist">Build the LLVM Source Distributions</a></div>
+ <div class="doc_text">
+   <p>
+   Create source distributions for LLVM, LLVM GCC, and the LLVM Test Suite by
+   exporting the source
+   from CVS and archiving it.  This can be done with the following commands:
+   </p>
+ 
+   <p>
+   <tt>cvs -d <CVS Repository> export -r release_XX llvm</tt><br>
+   <tt>cvs -d <CVS Repository> export -r release_XX llvm-test</tt><br>
+   <tt>cvs -d <CVS Repository> export -r release_XX llvm-gcc</tt><br>
+   <tt>mkdir cfrontend; mv llvm-gcc cfrontend/src</tt><br>
+   <tt>tar -cvf - llvm          | gzip > llvm-X.X.tar.gz</tt><br>
+   <tt>tar -cvf - llvm-test     | gzip > llvm-test-X.X.tar.gz</tt><br>
+   <tt>tar -cvf - cfrontend/src | gzip > cfrontend-X.X.source.tar.gz</tt><br>
+   </p>
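+ <p>For a concrete release, the command sequence above can be generated rather
+ than typed by hand.  The following is a minimal sketch that only builds the
+ command strings; it does not run them.  The function name
+ <tt>source_dist_commands</tt> and the repository value used in the example are
+ assumptions.</p>

```python
import shlex


def source_dist_commands(cvs_root, version):
    """Build the export/archive commands listed above for one release."""
    branch = "release_" + version.replace(".", "")  # e.g. 1.8 -> release_18
    repo = shlex.quote(cvs_root)
    cmds = [
        "cvs -d %s export -r %s %s" % (repo, branch, module)
        for module in ("llvm", "llvm-test", "llvm-gcc")
    ]
    # The front-end sources are shipped under cfrontend/src.
    cmds.append("mkdir cfrontend; mv llvm-gcc cfrontend/src")
    cmds.append("tar -cvf - llvm | gzip > llvm-%s.tar.gz" % version)
    cmds.append("tar -cvf - llvm-test | gzip > llvm-test-%s.tar.gz" % version)
    cmds.append(
        "tar -cvf - cfrontend/src | gzip > cfrontend-%s.source.tar.gz" % version
    )
    return cmds


if __name__ == "__main__":
    # "/cvsroot" is a placeholder, not the real repository location.
    print("\n".join(source_dist_commands("/cvsroot", "1.8")))
```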
+ 
+   <!-- This is a
+   two step process. First, use "make dist" to simply build the distribution. Any
+   failures need to be corrected (on the branch). Once "make dist" can be
+   successful, do "make dist-check". This target will do the same thing as the
+   'dist' target but also test that distribution to make sure it works. This
+   ensures that needed files are not missing and that the src tarball can be
+   successfully unpacked, built, installed, and cleaned. This two-level testing
+   needs to be done on each target platform.
+   -->
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="llvmgccbin">Build the LLVM GCC Binary Distribution</a></div>
+ <div class="doc_text">
+   <p>
+   Creating the LLVM GCC binary distribution requires performing the following
+   steps for each supported platform:
+   </p>
+ 
+   <ol>
+     <li>
+     Build the LLVM GCC front-end.  The LLVM GCC front-end must be installed in
+     a directory named <tt>cfrontend/<platform>/llvm-gcc</tt>.  For
+     example, the Sparc/Solaris directory is named
+     <tt>cfrontend/sparc/llvm-gcc</tt>.
+     </li>
+ 
+     <li>
+     Build the libraries in <tt>llvm/runtime</tt> and install them into the 
+     created LLVM GCC installation directory.
+     </li>
+ 
+     <li>
+     For systems with non-distributable header files (e.g. Solaris), manually
+     remove header files that the GCC build process has "fixed."  This process
+     is admittedly painful, but not as bad as it looks; these header files are
+     almost always easily identifiable with simple grep expressions and are
+     installed in only a few directories in the GCC installation directory.
+     </li>
+ 
+     <li>
+     Add the copyright files and header file fix script.  These can be found in
+     previous releases of the LLVM-GCC front-end.
+     </li>
+ 
+     <li>
+     Archive and compress the installation directory.
+     </li>
+   </ol>
+ </div>
+ 
+ <!--
+ <div class="doc_subsection"><a name="release">Release</a></div>
+ <div class="doc_text">
+   <p>Release the distribution tarball to the public. This consists of generating
+   several tarballs. The first set, the source distributions, are automatically
+   generated by the "make dist" and "make dist-check". There are gzip, bzip2, and
+   zip versions of these bundles.</p>
+   <p>The second set of tarballs is the binary release. When "make dist-check"
+   succeeds, it will have created an _install directory into which it installed
+   the binary release. You need to rename that directory as "llvm" and then
+   create tarballs from the contents of that "llvm" directory.</p>
+   <p>Finally, use rpm to make an rpm package based on the llvm.spec file. Don't
+   forget to update the version number, documentation, etc. in the llvm.spec
+   file.</p>
+ </div>
+ -->
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="dist_targets">Distribution Targets</a></div>
+ <!-- *********************************************************************** -->
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">Overview</div>
+ <div class="doc_text">
+ <p>The first thing you need to understand is that there are multiple make 
+ targets to support this feature. Here's an overview; we'll delve into the 
+ details later.</p>
+ <ul>
+   <li><b>distdir</b> - builds the distribution directory from which the 
+   distribution will be packaged</li>
+   <li><b>dist</b> - builds each of the distribution tarballs (tar.gz, 
+   tar.bzip2, .zip). These can be built individually as well, with separate 
+   targets.</li>
+   <li><b>dist-check</b> - this is identical to <tt>dist</tt> but includes a 
+   check on the distribution that ensures the tarball can: unpack successfully,
+   compile correctly, pass 'make check', and pass 'make clean'.</li>
+   <li><b>dist-clean</b> - this just does a normal clean but also cleans up the
+   stuff generated by the other three <tt>dist</tt> targets (above).</li>
+ </ul>
+ <p>Okay, that's the basic functionality. When making a release, we want to 
+ ensure that the tree you build the distribution from passes
+ <tt>dist-check</tt>. Beyond fixing the usual bugs, there is generally one 
+ impediment to making the release in this fashion: missing files. The 
+ <tt>dist-check</tt> process guards against that possibility. It will either 
+ fail and that failure will indicate what's missing, or it will succeed 
+ meaning that it has proved that the tarballs can actually succeed in 
+ building LLVM correctly and that it passes <tt>make check</tt>.</p>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">distdir</div>
+ <p>This target builds the distribution directory which is the directory from 
+ which the tarballs are generated. The distribution directory has the same 
+ name as the release (e.g. LLVM-1.7). This target goes through the following 
+ process:
+ <ol>
+   <li>First, if there was an old distribution directory (for the current 
+   release), it is removed in its entirety and you see <tt>Removing old 
+   LLVM-1.7</tt></li>
+   <li>Second, it issues a <tt>make all ENABLE_OPTIMIZED=1</tt> to ensure 
+   that everything in your tree can be built in release mode. Oftentimes 
+   there are discrepancies in building between debug and release modes so it 
+   enforces release mode first. If that fails, the <tt>distdir</tt> target 
+   fails too. This is preceded by the message <tt>Making 'all' to verify 
+   build</tt>.</li>
+   <li>Next, it traverses your source tree and copies it to a new directory 
+   that has the name of the release (<tt>LLVM-M.m</tt> in our current case). 
+   This is the directory that will get tar'd. It contains all the software 
+   that needs to be in the distribution. During the copying process, it omits 
+   generated files, CVS directories, and any other "cruft" that's in your 
+   build tree. This is done to eliminate the possibility of huge distribution 
+   tarballs that include useless or irrelevant stuff in them. This is the 
+   trickiest part of making the distribution. Done manually you will either 
+   include stuff that shouldn't be in the distribution or exclude stuff that 
+   should. This step is preceded by the message <tt>Building Distribution 
+   Directory LLVM-1.7</tt></li>
+   <li>The distribution directory is then traversed and all <tt>CVS</tt> or 
+   <tt>.svn</tt> directories are removed. You see: <tt>Eliminating CVS/.svn 
+   directories from distribution</tt></li>
+   <li>The recursive <tt>dist-hook</tt> target is executed. This gives each 
+   directory a chance to modify the distribution in some way (more on this 
+   below).</li>
+   <li>The distribution directory is traversed and the correct file 
+   permissions and modes are set based on the type of file.</li>
+ </ol>
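+ <p>The step that strips <tt>CVS</tt> and <tt>.svn</tt> directories from the
+ distribution directory can be pictured with the sketch below.  This is not the
+ code in <tt>Makefile.rules</tt>; it is an illustrative equivalent, and the
+ function name <tt>prune_vcs_dirs</tt> is an assumption.</p>

```python
import os
import shutil


def prune_vcs_dirs(dist_dir, names=("CVS", ".svn")):
    """Remove version-control directories below dist_dir; return removed paths."""
    removed = []
    for root, dirs, _files in os.walk(dist_dir):
        for name in list(dirs):
            if name in names:
                path = os.path.join(root, name)
                shutil.rmtree(path)
                dirs.remove(name)  # do not descend into the deleted directory
                removed.append(path)
    return sorted(removed)
```

+ <p>Running it over a scratch copy of a checkout before archiving shows which
+ directories would be dropped; ordinary source files are left untouched.</p>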
+ <p>To control the process of making the distribution directory correctly, 
+ each Makefile can utilize two features:</p>
+ <ol>
+   <li><b><tt>EXTRA_DIST</tt></B> - this make variable specifies which files 
+   it should distribute. By default, all source files are automatically 
+   included for distribution as well as certain <tt>well known</tt> files 
+   (see DistAlways variable in Makefile.rules for details). Each Makefile 
+   specifies, via the <tt>EXTRA_DIST</tt> variable, which additional files 
+   need to be distributed. Only those files that are needed to build LLVM 
+   should be added to <tt>EXTRA_DIST</tt>. <tt>EXTRA_DIST</tt> contains a 
+   list of file or directory names that should be distributed. For example, 
+   the top level Makefile contains 
+   <tt>EXTRA_DIST := test llvm.spec include</tt>. 
+   This means that in addition to regular things that are distributed at the 
+   top level (<tt>CREDITS.txt, LICENSE.txt</tt>, etc.) the distribution should
+   contain the entire <tt>test</tt> and <tt>include</tt> directories as well 
+   as the <tt>llvm.spec</tt> file.</li>
+   <li><b><tt>dist-hook</tt></B> - this make target can be used to alter the 
+   content of the distribution directory. For example, in the top level 
+   Makefile there is some logic to eliminate files in the <tt>include</tt> 
+   subtree that are generated by the configure script. These should not be 
+   distributed. Similarly, any <tt>dist-hook</tt> target found in any 
+   directory can add or remove or modify things just before it gets packaged. 
+   Any transformation is permitted. Generally, not much is needed.
+ </ol>
+ <p>You will see various messages if things go wrong:</p>
+ <ol>
+   <li>During the copying process, any files that are missing will be flagged 
+   with: <tt>===== WARNING: Distribution Source 'dir/file' Not Found!</tt>
+   These must be corrected by either adding the file or removing it from 
+   <tt>EXTRA_DIST</tt>.
+   <li>If you build the distribution with <tt>VERBOSE=1</tt>, then you might 
+   also see: <tt>Skipping non-existent 'dir/file'</tt> in certain cases where 
+   it's okay to skip the file.</li>
+   <li>The target can fail if any of the things it does fail. Error messages 
+   should indicate what went wrong.</li>
+ </ol>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">dist</div>
+ <p>This target does exactly what <tt>distdir</tt> target does, but also 
+ includes assembling the tarballs. There are actually four related targets 
+ here:</p>
+   <ul>
+     <li><b><tt>dist-gzip</tt></b>: package the gzipped distribution tar 
+     file. The distribution directory is packaged into a single file ending in 
+     <tt>.tar.gz</tt> which is gzip compressed.</li>
+     <li><b><tt>dist-bzip2</tt></b>: package the bzip2 distribution tar file. 
+     The distribution directory is packaged into a single file ending in 
+     <tt>.tar.bzip2</tt> which is bzip2 compressed.</li>
+     <li><b><tt>dist-zip</tt></b>: package the zip distribution file. The 
+     distribution directory is packaged into a single file ending in 
+     <tt>.zip</tt> which is zip compressed.</li>
+     <li><b><tt>dist</tt></b>: does all three, dist-gzip, dist-bzip2,
+     dist-zip</li>
+   </ul>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">dist-check</div>
+ <p>This target checks the distribution. The basic idea is that it unpacks the 
+ distribution tarball and ensures that it can build. It takes the following 
+ actions:</p>
+ <ol>
+   <li>It depends on the <tt>dist-gzip</tt> target which, if it hasn't already 
+   been built, builds the gzip tar bundle (see dist and distdir above).</li>
+   <li>removes any pre-existing <tt>_distcheckdir</tt> at the top level.</li>
+   <li>creates a new <tt>_distcheckdir</tt> directory at the top level.</li>
+   <li>creates a <tt>build</tt> subdirectory and an <tt>install</tt> 
+   subdirectory under <tt>_distcheckdir</tt>.</li>
+   <li>unzips and untars the release tarball into <tt>_distcheckdir</tt>, 
+   creating <tt>LLVM-1.7</tt> directory (from the tarball).</li>
+   <li>in the build subdirectory, it configures with appropriate options to build
+   from the unpacked source tarball into the <tt>build</tt> directory with 
+   installation in the <tt>install</tt> directory.</li>
+   <li>runs <tt>make all</tt></li>
+   <li>runs <tt>make </tt><tt>check</tt></li>
+   <li>runs <tt>make install</tt></li>
+   <li>runs <tt>make uninstall</tt></li>
+   <li>runs <tt>make dist</tt></li>
+   <li>runs <tt>make clean</tt></li>
+   <li>runs <tt>make dist-clean</tt></li>
+ </ol>
+ <p>If it can pass all that, the distribution will be deemed 
+ distribution-worthy and you will see:</p>
+ <pre>===== LLVM-1.7.tar.gz Ready For Distribution =====</pre>
+ <p>This means the tarball should then be tested on other platforms and have the
+ nightly test run against it. If those all pass, THEN it is ready for 
+ distribution.</p>
+ <p>
+ A note about disk space: using <tt>dist-check</tt> will easily triple the 
+ amount of disk space your build tree is using. You might want to check 
+ available space before you begin.</p>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">dist-clean</div>
+ <p>In addition to doing a normal <tt>clean</tt>, this target will clean up the 
+ files and directories created by the distribution targets. In particular the 
+ distribution directory (<tt>LLVM-X.X</tt>), check directory 
+ (<tt>_distcheckdir</tt>), and the various tarballs will be removed. You do 
+ this after the release has shipped and you no longer need this stuff in your 
+ build tree.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:rspencer at x10sys.com">Reid Spencer</a><br>
+   <a href="http://llvm.cs.uiuc.edu">The LLVM Compiler Infrastructure</a>
+   <br/>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/HowToSubmitABug.html
diff -c /dev/null llvm-www/releases/1.8/docs/HowToSubmitABug.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/HowToSubmitABug.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,359 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>How to submit an LLVM bug report</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   How to submit an LLVM bug report
+ </div>
+ 
+ <table class="layout" style="width: 90%" >
+ <tr class="layout">
+   <td class="left">
+ <ol>
+   <li><a href="#introduction">Introduction - Got bugs?</a></li>
+   <li><a href="#crashers">Crashing Bugs</a>
+     <ul>
+     <li><a href="#front-end">Front-end bugs</a>
+     <li><a href="#gccas">GCCAS bugs</a>
+     <li><a href="#gccld">GCCLD bugs</a>
+     <li><a href="#passes">Bugs in LLVM passes</a>
+     </ul></li>
+   <li><a href="#miscompilations">Miscompilations</a></li>
+   <li><a href="#codegen">Incorrect code generation (JIT and LLC)</a></li>
+ </ol>
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a> and
+                 <a href="http://misha.brukman.net">Misha Brukman</a></p>
+ </div>
+ </td>
+ <td class="right">
+   <img src="img/Debugging.gif" alt="Debugging" width="444" height="314">
+ </td>
+ </tr>
+ </table>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction - Got bugs?</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>If you're working with LLVM and run into a bug, we definitely want to know
+ about it.  This document describes what you can do to increase the odds of
+ getting it fixed quickly.</p>
+ 
+ <p>Basically you have to do two things at a minimum.  First, decide whether the
+ bug <a href="#crashers">crashes the compiler</a> (or an LLVM pass), or if the
+ compiler is <a href="#miscompilations">miscompiling</a> the program.  Based on
+ what type of bug it is, follow the instructions in the linked section to narrow
+ down the bug so that the person who fixes it will be able to find the problem
+ more easily.</p>
+ 
+ <p>Once you have a reduced test-case, go to <a
+ href="http://llvm.org/bugs/enter_bug.cgi">the LLVM Bug Tracking
+ System</a>, select the category in which the bug falls, and fill out the form
+ with the necessary details.  The bug description should contain the following
+ information:</p>
+ 
+ <ul>
+   <li>All information necessary to reproduce the problem.</li>
+   <li>The reduced test-case that triggers the bug.</li>
+   <li>The location where you obtained LLVM (if not from our CVS
+   repository).</li>
+ </ul>
+ 
+ <p>Thanks for helping us make LLVM better!</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="crashers">Crashing Bugs</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>More often than not, bugs in the compiler cause it to crash - often due to an
+ assertion failure of some sort.  If you are running <tt><b>opt</b></tt> or
+ <tt><b>analyze</b></tt> directly, and something crashes, jump to the section on
+ <a href="#passes">bugs in LLVM passes</a>.  Otherwise, the most important
+ piece of the puzzle is to figure out if it is the GCC-based front-end that is
+ buggy or if it's one of the LLVM tools that has problems.</p>
+ 
+ <p>To figure out which program is crashing (the front-end,
+ <tt><b>gccas</b></tt>, or <tt><b>gccld</b></tt>), run the
+ <tt><b>llvm-gcc</b></tt> command line as you did when the crash occurred, but
+ add a <tt>-v</tt> option to the command line.  The compiler will print out a
+ lot of output, and should end by telling you that one of
+ <tt><b>cc1</b>/<b>cc1plus</b></tt>, <tt><b>gccas</b></tt>, or
+ <tt><b>gccld</b></tt> crashed.</p>
+ 
+ <ul>
+ 
+   <li>If <tt><b>cc1</b></tt> or <tt><b>cc1plus</b></tt> crashed, you found a
+   problem with the front-end.
+   Jump ahead to the section on <a href="#front-end">front-end bugs</a>.</li>
+ 
+   <li>If <tt><b>gccas</b></tt> crashed, you found a bug in <a href="#gccas">one
+   of the passes in <tt><b>gccas</b></tt></a>.</li>
+ 
+   <li>If <tt><b>gccld</b></tt> crashed, you found a bug in <a href="#gccld">one
+   of the passes in <tt><b>gccld</b></tt></a>.</li>
+ 
+   <li>Otherwise, something really weird happened. Email the list with what you
+   have at this point.</li>
+ 
+ </ul>
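+ 
+ <p>The decision list above can be summarized as a small lookup.  The sketch
+ below is only an illustration in Python: the name <tt>triage_crash</tt> and
+ the returned anchors are hypothetical helpers, and it assumes you have already
+ read the name of the crashing program off the <tt>llvm-gcc -v</tt> output
+ yourself.</p>
+ 
```python
# Hypothetical helper: maps the program that crashed (as reported at the end
# of the "llvm-gcc -v" output) to the section of this document to read next.
SECTION_FOR_TOOL = {
    "cc1": "#front-end",
    "cc1plus": "#front-end",
    "gccas": "#gccas",
    "gccld": "#gccld",
}


def triage_crash(crashed_tool):
    """Return the anchor of the section to follow, or None if nothing matches."""
    return SECTION_FOR_TOOL.get(crashed_tool)
```
+ 
+ <p>A result of <tt>None</tt> corresponds to the last case in the list: something
+ unexpected happened, and the best course is to email the list.</p>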
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="front-end">Front-end bugs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If the problem is in the front-end, you should re-run the same
+ <tt>llvm-gcc</tt> command that resulted in the crash, but add the
+ <tt>-save-temps</tt> option.  The compiler will crash again, but it will leave
+ behind a <tt><i>foo</i>.i</tt> file (containing preprocessed C source code) and
+ possibly <tt><i>foo</i>.s</tt> (containing LLVM assembly code), for each
+ compiled <tt><i>foo</i>.c</tt> file. Send us the <tt><i>foo</i>.i</tt> file,
+ along with a brief description of the error it caused. A tool that might help
+ you reduce a front-end testcase to a more manageable size is
+ <a href="http://delta.tigris.org/">delta</a>.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="gccas">GCCAS bugs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If you find that the compiler crashes in the <tt><b>gccas</b></tt> stage of
+ compilation, compile your test-case to a <tt>.s</tt> file with the
+ <tt>-save-temps</tt> option to <tt><b>llvm-gcc</b></tt>. Then run:</p>
+ 
+ <div class="doc_code">
+ <p><tt><b>gccas</b> -debug-pass=Arguments &lt; /dev/null -o - &gt; /dev/null</tt></p>
+ </div>
+ 
+ <p>... which will print a list of arguments, indicating the list of passes that
+ <tt><b>gccas</b></tt> runs.  Once you have the input file and the list of
+ passes, go to the section on <a href="#passes">debugging bugs in LLVM
+ passes</a>.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="gccld">GCCLD bugs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If you find that the compiler crashes in the <tt><b>gccld</b></tt> stage of
+ compilation, gather all of the <tt>.o</tt> bytecode files and libraries that are
+ being linked together (the "<tt><b>llvm-gcc</b> -v</tt>" output should include
+ the full list of objects linked).  Then run:</p>
+ 
+ <div class="doc_code">
+ <p><tt><b>llvm-as</b> &lt; /dev/null &gt; null.bc<br>
+ <b>gccld</b> -debug-pass=Arguments null.bc</tt>
+ </p>
+ </div>
+ 
+ <p>... which will print a list of arguments, indicating the list of passes that
+ <tt><b>gccld</b></tt> runs.  Once you have the input files and the list of
+ passes, go to the section on <a href="#passes">debugging bugs in LLVM
+ passes</a>.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="passes">Bugs in LLVM passes</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>At this point, you should have some number of LLVM assembly files or bytecode
+ files and a list of passes which crash when run on the specified input.  In
+ order to reduce the list of passes (which is probably large) and the input to
+ something tractable, use the <tt><b>bugpoint</b></tt> tool as follows:</p>
+ 
+ <div class="doc_code">
+ <p><tt><b>bugpoint</b> &lt;input files&gt; &lt;list of passes&gt;</tt></p>
+ </div>
+ 
+ <p><tt><b>bugpoint</b></tt> will print a bunch of output as it reduces the
+ test-case, but it should eventually print something like this:</p>
+ 
+ <div class="doc_code">
+ <p><tt>
+ ...<br>
+ Emitted bytecode to 'bugpoint-reduced-simplified.bc'<br>
+ <br>
+ *** You can reproduce the problem with: opt bugpoint-reduced-simplified.bc -licm<br>
+ </tt></p>
+ </div>
+ 
+ <p>Once you complete this, please send the LLVM bytecode file and the command
+ line to reproduce the problem to the llvmbugs mailing list.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="miscompilations">Miscompilations</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>A miscompilation occurs when a pass does not correctly transform a program,
+ thus producing errors that are only noticed during execution. This is different
+ from producing invalid LLVM code (i.e., code not in SSA form, using values
+ before defining them, etc.) which the verifier will check for after a pass
+ finishes its run.</p>
+ 
+ <p>If it looks like the LLVM compiler is miscompiling a program, the very first
+ thing to check is that the program does not rely on undefined behavior.  In
+ particular, check whether the program runs cleanly under <a
+ href="http://valgrind.kde.org/">valgrind</a>, purify, or some
+ other memory checker tool.  Many of the "LLVM bugs" that we have chased down
+ ended up being bugs in the program being compiled, not LLVM.</p>
+ 
+ <p>Once you determine that the program itself is not buggy, you should choose 
+ which code generator you wish to compile the program with (e.g. C backend, the 
+ JIT, or LLC) and optionally a series of LLVM passes to run.  For example:</p>
+ 
+ <div class="doc_code">
+ <p><tt>
+ <b>bugpoint</b> -run-cbe [... optzn passes ...] file-to-test.bc --args -- [program arguments]</tt></p>
+ </div>
+ 
+ <p><tt>bugpoint</tt> will try to narrow down your list of passes to the one pass
+ that causes an error, and simplify the bytecode file as much as it can to assist
+ you. It will print a message letting you know how to reproduce the resulting
+ error.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="codegen">Incorrect code generation</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>As with miscompilations caused by misbehaving passes, you can
+ debug incorrect code generation by either LLC or the JIT using
+ <tt>bugpoint</tt>. In this case, <tt>bugpoint</tt> tries
+ to narrow the code down to a function that is miscompiled by one or the other
+ method.  Because the entire program must still be run to check correctness,
+ <tt>bugpoint</tt> compiles the code it deems unaffected with the C
+ Backend, and then links in the shared object it generates.</p>
+ 
+ <p>To debug the JIT:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ bugpoint -run-jit -output=[correct output file] [bytecode file]  \
+          --tool-args -- [arguments to pass to lli]               \
+          --args -- [program arguments]
+ </pre>
+ </div>
+ 
+ <p>Similarly, to debug LLC, one would run:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ bugpoint -run-llc -output=[correct output file] [bytecode file]  \
+          --tool-args -- [arguments to pass to llc]               \
+          --args -- [program arguments]
+ </pre>
+ </div>
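+ 
+ <p>The two command lines above differ only in the <tt>-run-</tt> option.  As a
+ rough illustration, the Python sketch below assembles the same argument list
+ for either code generator.  The function name <tt>bugpoint_command</tt> and
+ the file names in the usage note are hypothetical; only the options already
+ shown above are used, and the sketch does not run <tt>bugpoint</tt> itself.</p>
+ 
```python
def bugpoint_command(backend, reference_output, bytecode, tool_args, program_args):
    """Build the bugpoint argument list shown above for the JIT or LLC.

    backend is "jit" or "llc"; the other parameters fill in the bracketed
    placeholders of the documented command line.
    """
    if backend not in ("jit", "llc"):
        raise ValueError("this sketch only covers -run-jit and -run-llc")
    return (
        ["bugpoint", "-run-" + backend, "-output=" + reference_output, bytecode]
        + ["--tool-args", "--"] + list(tool_args)
        + ["--args", "--"] + list(program_args)
    )
```
+ 
+ <p>For example, <tt>bugpoint_command("llc", "good.out", "prog.bc", [],
+ ["input.txt"])</tt> yields the LLC form of the command with no extra tool
+ arguments and a single program argument.</p>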
+ 
+ <p><b>Special note:</b> if you are debugging MultiSource or SPEC tests that
+ already exist in the <tt>llvm/test</tt> hierarchy, there is an easier way to
+ debug the JIT, LLC, and CBE, using the pre-written Makefile targets, which
+ will pass the program options specified in the Makefiles:</p>
+ 
+ <div class="doc_code">
+ <p><tt>
+ cd llvm/test/../../program<br>
+ make bugpoint-jit
+ </tt></p>
+ </div>
+ 
+ <p>At the end of a successful <tt>bugpoint</tt> run, you will be presented
+ with two bytecode files: a <em>safe</em> file which can be compiled with the C
+ backend and the <em>test</em> file which either LLC or the JIT
+ mis-codegenerates, and thus causes the error.</p>
+ 
+ <p>To reproduce the error that <tt>bugpoint</tt> found, it is sufficient to do
+ the following:</p>
+ 
+ <ol>
+ 
+ <li><p>Regenerate the shared object from the safe bytecode file:</p>
+ 
+ <div class="doc_code">
+ <p><tt>
+ <b>llc</b> -march=c safe.bc -o safe.c<br>
+ <b>gcc</b> -shared safe.c -o safe.so
+ </tt></p>
+ </div></li>
+ 
+ <li><p>If debugging LLC, compile the test bytecode to native code and link it
+     with the shared object:</p>
+ 
+ <div class="doc_code">
+ <p><tt>
+ <b>llc</b> test.bc -o test.s -f<br>
+ <b>gcc</b> test.s safe.so -o test.llc<br>
+ ./test.llc [program options]
+ </tt></p>
+ </div></li>
+     
+ <li><p>If debugging the JIT, load the shared object and supply the test
+     bytecode:</p>
+ 
+ <div class="doc_code">
+ <p><tt><b>lli</b> -load=safe.so test.bc [program options]</tt></p>
+ </div></li>  
+ 
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a>
+   <br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/LangRef.html
diff -c /dev/null llvm-www/releases/1.8/docs/LangRef.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/LangRef.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,3846 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>LLVM Assembly Language Reference Manual</title>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <meta name="author" content="Chris Lattner">
+   <meta name="description" 
+   content="LLVM Assembly Language Reference Manual.">
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ 
+ <body>
+ 
+ <div class="doc_title"> LLVM Language Reference Manual </div>
+ <ol>
+   <li><a href="#abstract">Abstract</a></li>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#identifiers">Identifiers</a></li>
+   <li><a href="#highlevel">High Level Structure</a>
+     <ol>
+       <li><a href="#modulestructure">Module Structure</a></li>
+       <li><a href="#linkage">Linkage Types</a></li>
+       <li><a href="#callingconv">Calling Conventions</a></li>
+       <li><a href="#globalvars">Global Variables</a></li>
+       <li><a href="#functionstructure">Functions</a></li>
+       <li><a href="#moduleasm">Module-Level Inline Assembly</a></li>
+     </ol>
+   </li>
+   <li><a href="#typesystem">Type System</a>
+     <ol>
+       <li><a href="#t_primitive">Primitive Types</a>    
+         <ol>
+           <li><a href="#t_classifications">Type Classifications</a></li>
+         </ol>
+       </li>
+       <li><a href="#t_derived">Derived Types</a>
+         <ol>
+           <li><a href="#t_array">Array Type</a></li>
+           <li><a href="#t_function">Function Type</a></li>
+           <li><a href="#t_pointer">Pointer Type</a></li>
+           <li><a href="#t_struct">Structure Type</a></li>
+           <li><a href="#t_packed">Packed Type</a></li>
+           <li><a href="#t_opaque">Opaque Type</a></li>
+         </ol>
+       </li>
+     </ol>
+   </li>
+   <li><a href="#constants">Constants</a>
+     <ol>
+       <li><a href="#simpleconstants">Simple Constants</a></li>
+       <li><a href="#aggregateconstants">Aggregate Constants</a></li>
+       <li><a href="#globalconstants">Global Variable and Function Addresses</a></li>
+       <li><a href="#undefvalues">Undefined Values</a></li>
+       <li><a href="#constantexprs">Constant Expressions</a></li>
+     </ol>
+   </li>
+   <li><a href="#othervalues">Other Values</a>
+     <ol>
+       <li><a href="#inlineasm">Inline Assembler Expressions</a></li>
+     </ol>
+   </li>
+   <li><a href="#instref">Instruction Reference</a>
+     <ol>
+       <li><a href="#terminators">Terminator Instructions</a>
+         <ol>
+           <li><a href="#i_ret">'<tt>ret</tt>' Instruction</a></li>
+           <li><a href="#i_br">'<tt>br</tt>' Instruction</a></li>
+           <li><a href="#i_switch">'<tt>switch</tt>' Instruction</a></li>
+           <li><a href="#i_invoke">'<tt>invoke</tt>' Instruction</a></li>
+           <li><a href="#i_unwind">'<tt>unwind</tt>'  Instruction</a></li>
+           <li><a href="#i_unreachable">'<tt>unreachable</tt>' Instruction</a></li>
+         </ol>
+       </li>
+       <li><a href="#binaryops">Binary Operations</a>
+         <ol>
+           <li><a href="#i_add">'<tt>add</tt>' Instruction</a></li>
+           <li><a href="#i_sub">'<tt>sub</tt>' Instruction</a></li>
+           <li><a href="#i_mul">'<tt>mul</tt>' Instruction</a></li>
+           <li><a href="#i_div">'<tt>div</tt>' Instruction</a></li>
+           <li><a href="#i_rem">'<tt>rem</tt>' Instruction</a></li>
+           <li><a href="#i_setcc">'<tt>set<i>cc</i></tt>' Instructions</a></li>
+         </ol>
+       </li>
+       <li><a href="#bitwiseops">Bitwise Binary Operations</a>
+         <ol>
+           <li><a href="#i_and">'<tt>and</tt>' Instruction</a></li>
+           <li><a href="#i_or">'<tt>or</tt>'  Instruction</a></li>
+           <li><a href="#i_xor">'<tt>xor</tt>' Instruction</a></li>
+           <li><a href="#i_shl">'<tt>shl</tt>' Instruction</a></li>
+           <li><a href="#i_shr">'<tt>shr</tt>' Instruction</a></li>
+         </ol>
+       </li>
+       <li><a href="#vectorops">Vector Operations</a>
+         <ol>
+           <li><a href="#i_extractelement">'<tt>extractelement</tt>' Instruction</a></li>
+           <li><a href="#i_insertelement">'<tt>insertelement</tt>' Instruction</a></li>
+           <li><a href="#i_shufflevector">'<tt>shufflevector</tt>' Instruction</a></li>
+           <li><a href="#i_vsetint">'<tt>vsetint</tt>' Instruction</a></li>
+           <li><a href="#i_vsetfp">'<tt>vsetfp</tt>' Instruction</a></li>
+           <li><a href="#i_vselect">'<tt>vselect</tt>' Instruction</a></li>
+         </ol>
+       </li>
+       <li><a href="#memoryops">Memory Access Operations</a>
+         <ol>
+           <li><a href="#i_malloc">'<tt>malloc</tt>'   Instruction</a></li>
+           <li><a href="#i_free">'<tt>free</tt>'     Instruction</a></li>
+           <li><a href="#i_alloca">'<tt>alloca</tt>'   Instruction</a></li>
+           <li><a href="#i_load">'<tt>load</tt>'     Instruction</a></li>
+           <li><a href="#i_store">'<tt>store</tt>'    Instruction</a></li>
+           <li><a href="#i_getelementptr">'<tt>getelementptr</tt>' Instruction</a></li>
+         </ol>
+       </li>
+       <li><a href="#otherops">Other Operations</a>
+         <ol>
+           <li><a href="#i_phi">'<tt>phi</tt>'   Instruction</a></li>
+           <li><a href="#i_cast">'<tt>cast .. to</tt>' Instruction</a></li>
+           <li><a href="#i_select">'<tt>select</tt>' Instruction</a></li>
+           <li><a href="#i_call">'<tt>call</tt>'  Instruction</a></li>
+           <li><a href="#i_va_arg">'<tt>va_arg</tt>'  Instruction</a></li>
+         </ol>
+       </li>
+     </ol>
+   </li>
+   <li><a href="#intrinsics">Intrinsic Functions</a>
+     <ol>
+       <li><a href="#int_varargs">Variable Argument Handling Intrinsics</a>
+         <ol>
+           <li><a href="#i_va_start">'<tt>llvm.va_start</tt>' Intrinsic</a></li>
+           <li><a href="#i_va_end">'<tt>llvm.va_end</tt>'   Intrinsic</a></li>
+           <li><a href="#i_va_copy">'<tt>llvm.va_copy</tt>'  Intrinsic</a></li>
+         </ol>
+       </li>
+       <li><a href="#int_gc">Accurate Garbage Collection Intrinsics</a>
+         <ol>
+           <li><a href="#i_gcroot">'<tt>llvm.gcroot</tt>' Intrinsic</a></li>
+           <li><a href="#i_gcread">'<tt>llvm.gcread</tt>' Intrinsic</a></li>
+           <li><a href="#i_gcwrite">'<tt>llvm.gcwrite</tt>' Intrinsic</a></li>
+         </ol>
+       </li>
+       <li><a href="#int_codegen">Code Generator Intrinsics</a>
+         <ol>
+           <li><a href="#i_returnaddress">'<tt>llvm.returnaddress</tt>' Intrinsic</a></li>
+           <li><a href="#i_frameaddress">'<tt>llvm.frameaddress</tt>'   Intrinsic</a></li>
+           <li><a href="#i_stacksave">'<tt>llvm.stacksave</tt>' Intrinsic</a></li>
+           <li><a href="#i_stackrestore">'<tt>llvm.stackrestore</tt>' Intrinsic</a></li>
+           <li><a href="#i_prefetch">'<tt>llvm.prefetch</tt>' Intrinsic</a></li>
+           <li><a href="#i_pcmarker">'<tt>llvm.pcmarker</tt>' Intrinsic</a></li>
+           <li><a href="#i_readcyclecounter">'<tt>llvm.readcyclecounter</tt>' Intrinsic</a></li>
+         </ol>
+       </li>
+       <li><a href="#int_libc">Standard C Library Intrinsics</a>
+         <ol>
+           <li><a href="#i_memcpy">'<tt>llvm.memcpy.*</tt>' Intrinsic</a></li>
+           <li><a href="#i_memmove">'<tt>llvm.memmove.*</tt>' Intrinsic</a></li>
+           <li><a href="#i_memset">'<tt>llvm.memset.*</tt>' Intrinsic</a></li>
+           <li><a href="#i_isunordered">'<tt>llvm.isunordered.*</tt>' Intrinsic</a></li>
+           <li><a href="#i_sqrt">'<tt>llvm.sqrt.*</tt>' Intrinsic</a></li>
+ 
+         </ol>
+       </li>
+       <li><a href="#int_manip">Bit Manipulation Intrinsics</a>
+         <ol>
+           <li><a href="#i_bswap">'<tt>llvm.bswap.*</tt>' Intrinsics</a></li>
+           <li><a href="#int_ctpop">'<tt>llvm.ctpop.*</tt>' Intrinsic</a></li>
+           <li><a href="#int_ctlz">'<tt>llvm.ctlz.*</tt>' Intrinsic</a></li>
+           <li><a href="#int_cttz">'<tt>llvm.cttz.*</tt>' Intrinsic</a></li>
+         </ol>
+       </li>
+       <li><a href="#int_debugger">Debugger intrinsics</a></li>
+     </ol>
+   </li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a>
+             and <a href="mailto:vadve at cs.uiuc.edu">Vikram Adve</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="abstract">Abstract </a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>This document is a reference manual for the LLVM assembly language. 
+ LLVM is an SSA-based representation that provides type safety,
+ low-level operations, flexibility, and the capability of representing
+ 'all' high-level languages cleanly.  It is the common code
+ representation used throughout all phases of the LLVM compilation
+ strategy.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="introduction">Introduction</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM code representation is designed to be used in three
+ different forms: as an in-memory compiler IR, as an on-disk bytecode
+ representation (suitable for fast loading by a Just-In-Time compiler),
+ and as a human readable assembly language representation.  This allows
+ LLVM to provide a powerful intermediate representation for efficient
+ compiler transformations and analysis, while providing a natural means
+ to debug and visualize the transformations.  The three different forms
+ of LLVM are all equivalent.  This document describes the human readable
+ representation and notation.</p>
+ 
+ <p>The LLVM representation aims to be light-weight and low-level
+ while being expressive, typed, and extensible at the same time.  It
+ aims to be a "universal IR" of sorts, by being at a low enough level
+ that high-level ideas may be cleanly mapped to it (similar to how
+ microprocessors are "universal IRs", allowing many source languages to
+ be mapped to them).  By providing type information, LLVM can be used as
+ the target of optimizations: for example, through pointer analysis, it
+ can be proven that a C automatic variable is never accessed outside of
+ the current function, allowing it to be promoted to a simple SSA
+ value instead of a memory location.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="wellformed">Well-Formedness</a> </div>
+ 
+ <div class="doc_text">
+ 
+ <p>It is important to note that this document describes 'well formed'
+ LLVM assembly language.  There is a difference between what the parser
+ accepts and what is considered 'well formed'.  For example, the
+ following instruction is syntactically okay, but not well formed:</p>
+ 
+ <pre>
+   %x = <a href="#i_add">add</a> int 1, %x
+ </pre>
+ 
+ <p>...because the definition of <tt>%x</tt> does not dominate all of
+ its uses. The LLVM infrastructure provides a verification pass that may
+ be used to verify that an LLVM module is well formed.  This pass is
+ automatically run by the parser after parsing input assembly and by
+ the optimizer before it outputs bytecode.  The violations pointed out
+ by the verifier pass indicate bugs in transformation passes or input to
+ the parser.</p>
+ 
+ <!-- Describe the typesetting conventions here. --> </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="identifiers">Identifiers</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM uses three different forms of identifiers, for different
+ purposes:</p>
+ 
+ <ol>
+   <li>Named values are represented as a string of characters with a '%' prefix.
+   For example, %foo, %DivisionByZero, %a.really.long.identifier.  The actual
+   regular expression used is '<tt>%[a-zA-Z$._][a-zA-Z$._0-9]*</tt>'.
+   Identifiers which require other characters in their names can be surrounded
+   with quotes.  In this way, anything except a <tt>"</tt> character can be used
+   in a name.</li>
+ 
+   <li>Unnamed values are represented as an unsigned numeric value with a '%'
+   prefix.  For example, %12, %2, %44.</li>
+ 
+   <li>Constants, which are described in a <a href="#constants">section about
+   constants</a>, below.</li>
+ </ol>
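+ 
+ <p>To make the first two forms concrete, here is a small Python sketch that
+ classifies a string using the regular expression quoted above.  The function
+ name <tt>classify_identifier</tt> is hypothetical, the pattern used for
+ unnamed values is an assumption based on the description ("an unsigned numeric
+ value with a '%' prefix"), and quoted names are deliberately left out.</p>
+ 
```python
import re

# Pattern for named values, taken from the text above.
NAMED = re.compile(r"%[a-zA-Z$._][a-zA-Z$._0-9]*")
# Assumed pattern for unnamed values: '%' followed by decimal digits.
UNNAMED = re.compile(r"%[0-9]+")


def classify_identifier(text):
    """Return "named", "unnamed", or None if text matches neither form."""
    if NAMED.fullmatch(text):
        return "named"
    if UNNAMED.fullmatch(text):
        return "unnamed"
    return None
```
+ 
+ <p>With these patterns, <tt>%foo</tt> and <tt>%a.really.long.identifier</tt>
+ are classified as named, <tt>%12</tt> as unnamed, and a string without the
+ '%' prefix as neither.</p>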
+ 
+ <p>LLVM requires that values start with a '%' sign for two reasons: Compilers
+ don't need to worry about name clashes with reserved words, and the set of
+ reserved words may be expanded in the future without penalty.  Additionally,
+ unnamed identifiers allow a compiler to quickly come up with a temporary
+ variable without having to avoid symbol table conflicts.</p>
+ 
+ <p>Reserved words in LLVM are very similar to reserved words in other
+ languages. There are keywords for different opcodes ('<tt><a
+ href="#i_add">add</a></tt>', '<tt><a href="#i_cast">cast</a></tt>', '<tt><a
+ href="#i_ret">ret</a></tt>', etc...), for primitive type names ('<tt><a
+ href="#t_void">void</a></tt>', '<tt><a href="#t_uint">uint</a></tt>', etc...),
+ and others.  These reserved words cannot conflict with variable names, because
+ none of them start with a '%' character.</p>
+ 
+ <p>Here is an example of LLVM code to multiply the integer variable
+ '<tt>%X</tt>' by 8:</p>
+ 
+ <p>The easy way:</p>
+ 
+ <pre>
+   %result = <a href="#i_mul">mul</a> uint %X, 8
+ </pre>
+ 
+ <p>After strength reduction:</p>
+ 
+ <pre>
+   %result = <a href="#i_shl">shl</a> uint %X, ubyte 3
+ </pre>
+ 
+ <p>And the hard way:</p>
+ 
+ <pre>
+   <a href="#i_add">add</a> uint %X, %X           <i>; yields {uint}:%0</i>
+   <a href="#i_add">add</a> uint %0, %0           <i>; yields {uint}:%1</i>
+   %result = <a href="#i_add">add</a> uint %1, %1
+ </pre>
+ 
+ <p>This last way of multiplying <tt>%X</tt> by 8 illustrates several
+ important lexical features of LLVM:</p>
+ 
+ <ol>
+ 
+   <li>Comments are delimited with a '<tt>;</tt>' and go until the end of
+   line.</li>
+ 
+   <li>Unnamed temporaries are created when the result of a computation is not
+   assigned to a named value.</li>
+ 
+   <li>Unnamed temporaries are numbered sequentially.</li>
+ 
+ </ol>
+ 
+ <p>...and it also shows a convention that we follow in this document.  When
+ demonstrating instructions, we will follow an instruction with a comment that
+ defines the type and name of value produced.  Comments are shown in italic
+ text.</p>
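+ 
+ <p>The three ways of multiplying <tt>%X</tt> by 8 compute the same value.  The
+ Python sketch below checks this arithmetic outside of LLVM; it assumes that
+ <tt>uint</tt> is a 32-bit unsigned integer that wraps on overflow, and the
+ function names are made up for the illustration.</p>
+ 
```python
UINT_MASK = 0xFFFFFFFF  # assumption: 'uint' is a 32-bit unsigned integer


def mul_easy(x):
    """The easy way: multiply by 8."""
    return (x * 8) & UINT_MASK


def mul_strength_reduced(x):
    """After strength reduction: shift left by 3."""
    return (x << 3) & UINT_MASK


def mul_hard(x):
    """The hard way: three successive doublings."""
    t0 = (x + x) & UINT_MASK    # plays the role of %0
    t1 = (t0 + t0) & UINT_MASK  # plays the role of %1
    return (t1 + t1) & UINT_MASK
```
+ 
+ <p>All three functions agree for every 32-bit input, including inputs whose
+ product overflows and wraps around.</p>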
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="highlevel">High Level Structure</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="modulestructure">Module Structure</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM programs are composed of "Module"s, each of which is a
+ translation unit of the input programs.  Each module consists of
+ functions, global variables, and symbol table entries.  Modules may be
+ combined together with the LLVM linker, which merges function (and
+ global variable) definitions, resolves forward declarations, and merges
+ symbol table entries. Here is an example of the "hello world" module:</p>
+ 
+ <pre><i>; Declare the string constant as a global constant...</i>
+ <a href="#identifiers">%.LC0</a> = <a href="#linkage_internal">internal</a> <a
+  href="#globalvars">constant</a> <a href="#t_array">[13 x sbyte]</a> c"hello world\0A\00"          <i>; [13 x sbyte]*</i>
+ 
+ <i>; External declaration of the puts function</i>
+ <a href="#functionstructure">declare</a> int %puts(sbyte*)                                            <i>; int(sbyte*)* </i>
+ 
+ <i>; Global variable / Function body section separator</i>
+ implementation
+ 
+ <i>; Definition of main function</i>
+ int %main() {                                                        <i>; int()* </i>
+         <i>; Convert [13 x sbyte]* to sbyte*...</i>
+         %cast210 = <a
+  href="#i_getelementptr">getelementptr</a> [13 x sbyte]* %.LC0, long 0, long 0 <i>; sbyte*</i>
+ 
+         <i>; Call puts function to write out the string to stdout...</i>
+         <a
+  href="#i_call">call</a> int %puts(sbyte* %cast210)                              <i>; int</i>
+         <a
+  href="#i_ret">ret</a> int 0<br>}<br></pre>
+ 
+ <p>This example is made up of a <a href="#globalvars">global variable</a>
+ named "<tt>.LC0</tt>", an external declaration of the "<tt>puts</tt>"
+ function, and a <a href="#functionstructure">function definition</a>
+ for "<tt>main</tt>".</p>
+ 
+ <p>In general, a module is made up of a list of global values,
+ where both functions and global variables are global values.  Global values are
+ represented by a pointer to a memory location (in this case, a pointer to an
+ array of char, and a pointer to a function), and have one of the following <a
+ href="#linkage">linkage types</a>.</p>
+ 
+ <p>Due to a limitation in the current LLVM assembly parser (it is limited by
+ one-token lookahead), modules are split into two pieces by the "implementation"
+ keyword.  Global variable prototypes and definitions must occur before the
+ keyword, and function definitions must occur after it.  Function prototypes may
+ occur either before or after it.  In the future, the implementation keyword may
+ become a noop, if the parser gets smarter.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="linkage">Linkage Types</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ All Global Variables and Functions have one of the following types of linkage:
+ </p>
+ 
+ <dl>
+ 
+   <dt><tt><b><a name="linkage_internal">internal</a></b></tt>: </dt>
+ 
+   <dd>Global values with internal linkage are only directly accessible by
+   objects in the current module.  In particular, linking code into a module with
+   an internal global value may cause the internal to be renamed as necessary to
+   avoid collisions.  Because the symbol is internal to the module, all
+   references can be updated.  This corresponds to the notion of the
+   '<tt>static</tt>' keyword in C, or the idea of "anonymous namespaces" in C++.
+   </dd>
+ 
+   <dt><tt><b><a name="linkage_linkonce">linkonce</a></b></tt>: </dt>
+ 
+   <dd>"<tt>linkonce</tt>" linkage is similar to <tt>internal</tt> linkage, with
+   the twist that linking together two modules defining the same
+   <tt>linkonce</tt> globals will cause one of the globals to be discarded.  This
+   is typically used to implement inline functions.  Unreferenced
+   <tt>linkonce</tt> globals are allowed to be discarded.
+   </dd>
+ 
+   <dt><tt><b><a name="linkage_weak">weak</a></b></tt>: </dt>
+ 
+   <dd>"<tt>weak</tt>" linkage is exactly the same as <tt>linkonce</tt> linkage,
+   except that unreferenced <tt>weak</tt> globals may not be discarded.  This is
+   used to implement constructs in C such as "<tt>int X;</tt>" at global scope.
+   </dd>
+ 
+   <dt><tt><b><a name="linkage_appending">appending</a></b></tt>: </dt>
+ 
+   <dd>"<tt>appending</tt>" linkage may only be applied to global variables of
+   pointer to array type.  When two global variables with appending linkage are
+   linked together, the two global arrays are appended together.  This is the
+   LLVM, typesafe, equivalent of having the system linker append together
+   "sections" with identical names when .o files are linked.
+   </dd>
+ 
+   <dt><tt><b><a name="linkage_external">externally visible</a></b></tt>:</dt>
+ 
+   <dd>If none of the above identifiers are used, the global is externally
+   visible, meaning that it participates in linkage and can be used to resolve
+   external symbol references.
+   </dd>
+ </dl>
+ 
+ <p><a name="linkage_external">For example, since the "<tt>.LC0</tt>"
+ variable is defined to be internal, if another module defined a "<tt>.LC0</tt>"
+ variable and was linked with this one, one of the two would be renamed,
+ preventing a collision.  Since "<tt>main</tt>" and "<tt>puts</tt>" are
+ external (i.e., lacking any linkage declarations), they are accessible
+ outside of the current module.  It is illegal for a function <i>declaration</i>
+ to have any linkage type other than "externally visible".</a></p>
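+ 
+ <p>As a rough sketch of how the linkage types above appear in a module (all of
+ the names here are hypothetical and are not taken from the example discussed
+ above):</p>
+ 
+ <pre>
+   %.str    = internal constant [6 x sbyte] c"hello\00"  ; local to this module
+   %Counter = weak global int 0                          ; like C "int Counter;"
+   %Table   = appending global [1 x int] [ int 1 ]       ; appended when linked
+ 
+   linkonce int %square(int %x) {                        ; e.g. an inline function
+     %r = mul int %x, %x
+     ret int %r
+   }
+ 
+   declare int %puts(sbyte*)                             ; externally visible
+ </pre>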
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="callingconv">Calling Conventions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM <a href="#functionstructure">functions</a>, <a href="#i_call">calls</a>
+ and <a href="#i_invoke">invokes</a> can all have an optional calling convention
+ specified for the call.  The calling convention of any pair of dynamic
+ caller/callee must match, or the behavior of the program is undefined.  The
+ following calling conventions are supported by LLVM, and more may be added in
+ the future:</p>
+ 
+ <dl>
+   <dt><b>"<tt>ccc</tt>" - The C calling convention</b>:</dt>
+ 
+   <dd>This calling convention (the default if no other calling convention is
+   specified) matches the target C calling conventions.  This calling convention
+   supports varargs function calls and tolerates some mismatch in the declared
+   prototype and implemented declaration of the function (as does normal C).
+   </dd>
+ 
+   <dt><b>"<tt>csretcc</tt>" - The C struct return calling convention</b>:</dt>
+ 
+   <dd>This calling convention matches the target C calling conventions, except
+   that functions with this convention are required to take a pointer as their
+   first argument, and the return type of the function must be void.  This is
+   used for C functions that return aggregates by-value.  In this case, the
+   function has been transformed to take a pointer to the struct as the first
+   argument to the function.  For targets where the ABI specifies specific
+   behavior for structure-return calls, the calling convention can be used to
+   distinguish between struct return functions and other functions that take a
+   pointer to a struct as the first argument.
+   </dd>
+ 
+   <dt><b>"<tt>fastcc</tt>" - The fast calling convention</b>:</dt>
+ 
+   <dd>This calling convention attempts to make calls as fast as possible
+   (e.g. by passing things in registers).  This calling convention allows the
+   target to use whatever tricks it wants to produce fast code for the target,
+   without having to conform to an externally specified ABI.  Implementations of
+   this convention should allow arbitrary tail call optimization to be supported.
+   This calling convention does not support varargs and requires the prototype of
+   all callees to exactly match the prototype of the function definition.
+   </dd>
+ 
+   <dt><b>"<tt>coldcc</tt>" - The cold calling convention</b>:</dt>
+ 
+   <dd>This calling convention attempts to make code in the caller as efficient
+   as possible under the assumption that the call is not commonly executed.  As
+   such, these calls often preserve all registers so that the call does not break
+   any live ranges in the caller side.  This calling convention does not support
+   varargs and requires the prototype of all callees to exactly match the
+   prototype of the function definition.
+   </dd>
+ 
+   <dt><b>"<tt>cc &lt;<em>n</em>&gt;</tt>" - Numbered convention</b>:</dt>
+ 
+   <dd>Any calling convention may be specified by number, allowing
+   target-specific calling conventions to be used.  Target specific calling
+   conventions start at 64.
+   </dd>
+ </dl>
+ 
+ <p>More calling conventions can be added/defined on an as-needed basis, to
+ support pascal conventions or any other well-known target-independent
+ convention.</p>
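+ 
+ <p>A minimal sketch of a matching caller/callee pair, assuming two hypothetical
+ functions <tt>%add1</tt> and <tt>%caller</tt>.  The convention written on the
+ call is the same one written on the definition:</p>
+ 
+ <pre>
+   fastcc int %add1(int %x) {
+     %r = add int %x, 1
+     ret int %r
+   }
+ 
+   int %caller(int %y) {
+     %v = call fastcc int %add1(int %y)   ; must agree with the callee's convention
+     ret int %v
+   }
+ </pre>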
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="globalvars">Global Variables</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Global variables define regions of memory allocated at compilation time
+ instead of run-time.  Global variables may optionally be initialized, may have
+ an explicit section to be placed in, and may
+ have an optional explicit alignment specified.  A
+ variable may be defined as a global "constant," which indicates that the
+ contents of the variable will <b>never</b> be modified (enabling better
+ optimization, allowing the global data to be placed in the read-only section of
+ an executable, etc).  Note that variables that need runtime initialization
+ cannot be marked "constant" as there is a store to the variable.</p>
+ 
+ <p>
+ LLVM explicitly allows <em>declarations</em> of global variables to be marked
+ constant, even if the final definition of the global is not.  This capability
+ can be used to enable slightly better optimization of the program, but requires
+ the language definition to guarantee that optimizations based on the
+ 'constantness' are valid for the translation units that do not include the
+ definition.
+ </p>
+ 
+ <p>As SSA values, global variables define pointer values that are in
+ scope (i.e. they dominate) all basic blocks in the program.  Global
+ variables always define a pointer to their "content" type because they
+ describe a region of memory, and all memory objects in LLVM are
+ accessed through pointers.</p>
+ 
+ <p>LLVM allows an explicit section to be specified for globals.  If the target
+ supports it, it will emit globals to the section specified.</p>
+ 
+ <p>An explicit alignment may be specified for a global.  If not present, or if
+ the alignment is set to zero, the alignment of the global is set by the target
+ to whatever it feels convenient.  If an explicit alignment is specified, the 
+ global is forced to have at least that much alignment.  All alignments must be
+ a power of 2.</p>
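+ 
+ <p>The following sketch shows these options together.  The global names and the
+ section name <tt>"mydata"</tt> are made up for illustration:</p>
+ 
+ <pre>
+   %Count = global int 0                            ; initialized, writable
+   %Msg   = internal constant [3 x sbyte] c"hi\00"  ; contents never modified
+   %Buf   = global [16 x ubyte] zeroinitializer, section "mydata", align 16
+   %Limit = external constant int                   ; declaration marked constant
+ 
+   int %read() {
+     %v = load int* %Count                          ; %Count has type int*
+     ret int %v
+   }
+ </pre>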
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="functionstructure">Functions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM function definitions consist of an optional <a href="#linkage">linkage
+ type</a>, an optional <a href="#callingconv">calling convention</a>, a return
+ type, a function name, a (possibly empty) argument list, an optional section,
+ an optional alignment, an opening curly brace,
+ a list of basic blocks, and a closing curly brace.  LLVM function declarations
+ are defined with the "<tt>declare</tt>" keyword, an optional <a
+ href="#callingconv">calling convention</a>, a return type, a function name,
+ a possibly empty list of arguments, and an optional alignment.</p>
+ 
+ <p>A function definition contains a list of basic blocks, forming the CFG for
+ the function.  Each basic block may optionally start with a label (giving the
+ basic block a symbol table entry), contains a list of instructions, and ends
+ with a <a href="#terminators">terminator</a> instruction (such as a branch or
+ function return).</p>
+ 
+ <p>The first basic block in a function is special in two ways: it is immediately
+ executed on entrance to the function, and it is not allowed to have predecessor
+ basic blocks (i.e. there cannot be any branches to the entry block of a
+ function).  Because the block can have no predecessors, it also cannot have any
+ <a href="#i_phi">PHI nodes</a>.</p>
+ 
+ <p>LLVM functions are identified by their name and type signature.  Hence, two
+ functions with the same name but different parameter lists or return values are
+ considered different functions, and LLVM will resolve references to each
+ appropriately.</p>
+ 
+ <p>LLVM allows an explicit section to be specified for functions.  If the target
+ supports it, it will emit functions to the section specified.</p>
+ 
+ <p>An explicit alignment may be specified for a function.  If not present, or if
+ the alignment is set to zero, the alignment of the function is set by the target
+ to whatever it feels convenient.  If an explicit alignment is specified, the
+ function is forced to have at least that much alignment.  All alignments must be
+ a power of 2.</p>
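+ 
+ <p>To illustrate the structure described above, here is a small sketch of a
+ declaration and a definition.  The function <tt>%abs</tt> and its label names
+ are hypothetical:</p>
+ 
+ <pre>
+   declare int %puts(sbyte*)
+ 
+   int %abs(int %x) {
+   entry:                                  ; entry block: no predecessors, no PHI nodes
+     %neg = setlt int %x, 0
+     br bool %neg, label %negate, label %done
+   negate:
+     %n = sub int 0, %x
+     br label %done
+   done:
+     %r = phi int [ %x, %entry ], [ %n, %negate ]
+     ret int %r                            ; every block ends with a terminator
+   }
+ </pre>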
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="moduleasm">Module-Level Inline Assembly</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ Modules may contain "module-level inline asm" blocks, which correspond to the
+ GCC "file scope inline asm" blocks.  These blocks are internally concatenated by
+ LLVM and treated as a single unit, but may be separated in the .ll file if
+ desired.  The syntax is very simple:
+ </p>
+ 
+ <div class="doc_code"><pre>
+   module asm "inline asm code goes here"
+   module asm "more can go here"
+ </pre></div>
+ 
+ <p>The strings can contain any character by escaping non-printable characters.
+    The escape sequence used is simply "\xx" where "xx" is the two digit hex code
+    for the number.
+ </p>
+ 
+ <p>
+   The inline asm code is simply printed to the machine code .s file when
+   assembly code is generated.
+ </p>
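+ 
+ <p>
+   As a sketch of the escape syntax, the block below embeds newlines
+   (<tt>\0A</tt>) and a tab (<tt>\09</tt>) in a single string.  The assembly text
+   itself is a hypothetical fragment for an assumed target:
+ </p>
+ 
+ <div class="doc_code"><pre>
+   module asm ".globl helper\0Ahelper:\0A\09ret"
+ </pre></div>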
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="typesystem">Type System</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM type system is one of the most important features of the
+ intermediate representation.  Being typed enables a number of
+ optimizations to be performed on the IR directly, without having to do
+ extra analyses on the side before the transformation.  A strong type
+ system makes it easier to read the generated code and enables novel
+ analyses and transformations that are not feasible to perform on normal
+ three address code representations.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="t_primitive">Primitive Types</a> </div>
+ <div class="doc_text">
+ <p>The primitive types are the fundamental building blocks of the LLVM
+ system. The current set of primitive types is as follows:</p>
+ 
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <table>
+         <tbody>
+         <tr><th>Type</th><th>Description</th></tr>
+         <tr><td><tt>void</tt></td><td>No value</td></tr>
+         <tr><td><tt>ubyte</tt></td><td>Unsigned 8-bit value</td></tr>
+         <tr><td><tt>ushort</tt></td><td>Unsigned 16-bit value</td></tr>
+         <tr><td><tt>uint</tt></td><td>Unsigned 32-bit value</td></tr>
+         <tr><td><tt>ulong</tt></td><td>Unsigned 64-bit value</td></tr>
+         <tr><td><tt>float</tt></td><td>32-bit floating point value</td></tr>
+         <tr><td><tt>label</tt></td><td>Branch destination</td></tr>
+         </tbody>
+       </table>
+     </td>
+     <td class="right">
+       <table>
+         <tbody>
+           <tr><th>Type</th><th>Description</th></tr>
+           <tr><td><tt>bool</tt></td><td>True or False value</td></tr>
+           <tr><td><tt>sbyte</tt></td><td>Signed 8-bit value</td></tr>
+           <tr><td><tt>short</tt></td><td>Signed 16-bit value</td></tr>
+           <tr><td><tt>int</tt></td><td>Signed 32-bit value</td></tr>
+           <tr><td><tt>long</tt></td><td>Signed 64-bit value</td></tr>
+           <tr><td><tt>double</tt></td><td>64-bit floating point value</td></tr>
+         </tbody>
+       </table>
+     </td>
+   </tr>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_classifications">Type
+ Classifications</a> </div>
+ <div class="doc_text">
+ <p>These different primitive types fall into a few useful
+ classifications:</p>
+ 
+ <table border="1" cellspacing="0" cellpadding="4">
+   <tbody>
+     <tr><th>Classification</th><th>Types</th></tr>
+     <tr>
+       <td><a name="t_signed">signed</a></td>
+       <td><tt>sbyte, short, int, long, float, double</tt></td>
+     </tr>
+     <tr>
+       <td><a name="t_unsigned">unsigned</a></td>
+       <td><tt>ubyte, ushort, uint, ulong</tt></td>
+     </tr>
+     <tr>
+       <td><a name="t_integer">integer</a></td>
+       <td><tt>ubyte, sbyte, ushort, short, uint, int, ulong, long</tt></td>
+     </tr>
+     <tr>
+       <td><a name="t_integral">integral</a></td>
+       <td><tt>bool, ubyte, sbyte, ushort, short, uint, int, ulong, long</tt>
+       </td>
+     </tr>
+     <tr>
+       <td><a name="t_floating">floating point</a></td>
+       <td><tt>float, double</tt></td>
+     </tr>
+     <tr>
+       <td><a name="t_firstclass">first class</a></td>
+       <td><tt>bool, ubyte, sbyte, ushort, short, uint, int, ulong, long,<br> 
+       float, double, <a href="#t_pointer">pointer</a>, 
+       <a href="#t_packed">packed</a></tt></td>
+     </tr>
+   </tbody>
+ </table>
+ 
+ <p>The <a href="#t_firstclass">first class</a> types are perhaps the
+ most important.  Values of these types are the only ones which can be
+ produced by instructions, passed as arguments, or used as operands to
+ instructions.  This means that all structures and arrays must be
+ manipulated either by pointer or by component.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="t_derived">Derived Types</a> </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The real power in LLVM comes from the derived types in the system. 
+ This is what allows a programmer to represent arrays, functions,
+ pointers, and other useful types.  Note that these derived types may be
+ recursive: For example, it is possible to have a two dimensional array.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_array">Array Type</a> </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The array type is a very simple derived type that arranges elements
+ sequentially in memory.  The array type requires a size (number of
+ elements) and an underlying data type.</p>
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   [&lt;# elements&gt; x &lt;elementtype&gt;]
+ </pre>
+ 
+ <p>The number of elements is a constant integer value; elementtype may
+ be any type with a size.</p>
+ 
+ <h5>Examples:</h5>
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>[40 x int ]</tt><br/>
+       <tt>[41 x int ]</tt><br/>
+       <tt>[40 x uint]</tt><br/>
+     </td>
+     <td class="left">
+       Array of 40 integer values.<br/>
+       Array of 41 integer values.<br/>
+       Array of 40 unsigned integer values.<br/>
+     </td>
+   </tr>
+ </table>
+ <p>Here are some examples of multidimensional arrays:</p>
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>[3 x [4 x int]]</tt><br/>
+       <tt>[12 x [10 x float]]</tt><br/>
+       <tt>[2 x [3 x [4 x uint]]]</tt><br/>
+     </td>
+     <td class="left">
+       3x4 array of integer values.<br/>
+       12x10 array of single precision floating point values.<br/>
+       2x3x4 array of unsigned integer values.<br/>
+     </td>
+   </tr>
+ </table>
+ 
+ <p>Note that 'variable sized arrays' can be implemented in LLVM with a zero 
+ length array.  Normally, accesses past the end of an array are undefined in
+ LLVM (e.g. it is illegal to access the 5th element of a 3 element array).
+ As a special case, however, zero length arrays are recognized to be variable
+ length.  This allows implementation of 'pascal style arrays' with the  LLVM
+ type "{ int, [0 x float]}", for example.</p>
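+ 
+ <p>A sketch of indexing such a 'pascal style array', assuming a hypothetical
+ named type <tt>%pstring</tt> whose first field holds the length:</p>
+ 
+ <pre>
+   %pstring = type { int, [0 x float] }
+ 
+   float %get(%pstring* %s, int %i) {
+     ; field 1 is the zero length array; %i selects an element within it
+     %ep = getelementptr %pstring* %s, int 0, uint 1, int %i
+     %e = load float* %ep
+     ret float %e
+   }
+ </pre>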
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_function">Function Type</a> </div>
+ <div class="doc_text">
+ <h5>Overview:</h5>
+ <p>The function type can be thought of as a function signature.  It
+ consists of a return type and a list of formal parameter types. 
+ Function types are usually used to build virtual function tables
+ (which are structures of pointers to functions), for indirect function
+ calls, and when defining a function.</p>
+ <p>
+ The return type of a function type cannot be an aggregate type.
+ </p>
+ <h5>Syntax:</h5>
+ <pre>  &lt;returntype&gt; (&lt;parameter list&gt;)<br></pre>
+ <p>...where '<tt>&lt;parameter list&gt;</tt>' is a comma-separated list of type
+ specifiers.  Optionally, the parameter list may include a type <tt>...</tt>,
+ which indicates that the function takes a variable number of arguments.
+ Variable argument functions can access their arguments with the <a
+  href="#int_varargs">variable argument handling intrinsic</a> functions.</p>
+ <h5>Examples:</h5>
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>int (int)</tt> <br/>
+       <tt>float (int, int *) *</tt><br/>
+       <tt>int (sbyte *, ...)</tt><br/>
+     </td>
+     <td class="left">
+       function taking an <tt>int</tt>, returning an <tt>int</tt><br/>
+       <a href="#t_pointer">Pointer</a> to a function that takes an
+       <tt>int</tt> and a <a href="#t_pointer">pointer</a> to <tt>int</tt>,
+       returning <tt>float</tt>.<br/>
+       A vararg function that takes at least one <a href="#t_pointer">pointer</a> 
+       to <tt>sbyte</tt> (signed char in C), which returns an integer.  This is 
+       the signature for <tt>printf</tt> in LLVM.<br/>
+     </td>
+   </tr>
+ </table>
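+ 
+ <p>As an illustration of where function types show up, the sketch below makes
+ an indirect call through a function pointer and a call to a vararg function.
+ The functions <tt>%apply</tt> and <tt>%show</tt> are hypothetical:</p>
+ 
+ <pre>
+   declare int %printf(sbyte*, ...)
+ 
+   int %apply(int (int)* %fn, int %x) {
+     %r = call int %fn(int %x)            ; indirect call through a pointer
+     ret int %r
+   }
+ 
+   int %show(sbyte* %fmt, int %x) {
+     %n = call int (sbyte*, ...)* %printf(sbyte* %fmt, int %x)
+     ret int %n
+   }
+ </pre>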
+ 
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_struct">Structure Type</a> </div>
+ <div class="doc_text">
+ <h5>Overview:</h5>
+ <p>The structure type is used to represent a collection of data members
+ together in memory.  The packing of the field types is defined to match
+ the ABI of the underlying processor.  The elements of a structure may
+ be any type that has a size.</p>
+ <p>Structures are accessed using '<tt><a href="#i_load">load</a></tt>'
+ and '<tt><a href="#i_store">store</a></tt>' by getting a pointer to a
+ field with the '<tt><a href="#i_getelementptr">getelementptr</a></tt>'
+ instruction.</p>
+ <h5>Syntax:</h5>
+ <pre>  { &lt;type list&gt; }<br></pre>
+ <h5>Examples:</h5>
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>{ int, int, int }</tt><br/>
+       <tt>{ float, int (int) * }</tt><br/>
+     </td>
+     <td class="left">
+       a triple of three <tt>int</tt> values<br/>
+       A pair, where the first element is a <tt>float</tt> and the second element 
+       is a <a href="#t_pointer">pointer</a> to a <a href="#t_function">function</a> 
+       that takes an <tt>int</tt>, returning an <tt>int</tt>.<br/>
+     </td>
+   </tr>
+ </table>
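+ 
+ <p>Because structures are not first class, a field is reached through a
+ pointer.  A minimal sketch, assuming a hypothetical named type
+ <tt>%pair</tt>:</p>
+ 
+ <pre>
+   %pair = type { int, float }
+ 
+   float %second(%pair* %p) {
+     %fp = getelementptr %pair* %p, int 0, uint 1   ; address of the float field
+     %f = load float* %fp
+     ret float %f
+   }
+ </pre>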
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_pointer">Pointer Type</a> </div>
+ <div class="doc_text">
+ <h5>Overview:</h5>
+ <p>As in many languages, the pointer type represents a pointer or
+ reference to another object, which must live in memory.</p>
+ <h5>Syntax:</h5>
+ <pre>  &lt;type&gt; *<br></pre>
+ <h5>Examples:</h5>
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>[4 x int]*</tt><br/>
+       <tt>int (int *) *</tt><br/>
+     </td>
+     <td class="left">
+       A <a href="#t_pointer">pointer</a> to <a href="#t_array">array</a> of
+       four <tt>int</tt> values<br/>
+       A <a href="#t_pointer">pointer</a> to a <a
+       href="#t_function">function</a> that takes an <tt>int*</tt>, returning an
+       <tt>int</tt>.<br/>
+     </td>
+   </tr>
+ </table>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_packed">Packed Type</a> </div>
+ <div class="doc_text">
+ 
+ <h5>Overview:</h5>
+ 
+ <p>A packed type is a simple derived type that represents a vector
+ of elements.  Packed types are used when multiple primitive data
+ are operated on in parallel using a single instruction (SIMD).
+ A packed type requires a size (number of
+ elements) and an underlying primitive data type.  Vectors must have a power
+ of two length (1, 2, 4, 8, 16 ...).  Packed types are
+ considered <a href="#t_firstclass">first class</a>.</p>
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   &lt; &lt;# elements&gt; x &lt;elementtype&gt; &gt;
+ </pre>
+ 
+ <p>The number of elements is a constant integer value; elementtype may
+ be any integral or floating point type.</p>
+ 
+ <h5>Examples:</h5>
+ 
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>&lt;4 x int&gt;</tt><br/>
+       <tt>&lt;8 x float&gt;</tt><br/>
+       <tt>&lt;2 x uint&gt;</tt><br/>
+     </td>
+     <td class="left">
+       Packed vector of 4 integer values.<br/>
+       Packed vector of 8 floating-point values.<br/>
+       Packed vector of 2 unsigned integer values.<br/>
+     </td>
+   </tr>
+ </table>
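+ 
+ <p>Since packed types are first class, they can be used directly as operands.
+ A sketch with a hypothetical function <tt>%vadd</tt>, which adds two vectors
+ element by element:</p>
+ 
+ <pre>
+   &lt;4 x float&gt; %vadd(&lt;4 x float&gt; %a, &lt;4 x float&gt; %b) {
+     %sum = add &lt;4 x float&gt; %a, %b
+     ret &lt;4 x float&gt; %sum
+   }
+ </pre>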
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="t_opaque">Opaque Type</a> </div>
+ <div class="doc_text">
+ 
+ <h5>Overview:</h5>
+ 
+ <p>Opaque types are used to represent unknown types in the system.  This
+ corresponds (for example) to the C notion of a forward declared structure type.
+ In LLVM, opaque types can eventually be resolved to any type (not just a
+ structure type).</p>
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   opaque
+ </pre>
+ 
+ <h5>Examples:</h5>
+ 
+ <table class="layout">
+   <tr class="layout">
+     <td class="left">
+       <tt>opaque</tt>
+     </td>
+     <td class="left">
+       An opaque type.<br/>
+     </td>
+   </tr>
+ </table>
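+ 
+ <p>A typical use is a handle whose layout is unknown in the current module.
+ In this sketch the type <tt>%handle</tt> and the functions <tt>%open</tt> and
+ <tt>%close</tt> are hypothetical:</p>
+ 
+ <pre>
+   %handle = type opaque
+ 
+   declare %handle* %open(sbyte*)
+   declare void %close(%handle*)
+ </pre>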
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="constants">Constants</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM has several different basic types of constants.  This section describes
+ them all and their syntax.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="simpleconstants">Simple Constants</a></div>
+ 
+ <div class="doc_text">
+ 
+ <dl>
+   <dt><b>Boolean constants</b></dt>
+ 
+   <dd>The two strings '<tt>true</tt>' and '<tt>false</tt>' are both valid
+   constants of the <tt><a href="#t_primitive">bool</a></tt> type.
+   </dd>
+ 
+   <dt><b>Integer constants</b></dt>
+ 
+   <dd>Standard integers (such as '4') are constants of the <a
+   href="#t_integer">integer</a> type.  Negative numbers may be used with signed
+   integer types.
+   </dd>
+ 
+   <dt><b>Floating point constants</b></dt>
+ 
+   <dd>Floating point constants use standard decimal notation (e.g. 123.421),
+   exponential notation (e.g. 1.23421e+2), or a more precise hexadecimal
+   notation (see below).  Floating point constants must have a <a
+   href="#t_floating">floating point</a> type. </dd>
+ 
+   <dt><b>Null pointer constants</b></dt>
+ 
+   <dd>The identifier '<tt>null</tt>' is recognized as a null pointer constant
+   and must be of <a href="#t_pointer">pointer type</a>.</dd>
+ 
+ </dl>
+ 
+ <p>The one non-intuitive notation for constants is the optional hexadecimal form
+ of floating point constants.  For example, the form '<tt>double
+ 0x432ff973cafa8000</tt>' is equivalent to (but harder to read than) '<tt>double
+ 4.5e+15</tt>'.  The only time hexadecimal floating point constants are required
+ (and the only time that they are generated by the disassembler) is when a 
+ floating point constant must be emitted but it cannot be represented as a 
+ decimal floating point number.  For example, NaNs, infinities, and other
+ special values are represented in their IEEE hexadecimal format so that 
+ assembly and disassembly do not cause any bits to change in the constants.</p>
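+ 
+ <p>The sketch below uses each kind of simple constant as a global initializer.
+ The global names are hypothetical; the last hexadecimal value is the IEEE
+ double bit pattern for positive infinity:</p>
+ 
+ <pre>
+   %B   = global bool true
+   %I   = global int -4
+   %D   = global double 1.23421e+2
+   %H   = global double 0x432ff973cafa8000   ; same value as 4.5e+15
+   %Inf = global double 0x7FF0000000000000   ; +infinity
+   %P   = global int* null
+ </pre>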
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="aggregateconstants">Aggregate Constants</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>Aggregate constants arise from aggregation of simple constants
+ and smaller aggregate constants.</p>
+ 
+ <dl>
+   <dt><b>Structure constants</b></dt>
+ 
+   <dd>Structure constants are represented with notation similar to structure
+   type definitions (a comma separated list of elements, surrounded by braces
+   (<tt>{}</tt>)).  For example: "<tt>{ int 4, float 17.0, int* %G }</tt>",
+   where "<tt>%G</tt>" is declared as "<tt>%G = external global int</tt>".  Structure constants
+   must have <a href="#t_struct">structure type</a>, and the number and
+   types of elements must match those specified by the type.
+   </dd>
+ 
+   <dt><b>Array constants</b></dt>
+ 
+   <dd>Array constants are represented with notation similar to array type
+   definitions (a comma separated list of elements, surrounded by square brackets
+   (<tt>[]</tt>)).  For example: "<tt>[ int 42, int 11, int 74 ]</tt>".  Array
+   constants must have <a href="#t_array">array type</a>, and the number and
+   types of elements must match those specified by the type.
+   </dd>
+ 
+   <dt><b>Packed constants</b></dt>
+ 
+   <dd>Packed constants are represented with notation similar to packed type
+   definitions (a comma separated list of elements, surrounded by
+   less-than/greater-than's (<tt>&lt;&gt;</tt>)).  For example: "<tt>&lt; int 42,
+   int 11, int 74, int 100 &gt;</tt>".  Packed constants must have <a
+   href="#t_packed">packed type</a>, and the number and types of elements must
+   match those specified by the type.
+   </dd>
+ 
+   <dt><b>Zero initialization</b></dt>
+ 
+   <dd>The string '<tt>zeroinitializer</tt>' can be used to zero initialize a
+   value of <em>any</em> type, including scalar and aggregate types.
+   This is often used to avoid having to print large zero initializers (e.g. for
+   large arrays) and is always exactly equivalent to using explicit zero
+   initializers.
+   </dd>
+ </dl>
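+ 
+ <p>Put together, the aggregate constants above might be used as initializers
+ like this (a sketch; the global names other than <tt>%G</tt> are
+ hypothetical):</p>
+ 
+ <pre>
+   %G = external global int
+   %S = global { int, float, int* } { int 4, float 17.0, int* %G }
+   %A = global [3 x int] [ int 42, int 11, int 74 ]
+   %V = global &lt;4 x int&gt; &lt; int 42, int 11, int 74, int 100 &gt;
+   %Z = global [1024 x int] zeroinitializer
+ </pre>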
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="globalconstants">Global Variable and Function Addresses</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The addresses of <a href="#globalvars">global variables</a> and <a
+ href="#functionstructure">functions</a> are always implicitly valid (link-time)
+ constants.  These constants are explicitly referenced when the <a
+ href="#identifiers">identifier for the global</a> is used and always have <a
+ href="#t_pointer">pointer</a> type. For example, the following is a legal LLVM
+ file:</p>
+ 
+ <pre>
+   %X = global int 17
+   %Y = global int 42
+   %Z = global [2 x int*] [ int* %X, int* %Y ]
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="undefvalues">Undefined Values</a></div>
+ <div class="doc_text">
+   <p>The string '<tt>undef</tt>' is recognized as a type-less constant that has 
+   no specific value.  Undefined values may be of any type and be used anywhere 
+   a constant is permitted.</p>
+ 
+   <p>Undefined values indicate to the compiler that the program is well defined
+   no matter what value is used, giving the compiler more freedom to optimize.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="constantexprs">Constant Expressions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Constant expressions are used to allow expressions involving other constants
+ to be used as constants.  Constant expressions may be of any <a
+ href="#t_firstclass">first class</a> type and may involve any LLVM operation
+ that does not have side effects (e.g. load and call are not supported).  The
+ following is the syntax for constant expressions:</p>
+ 
+ <dl>
+   <dt><b><tt>cast ( CST to TYPE )</tt></b></dt>
+ 
+   <dd>Cast a constant to another type.</dd>
+ 
+   <dt><b><tt>getelementptr ( CSTPTR, IDX0, IDX1, ... )</tt></b></dt>
+ 
+   <dd>Perform the <a href="#i_getelementptr">getelementptr operation</a> on
+   constants.  As with the <a href="#i_getelementptr">getelementptr</a>
+   instruction, the index list may have zero or more indexes, which are required
+   to make sense for the type of "CSTPTR".</dd>
+ 
+   <dt><b><tt>select ( COND, VAL1, VAL2 )</tt></b></dt>
+ 
+   <dd>Perform the <a href="#i_select">select operation</a> on
+   constants.</dd>
+ 
+   <dt><b><tt>extractelement ( VAL, IDX )</tt></b></dt>
+ 
+   <dd>Perform the <a href="#i_extractelement">extractelement
+   operation</a> on constants.</dd>
+ 
+   <dt><b><tt>insertelement ( VAL, ELT, IDX )</tt></b></dt>
+ 
+   <dd>Perform the <a href="#i_insertelement">insertelement
+   operation</a> on constants.</dd>
+ 
+ 
+   <dt><b><tt>shufflevector ( VEC1, VEC2, IDXMASK )</tt></b></dt>
+ 
+   <dd>Perform the <a href="#i_shufflevector">shufflevector
+   operation</a> on constants.</dd>
+ 
+   <dt><b><tt>OPCODE ( LHS, RHS )</tt></b></dt>
+ 
+   <dd>Perform the specified operation on the LHS and RHS constants. OPCODE may
+   be any of the <a href="#binaryops">binary</a> or <a href="#bitwiseops">bitwise
+   binary</a> operations.  The constraints on operands are the same as those for
+   the corresponding instruction (e.g. no bitwise operations on floating point
+   values are allowed).</dd>
+ </dl>
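+ 
+ <p>For instance, constant expressions can appear as global initializers.  The
+ following is a sketch with hypothetical global names:</p>
+ 
+ <pre>
+   %X      = global int 17
+   %A      = global [2 x int] [ int 1, int 2 ]
+   %AsLong = global long cast (int* %X to long)
+   %Second = global int* getelementptr ([2 x int]* %A, int 0, int 1)
+   %Sum    = global int add (int 3, int 4)
+ </pre>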
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="othervalues">Other Values</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+ <a name="inlineasm">Inline Assembler Expressions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ LLVM supports inline assembler expressions (as opposed to <a href="#moduleasm">
+ Module-Level Inline Assembly</a>) through the use of a special value.  This
+ value represents the inline assembler as a string (containing the instructions
+ to emit), a list of operand constraints (stored as a string), and a flag that 
+ indicates whether or not the inline asm expression has side effects.  An example
+ inline assembler expression is:
+ </p>
+ 
+ <pre>
+   int(int) asm "bswap $0", "=r,r"
+ </pre>
+ 
+ <p>
+ Inline assembler expressions may <b>only</b> be used as the callee operand of
+ a <a href="#i_call"><tt>call</tt> instruction</a>.  Thus, typically we have:
+ </p>
+ 
+ <pre>
+   %X = call int asm "<a href="#i_bswap">bswap</a> $0", "=r,r"(int %Y)
+ </pre>
+ 
+ <p>
+ Inline asms with side effects not visible in the constraint list must be marked
+ as having side effects.  This is done through the use of the
+ '<tt>sideeffect</tt>' keyword, like so:
+ </p>
+ 
+ <pre>
+   call void asm sideeffect "eieio", ""()
+ </pre>
+ 
+ <p>TODO: The format of the asm and constraint strings still needs to be
+ documented here.  Constraints on what can be done (e.g. duplication, moving,
+ etc.) also need to be documented.
+ </p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="instref">Instruction Reference</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM instruction set consists of several different
+ classifications of instructions: <a href="#terminators">terminator
+ instructions</a>, <a href="#binaryops">binary instructions</a>,
+ <a href="#bitwiseops">bitwise binary instructions</a>, <a
+  href="#memoryops">memory instructions</a>, and <a href="#otherops">other
+ instructions</a>.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="terminators">Terminator
+ Instructions</a> </div>
+ 
+ <div class="doc_text">
+ 
+ <p>As mentioned <a href="#functionstructure">previously</a>, every
+ basic block in a program ends with a "Terminator" instruction, which
+ indicates which block should be executed after the current block is
+ finished. These terminator instructions typically yield a '<tt>void</tt>'
+ value: they produce control flow, not values (the one exception being
+ the '<a href="#i_invoke"><tt>invoke</tt></a>' instruction).</p>
+ <p>There are six different terminator instructions: the '<a
+  href="#i_ret"><tt>ret</tt></a>' instruction, the '<a href="#i_br"><tt>br</tt></a>'
+ instruction, the '<a href="#i_switch"><tt>switch</tt></a>' instruction,
+ the '<a href="#i_invoke"><tt>invoke</tt></a>' instruction, the '<a
+  href="#i_unwind"><tt>unwind</tt></a>' instruction, and the '<a
+  href="#i_unreachable"><tt>unreachable</tt></a>' instruction.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_ret">'<tt>ret</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  ret <type> <value>       <i>; Return a value from a non-void function</i>
+   ret void                 <i>; Return from void function</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>ret</tt>' instruction is used to return control flow (and a
+ value) from a function back to the caller.</p>
+ <p>There are two forms of the '<tt>ret</tt>' instruction: one that
+ returns a value and then transfers control flow, and one that only
+ transfers control flow.</p>
+ <h5>Arguments:</h5>
+ <p>The '<tt>ret</tt>' instruction may return any '<a
+  href="#t_firstclass">first class</a>' type.  Notice that a function is
+ not <a href="#wellformed">well formed</a> if there exists a '<tt>ret</tt>'
+ instruction inside of the function that returns a value that does not
+ match the return type of the function.</p>
+ <h5>Semantics:</h5>
+ <p>When the '<tt>ret</tt>' instruction is executed, control flow
+ returns back to the calling function's context.  If the caller is a "<a
+  href="#i_call"><tt>call</tt></a>" instruction, execution continues at
+ the instruction after the call.  If the caller is an "<a
+  href="#i_invoke"><tt>invoke</tt></a>" instruction, execution continues
+ at the beginning of the "normal" destination block.  If the instruction
+ returns a value, that value shall set the call or invoke instruction's
+ return value.</p>
+ <h5>Example:</h5>
+ <pre>  ret int 5                       <i>; Return an integer value of 5</i>
+   ret void                        <i>; Return from a void function</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_br">'<tt>br</tt>' Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  br bool <cond>, label <iftrue>, label <iffalse><br>  br label <dest>          <i>; Unconditional branch</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>br</tt>' instruction is used to cause control flow to
+ transfer to a different basic block in the current function.  There are
+ two forms of this instruction, corresponding to a conditional branch
+ and an unconditional branch.</p>
+ <h5>Arguments:</h5>
+ <p>The conditional branch form of the '<tt>br</tt>' instruction takes a
+ single '<tt>bool</tt>' value and two '<tt>label</tt>' values.  The
+ unconditional form of the '<tt>br</tt>' instruction takes a single '<tt>label</tt>'
+ value as a target.</p>
+ <h5>Semantics:</h5>
+ <p>Upon execution of a conditional '<tt>br</tt>' instruction, the '<tt>bool</tt>'
+ argument is evaluated.  If the value is <tt>true</tt>, control flows
+ to the '<tt>iftrue</tt>' <tt>label</tt> argument.  If "cond" is <tt>false</tt>,
+ control flows to the '<tt>iffalse</tt>' <tt>label</tt> argument.</p>
+ <h5>Example:</h5>
+ <pre>Test:<br>  %cond = <a href="#i_setcc">seteq</a> int %a, %b<br>  br bool %cond, label %IfEqual, label %IfUnequal<br>IfEqual:<br>  <a
+  href="#i_ret">ret</a> int 1<br>IfUnequal:<br>  <a href="#i_ret">ret</a> int 0<br></pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_switch">'<tt>switch</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   switch <intty> <value>, label <defaultdest> [ <intty> <val>, label <dest> ... ]
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>switch</tt>' instruction is used to transfer control flow to one of
+ several different places.  It is a generalization of the '<tt>br</tt>'
+ instruction, allowing a branch to occur to one of many possible
+ destinations.</p>
+ 
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The '<tt>switch</tt>' instruction uses three parameters: an integer
+ comparison value '<tt>value</tt>', a default '<tt>label</tt>' destination, and
+ an array of pairs of comparison value constants and '<tt>label</tt>'s.  The
+ table is not allowed to contain duplicate constant entries.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The <tt>switch</tt> instruction specifies a table of values and
+ destinations. When the '<tt>switch</tt>' instruction is executed, this
+ table is searched for the given value.  If the value is found, control flow is
+ transferred to the corresponding destination; otherwise, control flow is
+ transferred to the default destination.</p>
+ 
+ <h5>Implementation:</h5>
+ 
+ <p>Depending on properties of the target machine and the particular
+ <tt>switch</tt> instruction, this instruction may be code generated in different
+ ways.  For example, it could be generated as a series of chained conditional
+ branches or with a lookup table.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+  <i>; Emulate a conditional br instruction</i>
+  %Val = <a href="#i_cast">cast</a> bool %value to int
+  switch int %Val, label %truedest [int 0, label %falsedest ]
+ 
+  <i>; Emulate an unconditional br instruction</i>
+  switch uint 0, label %dest [ ]
+ 
+  <i>; Implement a jump table:</i>
+  switch uint %val, label %otherwise [ uint 0, label %onzero 
+                                       uint 1, label %onone 
+                                       uint 2, label %ontwo ]
+ </pre>
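+ <p>As an informal illustration of the table lookup described under Semantics,
+ the following Python sketch models a <tt>switch</tt> as a dictionary lookup with
+ a default.  The function name <tt>switch_dest</tt> and the use of strings for
+ labels are hypothetical and are not part of LLVM; the sketch says nothing about
+ how a code generator actually lowers the instruction.</p>

```python
def switch_dest(value, default_dest, table):
    """Return the label a switch would branch to (illustrative model only).

    `table` is a list of (constant, label) pairs.  Duplicate constants are
    rejected, mirroring the rule that the table may not contain duplicates.
    """
    constants = [const for const, _ in table]
    if len(constants) != len(set(constants)):
        raise ValueError("duplicate constant in switch table")
    # Fall back to the default destination when the value is not in the table.
    return dict(table).get(value, default_dest)
```

+ <p>With an empty table the sketch always returns the default destination,
+ which corresponds to the "unconditional br" emulation in the example.</p>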
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_invoke">'<tt>invoke</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = invoke [<a href="#callingconv">cconv</a>] <ptr to function ty> %<function ptr val>(<function args>) 
+                 to label <normal label> unwind label <exception label>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>invoke</tt>' instruction causes control to transfer to a specified
+ function, with the possibility of control flow transfer to either the
+ '<tt>normal</tt>' label or the
+ '<tt>exception</tt>' label.  If the callee function returns with the
+ "<tt><a href="#i_ret">ret</a></tt>" instruction, control flow will return to the
+ "normal" label.  If the callee (or any indirect callees) returns with the "<a
+ href="#i_unwind"><tt>unwind</tt></a>" instruction, control is interrupted and
+ continued at the dynamically nearest "exception" label.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>This instruction requires several arguments:</p>
+ 
+ <ol>
+   <li>
+     The optional "cconv" marker indicates which <a href="#callingconv">calling
+     convention</a> the call should use.  If none is specified, the call defaults
+     to using C calling conventions.
+   </li>
+   <li>'<tt>ptr to function ty</tt>': shall be the signature of the pointer to
+   function value being invoked.  In most cases, this is a direct function
+   invocation, but indirect <tt>invoke</tt>s are just as possible, branching off
+   an arbitrary pointer to function value.
+   </li>
+ 
+   <li>'<tt>function ptr val</tt>': An LLVM value containing a pointer to a
+   function to be invoked. </li>
+ 
+   <li>'<tt>function args</tt>': argument list whose types match the function
+   signature argument types.  If the function signature indicates the function
+   accepts a variable number of arguments, the extra arguments can be
+   specified. </li>
+ 
+   <li>'<tt>normal label</tt>': the label reached when the called function
+   executes a '<tt><a href="#i_ret">ret</a></tt>' instruction. </li>
+ 
+   <li>'<tt>exception label</tt>': the label reached when a callee returns with
+   the <a href="#i_unwind"><tt>unwind</tt></a> instruction. </li>
+ 
+ </ol>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>This instruction is designed to operate as a standard '<tt><a
+ href="#i_call">call</a></tt>' instruction in most regards.  The primary
+ difference is that it establishes an association with a label, which is used by
+ the runtime library to unwind the stack.</p>
+ 
+ <p>This instruction is used in languages with destructors to ensure that proper
+ cleanup is performed in the case of either a <tt>longjmp</tt> or a thrown
+ exception.  Additionally, this is important for implementation of
+ '<tt>catch</tt>' clauses in high-level languages that support them.</p>
+ 
+ <h5>Example:</h5>
+ <pre>
+   %retval = invoke int %Test(int 15) to label %Continue
+               unwind label %TestCleanup     <i>; {int}:retval set</i>
+   %retval = invoke <a href="#callingconv">coldcc</a> int %Test(int 15) to label %Continue
+               unwind label %TestCleanup     <i>; {int}:retval set</i>
+ </pre>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_subsubsection"> <a name="i_unwind">'<tt>unwind</tt>'
+ Instruction</a> </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   unwind
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>unwind</tt>' instruction unwinds the stack, continuing control flow
+ at the first caller in the dynamic call stack which used an <a
+ href="#i_invoke"><tt>invoke</tt></a> instruction to perform the call.  This is
+ primarily used to implement exception handling.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>unwind</tt>' instruction causes execution of the current function to
+ immediately halt.  The dynamic call stack is then searched for the first <a
+ href="#i_invoke"><tt>invoke</tt></a> instruction on the call stack.  Once found,
+ execution continues at the "exception" destination block specified by the
+ <tt>invoke</tt> instruction.  If there is no <tt>invoke</tt> instruction in the
+ dynamic call chain, undefined behavior results.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_subsubsection"> <a name="i_unreachable">'<tt>unreachable</tt>'
+ Instruction</a> </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   unreachable
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>unreachable</tt>' instruction has no defined semantics.  This
+ instruction is used to inform the optimizer that a particular portion of the
+ code is not reachable.  This can be used to indicate that the code after a
+ no-return function cannot be reached, and other facts.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>unreachable</tt>' instruction has no defined semantics.</p>
+ </div>
+ 
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="binaryops">Binary Operations</a> </div>
+ <div class="doc_text">
+ <p>Binary operators are used to do most of the computation in a
+ program.  They require two operands, execute an operation on them, and
+ produce a single value.  The operands might represent 
+ multiple data, as is the case with the <a href="#t_packed">packed</a> data type. 
+ The result value of a binary operator is not
+ necessarily the same type as its operands.</p>
+ <p>There are several different binary operators:</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_add">'<tt>add</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = add <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>add</tt>' instruction returns the sum of its two operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>add</tt>' instruction must be either <a
+  href="#t_integer">integer</a> or <a href="#t_floating">floating point</a> values.
+  This instruction can also take <a href="#t_packed">packed</a> versions of the values.
+ Both arguments must have identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The value produced is the integer or floating point sum of the two
+ operands.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = add int 4, %var          <i>; yields {int}:result = 4 + %var</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_sub">'<tt>sub</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = sub <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>sub</tt>' instruction returns the difference of its two
+ operands.</p>
+ <p>Note that the '<tt>sub</tt>' instruction is used to represent the '<tt>neg</tt>'
+ instruction present in most other intermediate representations.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>sub</tt>' instruction must be either <a
+  href="#t_integer">integer</a> or <a href="#t_floating">floating point</a>
+ values. 
+ This instruction can also take <a href="#t_packed">packed</a> versions of the values.
+ Both arguments must have identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The value produced is the integer or floating point difference of
+ the two operands.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = sub int 4, %var          <i>; yields {int}:result = 4 - %var</i>
+   <result> = sub int 0, %val          <i>; yields {int}:result = -%val</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_mul">'<tt>mul</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = mul <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The  '<tt>mul</tt>' instruction returns the product of its two
+ operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>mul</tt>' instruction must be either <a
+  href="#t_integer">integer</a> or <a href="#t_floating">floating point</a>
+ values. 
+ This instruction can also take <a href="#t_packed">packed</a> versions of the values.
+ Both arguments must have identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The value produced is the integer or floating point product of the
+ two operands.</p>
+ <p>There is no signed vs unsigned multiplication.  The appropriate
+ action is taken based on the type of the operand.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = mul int 4, %var          <i>; yields {int}:result = 4 * %var</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_div">'<tt>div</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = div <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>div</tt>' instruction returns the quotient of its two
+ operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>div</tt>' instruction must be either <a
+  href="#t_integer">integer</a> or <a href="#t_floating">floating point</a>
+ values. 
+ This instruction can also take <a href="#t_packed">packed</a> versions of the values.
+ Both arguments must have identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The value produced is the integer or floating point quotient of the
+ two operands.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = div int 4, %var          <i>; yields {int}:result = 4 / %var</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_rem">'<tt>rem</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = rem <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>rem</tt>' instruction returns the remainder from the
+ division of its two operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>rem</tt>' instruction must be either <a
+  href="#t_integer">integer</a> or <a href="#t_floating">floating point</a>
+ values. 
+ This instruction can also take <a href="#t_packed">packed</a> versions of the values.
+ Both arguments must have identical types.</p>
+ <h5>Semantics:</h5>
+ <p>This returns the <i>remainder</i> of a division (where the result
+ has the same sign as the dividend), not the <i>modulus</i> (where the
+ result has the same sign as the divisor) of a value.  For more
+ information about the difference, see <a
+  href="http://mathforum.org/dr.math/problems/anne.4.28.99.html">The
+ Math Forum</a>.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = rem int 4, %var          <i>; yields {int}:result = 4 % %var</i>
+ </pre>
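+ <p>The difference between a remainder and a modulus only shows up when the
+ operands have different signs.  The Python sketch below contrasts the two for
+ integers.  It assumes that the remainder comes from a division that truncates
+ toward zero, so its sign follows the dividend; the function names are
+ hypothetical and the sketch does not cover floating point operands.</p>

```python
def remainder(a, b):
    """Remainder of truncating division: the sign follows the dividend `a`."""
    q = abs(a) // abs(b)          # magnitude of the truncated quotient
    if (a < 0) != (b < 0):
        q = -q
    return a - q * b


def modulus(a, b):
    """Modulus of floored division: the sign follows the divisor `b`."""
    return a % b                  # Python's % is a floored modulus
```

+ <p>For example, <tt>remainder(-7, 2)</tt> is <tt>-1</tt> while
+ <tt>modulus(-7, 2)</tt> is <tt>1</tt>; for operands of the same sign the two
+ agree.</p>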
+ 
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_setcc">'<tt>set<i>cc</i></tt>'
+ Instructions</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = seteq <ty> <var1>, <var2>   <i>; yields {bool}:result</i>
+   <result> = setne <ty> <var1>, <var2>   <i>; yields {bool}:result</i>
+   <result> = setlt <ty> <var1>, <var2>   <i>; yields {bool}:result</i>
+   <result> = setgt <ty> <var1>, <var2>   <i>; yields {bool}:result</i>
+   <result> = setle <ty> <var1>, <var2>   <i>; yields {bool}:result</i>
+   <result> = setge <ty> <var1>, <var2>   <i>; yields {bool}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>set<i>cc</i></tt>' family of instructions returns a boolean
+ value based on a comparison of their two operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>set<i>cc</i></tt>' instructions must
+ be of <a href="#t_firstclass">first class</a> type (it is not possible
+ to compare '<tt>label</tt>'s, '<tt>array</tt>'s, '<tt>structure</tt>'
+ or '<tt>void</tt>' values, etc...).  Both arguments must have identical
+ types.</p>
+ <h5>Semantics:</h5>
+ <p>The '<tt>seteq</tt>' instruction yields a <tt>true</tt> '<tt>bool</tt>'
+ value if both operands are equal.<br>
+ The '<tt>setne</tt>' instruction yields a <tt>true</tt> '<tt>bool</tt>'
+ value if both operands are unequal.<br>
+ The '<tt>setlt</tt>' instruction yields a <tt>true</tt> '<tt>bool</tt>'
+ value if the first operand is less than the second operand.<br>
+ The '<tt>setgt</tt>' instruction yields a <tt>true</tt> '<tt>bool</tt>'
+ value if the first operand is greater than the second operand.<br>
+ The '<tt>setle</tt>' instruction yields a <tt>true</tt> '<tt>bool</tt>'
+ value if the first operand is less than or equal to the second operand.<br>
+ The '<tt>setge</tt>' instruction yields a <tt>true</tt> '<tt>bool</tt>'
+ value if the first operand is greater than or equal to the second
+ operand.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = seteq int   4, 5        <i>; yields {bool}:result = false</i>
+   <result> = setne float 4, 5        <i>; yields {bool}:result = true</i>
+   <result> = setlt uint  4, 5        <i>; yields {bool}:result = true</i>
+   <result> = setgt sbyte 4, 5        <i>; yields {bool}:result = false</i>
+   <result> = setle sbyte 4, 5        <i>; yields {bool}:result = true</i>
+   <result> = setge sbyte 4, 5        <i>; yields {bool}:result = false</i>
+ </pre>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="bitwiseops">Bitwise Binary
+ Operations</a> </div>
+ <div class="doc_text">
+ <p>Bitwise binary operators are used to do various forms of
+ bit-twiddling in a program.  They are generally very efficient
+ instructions and can commonly be strength reduced from other
+ instructions.  They require two operands, execute an operation on them,
+ and produce a single value.  The resulting value of the bitwise binary
+ operators is always the same type as its first operand.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_and">'<tt>and</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = and <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>and</tt>' instruction returns the bitwise logical and of
+ its two operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>and</tt>' instruction must be <a
+  href="#t_integral">integral</a> values.  Both arguments must have
+ identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The truth table used for the '<tt>and</tt>' instruction is:</p>
+ <p> </p>
+ <div style="align: center">
+ <table border="1" cellspacing="0" cellpadding="4">
+   <tbody>
+     <tr>
+       <td>In0</td>
+       <td>In1</td>
+       <td>Out</td>
+     </tr>
+     <tr>
+       <td>0</td>
+       <td>0</td>
+       <td>0</td>
+     </tr>
+     <tr>
+       <td>0</td>
+       <td>1</td>
+       <td>0</td>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>0</td>
+       <td>0</td>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>1</td>
+       <td>1</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <h5>Example:</h5>
+ <pre>  <result> = and int 4, %var         <i>; yields {int}:result = 4 & %var</i>
+   <result> = and int 15, 40          <i>; yields {int}:result = 8</i>
+   <result> = and int 4, 8            <i>; yields {int}:result = 0</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_or">'<tt>or</tt>' Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = or <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>or</tt>' instruction returns the bitwise logical inclusive
+ or of its two operands.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>or</tt>' instruction must be <a
+  href="#t_integral">integral</a> values.  Both arguments must have
+ identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The truth table used for the '<tt>or</tt>' instruction is:</p>
+ <p> </p>
+ <div style="align: center">
+ <table border="1" cellspacing="0" cellpadding="4">
+   <tbody>
+     <tr>
+       <td>In0</td>
+       <td>In1</td>
+       <td>Out</td>
+     </tr>
+     <tr>
+       <td>0</td>
+       <td>0</td>
+       <td>0</td>
+     </tr>
+     <tr>
+       <td>0</td>
+       <td>1</td>
+       <td>1</td>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>0</td>
+       <td>1</td>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>1</td>
+       <td>1</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <h5>Example:</h5>
+ <pre>  <result> = or int 4, %var         <i>; yields {int}:result = 4 | %var</i>
+   <result> = or int 15, 40          <i>; yields {int}:result = 47</i>
+   <result> = or int 4, 8            <i>; yields {int}:result = 12</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_xor">'<tt>xor</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = xor <ty> <var1>, <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>xor</tt>' instruction returns the bitwise logical exclusive
+ or of its two operands.  The <tt>xor</tt> is used to implement the
+ "one's complement" operation, which is the "~" operator in C.</p>
+ <h5>Arguments:</h5>
+ <p>The two arguments to the '<tt>xor</tt>' instruction must be <a
+  href="#t_integral">integral</a> values.  Both arguments must have
+ identical types.</p>
+ <h5>Semantics:</h5>
+ <p>The truth table used for the '<tt>xor</tt>' instruction is:</p>
+ <p> </p>
+ <div style="align: center">
+ <table border="1" cellspacing="0" cellpadding="4">
+   <tbody>
+     <tr>
+       <td>In0</td>
+       <td>In1</td>
+       <td>Out</td>
+     </tr>
+     <tr>
+       <td>0</td>
+       <td>0</td>
+       <td>0</td>
+     </tr>
+     <tr>
+       <td>0</td>
+       <td>1</td>
+       <td>1</td>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>0</td>
+       <td>1</td>
+     </tr>
+     <tr>
+       <td>1</td>
+       <td>1</td>
+       <td>0</td>
+     </tr>
+   </tbody>
+ </table>
+ </div>
+ <p> </p>
+ <h5>Example:</h5>
+ <pre>  <result> = xor int 4, %var         <i>; yields {int}:result = 4 ^ %var</i>
+   <result> = xor int 15, 40          <i>; yields {int}:result = 39</i>
+   <result> = xor int 4, 8            <i>; yields {int}:result = 12</i>
+   <result> = xor int %V, -1          <i>; yields {int}:result = ~%V</i>
+ </pre>
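+ <p>The last example line relies on the identity that XOR with an all-ones
+ value flips every bit.  A minimal Python check of that identity follows; the
+ helper name is hypothetical, and Python integers behave like two's complement
+ values of unbounded width, so <tt>-1</tt> has every bit set.</p>

```python
def ones_complement(v):
    """Return ~v computed as v XOR -1 (-1 has every bit set)."""
    return v ^ -1
```

+ <p>The constant results listed in the examples can be checked the same way,
+ e.g. <tt>15 ^ 40</tt> evaluates to <tt>39</tt> in Python as well.</p>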
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_shl">'<tt>shl</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = shl <ty> <var1>, ubyte <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>shl</tt>' instruction returns the first operand shifted to
+ the left a specified number of bits.</p>
+ <h5>Arguments:</h5>
+ <p>The first argument to the '<tt>shl</tt>' instruction must be of <a
+  href="#t_integer">integer</a> type.  The second argument must be of '<tt>ubyte</tt>'
+ type.</p>
+ <h5>Semantics:</h5>
+ <p>The value produced is <tt>var1</tt> * 2<sup><tt>var2</tt></sup>.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = shl int 4, ubyte %var   <i>; yields {int}:result = 4 << %var</i>
+   <result> = shl int 4, ubyte 2      <i>; yields {int}:result = 16</i>
+   <result> = shl int 1, ubyte 10     <i>; yields {int}:result = 1024</i>
+ </pre>
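+ <p>The formula <tt>var1</tt> * 2<sup><tt>var2</tt></sup> can be sketched in
+ Python as follows.  The sketch assumes a 32-bit signed <tt>int</tt> whose
+ result wraps around to the type's width; the text above does not specify the
+ overflow behavior, so treat the wrapping (and the function name and
+ <tt>bits</tt> parameter) as assumptions of this illustration.</p>

```python
def shl(value, amount, bits=32):
    """Shift left, keep the low `bits` bits, and reinterpret as signed."""
    mask = (1 << bits) - 1
    result = (value << amount) & mask   # same as value * 2**amount, truncated
    if result >= 1 << (bits - 1):       # top bit set: negative in two's complement
        result -= 1 << bits
    return result
```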
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_shr">'<tt>shr</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = shr <ty> <var1>, ubyte <var2>   <i>; yields {ty}:result</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>shr</tt>' instruction returns the first operand shifted to
+ the right a specified number of bits.</p>
+ <h5>Arguments:</h5>
+ <p>The first argument to the '<tt>shr</tt>' instruction must be of <a
+  href="#t_integer">integer</a> type.  The second argument must be of '<tt>ubyte</tt>'
+ type.</p>
+ <h5>Semantics:</h5>
+ <p>If the first argument is a <a href="#t_signed">signed</a> type, the
+ most significant bit is duplicated in the newly freed bit positions. 
+ If the first argument is unsigned, zero bits shall fill the empty
+ positions.</p>
+ <h5>Example:</h5>
+ <pre>  <result> = shr int 4, ubyte %var   <i>; yields {int}:result = 4 >> %var</i>
+   <result> = shr uint 4, ubyte 1     <i>; yields {uint}:result = 2</i>
+   <result> = shr int 4, ubyte 2      <i>; yields {int}:result = 1</i>
+   <result> = shr sbyte 4, ubyte 3    <i>; yields {sbyte}:result = 0</i>
+   <result> = shr sbyte -2, ubyte 1   <i>; yields {sbyte}:result = -1</i>
+ </pre>
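+ <p>The two fill rules (sign bit for signed types, zero for unsigned types) can
+ be sketched in Python.  The function name and the explicit <tt>bits</tt> and
+ <tt>signed</tt> parameters are hypothetical; in LLVM this information comes
+ from the type of the first operand.</p>

```python
def shr(value, amount, bits, signed):
    """Shift right: sign-extend for signed types, zero-fill for unsigned."""
    mask = (1 << bits) - 1
    raw = value & mask                  # the operand's bit pattern
    if signed and raw >> (bits - 1):
        raw -= 1 << bits                # reinterpret the pattern as negative
    # Python's >> copies the sign bit for negative ints and zero-fills otherwise.
    return raw >> amount
```

+ <p>The same bit pattern therefore shifts differently depending on the type:
+ <tt>shr(-2, 1, 8, True)</tt> gives <tt>-1</tt>, while the unsigned pattern
+ <tt>shr(254, 1, 8, False)</tt> gives <tt>127</tt>.</p>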
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> 
+   <a name="vectorops">Vector Operations</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM supports several instructions to represent vector operations in a
+ target-independent manner.  These instructions cover the element-access and
+ vector-specific operations needed to process vectors effectively.  While LLVM
+ does directly support these vector operations, many sophisticated algorithms
+ will want to use target-specific intrinsics to take full advantage of a specific
+ target.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_extractelement">'<tt>extractelement</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = extractelement <n x <ty>> <val>, uint <idx>    <i>; yields <ty></i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>extractelement</tt>' instruction extracts a single scalar
+ element from a packed vector at a specified index.
+ </p>
+ 
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The first operand of an '<tt>extractelement</tt>' instruction is a
+ value of <a href="#t_packed">packed</a> type.  The second operand is
+ an index indicating the position from which to extract the element.
+ The index may be a variable.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The result is a scalar of the same type as the element type of
+ <tt>val</tt>.  Its value is the value at position <tt>idx</tt> of
+ <tt>val</tt>.  If <tt>idx</tt> exceeds the length of <tt>val</tt>, the
+ results are undefined.
+ </p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %result = extractelement <4 x int> %vec, uint 0    <i>; yields int</i>
+ </pre>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_insertelement">'<tt>insertelement</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = insertelement <n x <ty>> <val>, <ty> <elt>, uint <idx>    <i>; yields <n x <ty>></i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>insertelement</tt>' instruction inserts a scalar
+ element into a packed vector at a specified index.
+ </p>
+ 
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The first operand of an '<tt>insertelement</tt>' instruction is a
+ value of <a href="#t_packed">packed</a> type.  The second operand is a
+ scalar value whose type must equal the element type of the first
+ operand.  The third operand is an index indicating the position at
+ which to insert the value.  The index may be a variable.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The result is a packed vector of the same type as <tt>val</tt>.  Its
+ element values are those of <tt>val</tt> except at position
+ <tt>idx</tt>, where it gets the value <tt>elt</tt>.  If <tt>idx</tt>
+ exceeds the length of <tt>val</tt>, the results are undefined.
+ </p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %result = insertelement <4 x int> %vec, int 1, uint 0    <i>; yields <4 x int></i>
+ </pre>
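+ <p>The element-access instructions can be modeled on Python lists, as in the
+ sketch below.  The function names simply reuse the instruction names for
+ readability.  An out-of-range index is undefined in LLVM; raising
+ <tt>IndexError</tt> here is a choice made for this illustration only.</p>

```python
def extractelement(vec, idx):
    """Return the scalar at position `idx` of `vec`."""
    if not 0 <= idx < len(vec):
        raise IndexError("index out of range (undefined in LLVM)")
    return vec[idx]


def insertelement(vec, elt, idx):
    """Return a copy of `vec` with position `idx` replaced by `elt`."""
    if not 0 <= idx < len(vec):
        raise IndexError("index out of range (undefined in LLVM)")
    result = list(vec)   # the input vector itself is left unchanged
    result[idx] = elt
    return result
```

+ <p>Note that <tt>insertelement</tt> produces a new vector value rather than
+ modifying its operand, which the copy in the sketch reflects.</p>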
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_shufflevector">'<tt>shufflevector</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <n x uint> <mask>    <i>; yields <n x <ty>></i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>shufflevector</tt>' instruction constructs a permutation of elements
+ from two input vectors, returning a vector of the same type.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The first two operands of a '<tt>shufflevector</tt>' instruction are vectors
+ with types that match each other and types that match the result of the
+ instruction.  The third argument is a shuffle mask, which has the same number
+ of elements as the other vector type, but whose element type is always 'uint'.
+ </p>
+ 
+ <p>
+ The shuffle mask operand is required to be a constant vector with either
+ constant integer or undef values.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The elements of the two input vectors are numbered from left to right across
+ both of the vectors.  The shuffle mask operand specifies, for each element of
+ the result vector, which element of the two input vectors the result element
+ gets.  The element selector may be undef (meaning "don't care") and the second
+ operand may be undef if performing a shuffle from only one vector.
+ </p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %result = shufflevector <4 x int> %v1, <4 x int> %v2, 
+                           <4 x uint> <uint 0, uint 4, uint 1, uint 5>    <i>; yields <4 x int></i>
+   %result = shufflevector <4 x int> %v1, <4 x int> undef, 
+                           <4 x uint> <uint 0, uint 1, uint 2, uint 3>  <i>; yields <4 x int></i> - Identity shuffle.
+ </pre>
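+ 
+ <p>As a further sketch of how the mask indexes the two inputs, the fragment
+ below assumes two hypothetical <tt><4 x int></tt> values <tt>%v1</tt> and
+ <tt>%v2</tt>.  Mask values 0 through 3 name the elements of <tt>%v1</tt>, and
+ mask values 4 through 7 name those of <tt>%v2</tt>:</p>
+ 
+ <pre>
+   <i>; Reverse %v1: result element i takes element 3-i of %v1.</i>
+   %rev = shufflevector <4 x int> %v1, <4 x int> undef,
+                        <4 x uint> <uint 3, uint 2, uint 1, uint 0>
+ 
+   <i>; Splat: broadcast element 0 of %v1 to every result element.</i>
+   %splat = shufflevector <4 x int> %v1, <4 x int> undef,
+                          <4 x uint> <uint 0, uint 0, uint 0, uint 0>
+ 
+   <i>; Elements 0 and 1 of %v1 followed by elements 2 and 3 of %v2
+   ; (mask values 6 and 7).</i>
+   %mix = shufflevector <4 x int> %v1, <4 x int> %v2,
+                        <4 x uint> <uint 0, uint 1, uint 6, uint 7>
+ </pre>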
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_vsetint">'<tt>vsetint</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre><result> = vsetint <op>, <n x <ty>> <var1>, <var2>   <i>; yields <n x bool></i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>vsetint</tt>' instruction takes two integer vectors and
+ returns a vector of boolean values representing, at each position, the
+ result of the comparison between the values at that position in the
+ two operands.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The arguments to a '<tt>vsetint</tt>' instruction are a comparison
+ operation and two value arguments.  The value arguments must be of <a
+ href="#t_integral">integral</a> <a href="#t_packed">packed</a> type,
+ and they must have identical types.  The operation argument must be
+ one of <tt>eq</tt>, <tt>ne</tt>, <tt>slt</tt>, <tt>sgt</tt>,
+ <tt>sle</tt>, <tt>sge</tt>, <tt>ult</tt>, <tt>ugt</tt>, <tt>ule</tt>,
+ <tt>uge</tt>, <tt>true</tt>, and <tt>false</tt>.  The result is a
+ packed <tt>bool</tt> value with the same length as each operand.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The following table shows the semantics of '<tt>vsetint</tt>'.  For
+ each position of the result, the comparison is done on the
+ corresponding positions of the two value arguments.  Note that the
+ signedness of the comparison depends on the comparison opcode and
+ <i>not</i> on the signedness of the value operands.  E.g., <tt>vsetint
+ slt <4 x uint> %x, %y</tt> does an elementwise <i>signed</i>
+ comparison of <tt>%x</tt> and <tt>%y</tt>.</p>
+ 
+ <table  border="1" cellspacing="0" cellpadding="4">
+   <tbody>
+     <tr><th>Operation</th><th>Result is true iff</th><th>Comparison is</th></tr>
+     <tr><td><tt>eq</tt></td><td>var1 == var2</td><td>--</td></tr>
+     <tr><td><tt>ne</tt></td><td>var1 != var2</td><td>--</td></tr>
+     <tr><td><tt>slt</tt></td><td>var1 < var2</td><td>signed</td></tr>
+     <tr><td><tt>sgt</tt></td><td>var1 > var2</td><td>signed</td></tr>
+     <tr><td><tt>sle</tt></td><td>var1 <= var2</td><td>signed</td></tr>
+     <tr><td><tt>sge</tt></td><td>var1 >= var2</td><td>signed</td></tr>
+     <tr><td><tt>ult</tt></td><td>var1 < var2</td><td>unsigned</td></tr>
+     <tr><td><tt>ugt</tt></td><td>var1 > var2</td><td>unsigned</td></tr>
+     <tr><td><tt>ule</tt></td><td>var1 <= var2</td><td>unsigned</td></tr>
+     <tr><td><tt>uge</tt></td><td>var1 >= var2</td><td>unsigned</td></tr>
+     <tr><td><tt>true</tt></td><td>always</td><td>--</td></tr>
+     <tr><td><tt>false</tt></td><td>never</td><td>--</td></tr>
+   </tbody>
+ </table>
+ 
+ <h5>Example:</h5>
+ <pre>  <result> = vsetint eq <2 x int> <int 0, int 1>, <int 1, int 0>      <i>; yields {<2 x bool>}:result = false, false</i>
+   <result> = vsetint ne <2 x int> <int 0, int 1>, <int 1, int 0>      <i>; yields {<2 x bool>}:result = true, true</i>
+   <result> = vsetint slt <2 x int> <int 0, int 1>, <int 1, int 0>      <i>; yields {<2 x bool>}:result = true, false</i>
+   <result> = vsetint sgt <2 x int> <int 0, int 1>, <int 1, int 0>      <i>; yields {<2 x bool>}:result = false, true</i>
+   <result> = vsetint sle <2 x int> <int 0, int 1>, <int 1, int 0>      <i>; yields {<2 x bool>}:result = true, false</i>
+   <result> = vsetint sge <2 x int> <int 0, int 1>, <int 1, int 0>      <i>; yields {<2 x bool>}:result = false, true</i>
+ </pre>
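+ <p>To illustrate that the opcode, not the operand type, selects the
+ signedness, the following sketch compares the same bit patterns both ways.
+ It assumes 32-bit <tt>int</tt> elements, so <tt>int -1</tt> read as an
+ unsigned value is 4294967295; the operands are written as in the examples
+ above:</p>
+ <pre>  %s = vsetint slt <2 x int> <int -1, int 1>, <int 1, int -1>      <i>; signed: -1 < 1, 1 < -1; yields true, false</i>
+   %u = vsetint ult <2 x int> <int -1, int 1>, <int 1, int -1>      <i>; unsigned: 4294967295 < 1, 1 < 4294967295; yields false, true</i>
+ </pre>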
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_vsetfp">'<tt>vsetfp</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre><result> = vsetfp <op>, <n x <ty>> <var1>, <var2>   <i>; yields <n x bool></i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>vsetfp</tt>' instruction takes two floating point vector
+ arguments and returns a vector of boolean values representing, at each
+ position, the result of the comparison between the values at that
+ position in the two operands.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The arguments to a '<tt>vsetfp</tt>' instruction are a comparison
+ operation and two value arguments.  The value arguments must be of <a
+ href="#t_floating">floating point</a> <a href="#t_packed">packed</a>
+ type, and they must have identical types.  The operation argument must
+ be one of <tt>eq</tt>, <tt>ne</tt>, <tt>lt</tt>, <tt>gt</tt>,
+ <tt>le</tt>, <tt>ge</tt>, <tt>oeq</tt>, <tt>one</tt>, <tt>olt</tt>,
+ <tt>ogt</tt>, <tt>ole</tt>, <tt>oge</tt>, <tt>ueq</tt>, <tt>une</tt>,
+ <tt>ult</tt>, <tt>ugt</tt>, <tt>ule</tt>, <tt>uge</tt>, <tt>o</tt>,
+ <tt>u</tt>, <tt>true</tt>, and <tt>false</tt>.  The result is a packed
+ <tt>bool</tt> value with the same length as each operand.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The following table shows the semantics of '<tt>vsetfp</tt>' for
+ floating point types.  If either operand is a floating point Not a
+ Number (NaN) value at a given position, the comparison at that position
+ is unordered, and the value in the "If unordered" column below is
+ produced.  Otherwise, the comparison is ordered, and the result is true
+ iff the condition in the last column holds.</p>
+ 
+ <table  border="1" cellspacing="0" cellpadding="4">
+   <tbody>
+     <tr><th>Operation</th><th>If unordered</th><th>Otherwise true iff</th></tr>
+     <tr><td><tt>eq</tt></td><td>undefined</td><td>var1 == var2</td></tr>
+     <tr><td><tt>ne</tt></td><td>undefined</td><td>var1 != var2</td></tr>
+     <tr><td><tt>lt</tt></td><td>undefined</td><td>var1 < var2</td></tr>
+     <tr><td><tt>gt</tt></td><td>undefined</td><td>var1 > var2</td></tr>
+     <tr><td><tt>le</tt></td><td>undefined</td><td>var1 <= var2</td></tr>
+     <tr><td><tt>ge</tt></td><td>undefined</td><td>var1 >= var2</td></tr>
+     <tr><td><tt>oeq</tt></td><td>false</td><td>var1 == var2</td></tr>
+     <tr><td><tt>one</tt></td><td>false</td><td>var1 != var2</td></tr>
+     <tr><td><tt>olt</tt></td><td>false</td><td>var1 < var2</td></tr>
+     <tr><td><tt>ogt</tt></td><td>false</td><td>var1 > var2</td></tr>
+     <tr><td><tt>ole</tt></td><td>false</td><td>var1 <= var2</td></tr>
+     <tr><td><tt>oge</tt></td><td>false</td><td>var1 >= var2</td></tr>
+     <tr><td><tt>ueq</tt></td><td>true</td><td>var1 == var2</td></tr>
+     <tr><td><tt>une</tt></td><td>true</td><td>var1 != var2</td></tr>
+     <tr><td><tt>ult</tt></td><td>true</td><td>var1 < var2</td></tr>
+     <tr><td><tt>ugt</tt></td><td>true</td><td>var1 > var2</td></tr>
+     <tr><td><tt>ule</tt></td><td>true</td><td>var1 <= var2</td></tr>
+     <tr><td><tt>uge</tt></td><td>true</td><td>var1 >= var2</td></tr>
+     <tr><td><tt>o</tt></td><td>false</td><td>always</td></tr>
+     <tr><td><tt>u</tt></td><td>true</td><td>never</td></tr>
+     <tr><td><tt>true</tt></td><td>true</td><td>always</td></tr>
+     <tr><td><tt>false</tt></td><td>false</td><td>never</td></tr>
+   </tbody>
+ </table>
+ 
+ <h5>Example:</h5>
+ <pre>  <result> = vsetfp eq <2 x float> <float 0.0, float 1.0>, <float 1.0, float 0.0>      <i>; yields {<2 x bool>}:result = false, false</i>
+   <result> = vsetfp ne <2 x float> <float 0.0, float 1.0>, <float 1.0, float 0.0>      <i>; yields {<2 x bool>}:result = true, true</i>
+   <result> = vsetfp lt <2 x float> <float 0.0, float 1.0>, <float 1.0, float 0.0>      <i>; yields {<2 x bool>}:result = true, false</i>
+   <result> = vsetfp gt <2 x float> <float 0.0, float 1.0>, <float 1.0, float 0.0>      <i>; yields {<2 x bool>}:result = false, true</i>
+   <result> = vsetfp le <2 x float> <float 0.0, float 1.0>, <float 1.0, float 0.0>      <i>; yields {<2 x bool>}:result = true, false</i>
+   <result> = vsetfp ge <2 x float> <float 0.0, float 1.0>, <float 1.0, float 0.0>      <i>; yields {<2 x bool>}:result = false, true</i>
+ </pre>
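+ <p>As a sketch of the unordered cases, assume a hypothetical
+ <tt><2 x float></tt> value <tt>%x</tt> whose first element is NaN and whose
+ second element is <tt>1.0</tt>.  Comparing <tt>%x</tt> with itself then
+ gives, per the table above:</p>
+ <pre>  %a = vsetfp oeq <2 x float> %x, %x      <i>; yields false, true  (false wherever the comparison is unordered)</i>
+   %b = vsetfp ueq <2 x float> %x, %x      <i>; yields true, true   (true wherever the comparison is unordered)</i>
+   %c = vsetfp u <2 x float> %x, %x        <i>; yields true, false  (true only where an element is NaN)</i>
+   %d = vsetfp o <2 x float> %x, %x        <i>; yields false, true  (true only where neither element is NaN)</i>
+ </pre>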
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_vselect">'<tt>vselect</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = vselect <n x bool> <cond>, <n x <ty>> <val1>, <n x <ty>> <val2> <i>; yields <n x <ty>></i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>vselect</tt>' instruction chooses one value at each position
+ of a vector based on a condition.
+ </p>
+ 
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The '<tt>vselect</tt>' instruction requires a <a
+ href="#t_packed">packed</a> <tt>bool</tt> value indicating the
+ condition at each vector position, and two values of the same packed
+ type.  All three operands must have the same length.  The type of the
+ result is the same as the type of the two value operands.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ At each position where the <tt>bool</tt> vector is true, that position
+ of the result gets its value from the first value argument; otherwise,
+ it gets its value from the second value argument.
+ </p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %X = vselect <2 x bool> <bool true, bool false>, <2 x ubyte> <ubyte 17, ubyte 17>, 
+     <2 x ubyte> <ubyte 42, ubyte 42>      <i>; yields <2 x ubyte>:17, 42</i>
+ </pre>
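+ 
+ <p>A typical use pairs '<tt>vselect</tt>' with a vector comparison.  The
+ sketch below computes an elementwise signed maximum of two hypothetical
+ <tt><4 x int></tt> values <tt>%a</tt> and <tt>%b</tt>, writing the
+ '<tt><a href="#i_vsetint">vsetint</a></tt>' operands the way that
+ instruction's examples do:</p>
+ 
+ <pre>
+   %gt  = vsetint sgt <4 x int> %a, %b                          <i>; yields <4 x bool></i>
+   %max = vselect <4 x bool> %gt, <4 x int> %a, <4 x int> %b    <i>; yields <4 x int></i>
+ </pre>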
+ </div>
+ 
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> 
+   <a name="memoryops">Memory Access Operations</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>A key design point of an SSA-based representation is how it
+ represents memory.  In LLVM, no memory locations are in SSA form, which
+ makes things very simple.  This section describes how to read, write,
+ allocate, and free memory in LLVM.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_malloc">'<tt>malloc</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = malloc <type>[, uint <NumElements>][, align <alignment>]     <i>; yields {type*}:result</i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>malloc</tt>' instruction allocates memory from the system
+ heap and returns a pointer to it.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The '<tt>malloc</tt>' instruction allocates
+ <tt>sizeof(<type>)*NumElements</tt>
+ bytes of memory from the operating system and returns a pointer of the
+ appropriate type to the program.  If "NumElements" is specified, it is the
+ number of elements allocated; otherwise, one element is allocated.  If an
+ alignment is specified, the result of the allocation is guaranteed to be
+ aligned to at least that boundary.  If not specified, or if zero, the target
+ can choose to align the allocation on any convenient boundary.</p>
+ 
+ <p>'<tt>type</tt>' must be a sized type.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>Memory is allocated using the system "<tt>malloc</tt>" function, and
+ a pointer is returned.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %array  = malloc [4 x ubyte]                     <i>; yields {[4 x ubyte]*}:array</i>
+ 
+   %size   = <a href="#i_add">add</a> uint 2, 2                          <i>; yields {uint}:size = uint 4</i>
+   %array1 = malloc ubyte, uint 4                   <i>; yields {ubyte*}:array1</i>
+   %array2 = malloc [12 x ubyte], uint %size        <i>; yields {[12 x ubyte]*}:array2</i>
+   %array3 = malloc int, uint 4, align 1024         <i>; yields {int*}:array3</i>
+   %array4 = malloc int, align 1024                 <i>; yields {int*}:array4</i>
+ </pre>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_free">'<tt>free</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   free <type> <value>                              <i>; yields {void}</i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>free</tt>' instruction returns memory back to the unused
+ memory heap to be reallocated in the future.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>'<tt>value</tt>' shall be a pointer value that points to a value
+ that was allocated with the '<tt><a href="#i_malloc">malloc</a></tt>'
+ instruction.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>Access to the memory pointed to by the pointer is no longer defined
+ after this instruction executes.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %array  = <a href="#i_malloc">malloc</a> [4 x ubyte]                    <i>; yields {[4 x ubyte]*}:array</i>
+             free   [4 x ubyte]* %array
+ </pre>
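+ 
+ <p>Putting '<tt>malloc</tt>' and '<tt>free</tt>' together, a minimal sketch
+ of a heap allocation's lifetime might look as follows.  The function name
+ <tt>%heap_demo</tt> is hypothetical:</p>
+ 
+ <pre>
+   int %heap_demo() {
+   entry:
+     %p = malloc int                      <i>; yields {int*}:p</i>
+     store int 42, int* %p                <i>; initialize the allocation</i>
+     %v = load int* %p                    <i>; yields {int}:v = int 42</i>
+     free int* %p                         <i>; %p must not be accessed after this</i>
+     ret int %v
+   }
+ </pre>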
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_alloca">'<tt>alloca</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = alloca <type>[, uint <NumElements>][, align <alignment>]     <i>; yields {type*}:result</i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>alloca</tt>' instruction allocates memory on the stack frame of
+ the currently executing function.  The memory remains live until that
+ function returns to its caller.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The '<tt>alloca</tt>' instruction allocates <tt>sizeof(<type>)*NumElements</tt>
+ bytes of memory on the runtime stack, returning a pointer of the
+ appropriate type to the program.  If "NumElements" is specified, it is the
+ number of elements allocated; otherwise, one element is allocated.  If an
+ alignment is specified, the result of the allocation is guaranteed to be
+ aligned to at least that boundary.  If not specified, or if zero, the target
+ can choose to align the allocation on any convenient boundary.</p>
+ 
+ <p>'<tt>type</tt>' may be any sized type.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>Memory is allocated; a pointer is returned.  '<tt>alloca</tt>'d
+ memory is automatically released when the function returns.  The '<tt>alloca</tt>'
+ instruction is commonly used to represent automatic variables that must
+ have an address available.  When the function returns (either with the <tt><a
+  href="#i_ret">ret</a></tt> or <tt><a href="#i_unwind">unwind</a></tt>
+ instructions), the memory is reclaimed.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %ptr = alloca int                              <i>; yields {int*}:ptr</i>
+   %ptr = alloca int, uint 4                      <i>; yields {int*}:ptr</i>
+   %ptr = alloca int, uint 4, align 1024          <i>; yields {int*}:ptr</i>
+   %ptr = alloca int, align 1024                  <i>; yields {int*}:ptr</i>
+ </pre>
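+ 
+ <p>The sketch below shows the "automatic variable with an address" use
+ described above.  The names <tt>%caller</tt> and <tt>%fill</tt> are
+ hypothetical; <tt>%fill</tt> is assumed to write through its pointer
+ argument:</p>
+ 
+ <pre>
+   declare void %fill(int*)
+ 
+   int %caller() {
+   entry:
+     %x = alloca int                      <i>; stack slot for a local variable</i>
+     store int 0, int* %x
+     call void %fill(int* %x)             <i>; the callee may write through %x</i>
+     %v = load int* %x
+     ret int %v                           <i>; the slot is reclaimed on return</i>
+   }
+ </pre>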
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_load">'<tt>load</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = load <ty>* <pointer><br>  <result> = volatile load <ty>* <pointer><br></pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>load</tt>' instruction is used to read from memory.</p>
+ <h5>Arguments:</h5>
+ <p>The argument to the '<tt>load</tt>' instruction specifies the memory
+ address from which to load.  The pointer must point to a <a
+  href="#t_firstclass">first class</a> type.  If the <tt>load</tt> is
+ marked as <tt>volatile</tt>, then the optimizer is not allowed to modify
+ the number or order of execution of this <tt>load</tt> with other
+ volatile <tt>load</tt> and <tt><a href="#i_store">store</a></tt>
+ instructions. </p>
+ <h5>Semantics:</h5>
+ <p>The location of memory pointed to is loaded.</p>
+ <h5>Examples:</h5>
+ <pre>  %ptr = <a href="#i_alloca">alloca</a> int                               <i>; yields {int*}:ptr</i>
+   <a
+  href="#i_store">store</a> int 3, int* %ptr                          <i>; yields {void}</i>
+   %val = load int* %ptr                           <i>; yields {int}:val = int 3</i>
+ </pre>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_store">'<tt>store</tt>'
+ Instruction</a> </div>
+ <h5>Syntax:</h5>
+ <pre>  store <ty> <value>, <ty>* <pointer>                   <i>; yields {void}</i>
+   volatile store <ty> <value>, <ty>* <pointer>                   <i>; yields {void}</i>
+ </pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>store</tt>' instruction is used to write to memory.</p>
+ <h5>Arguments:</h5>
+ <p>There are two arguments to the '<tt>store</tt>' instruction: a value
+ to store and an address in which to store it.  The type of the '<tt><pointer></tt>'
+ operand must be a pointer to the type of the '<tt><value></tt>'
+ operand. If the <tt>store</tt> is marked as <tt>volatile</tt>, then the
+ optimizer is not allowed to modify the number or order of execution of
+ this <tt>store</tt> with other volatile <tt><a href="#i_load">load</a></tt>
+ and <tt>store</tt> instructions.</p>
+ <h5>Semantics:</h5>
+ <p>The contents of memory are updated to contain '<tt><value></tt>'
+ at the location specified by the '<tt><pointer></tt>' operand.</p>
+ <h5>Example:</h5>
+ <pre>  %ptr = <a href="#i_alloca">alloca</a> int                               <i>; yields {int*}:ptr</i>
+   <a
+  href="#i_store">store</a> int 3, int* %ptr                          <i>; yields {void}</i>
+   %val = load int* %ptr                           <i>; yields {int}:val = int 3</i>
+ </pre>
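+ <p>As a sketch of the <tt>volatile</tt> forms, suppose <tt>%reg</tt> is a
+ hypothetical <tt>int*</tt> that refers to a memory-mapped device register.
+ Marking both accesses volatile keeps the optimizer from changing how many
+ times they execute or reordering them relative to each other:</p>
+ <pre>  volatile store int 1, int* %reg                 <i>; yields {void}</i>
+   %status = volatile load int* %reg               <i>; yields {int}:status</i>
+ </pre>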
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_getelementptr">'<tt>getelementptr</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>
+   <result> = getelementptr <ty>* <ptrval>{, <ty> <idx>}*
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>getelementptr</tt>' instruction is used to get the address of a
+ subelement of an aggregate data structure.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>This instruction takes a pointer followed by a list of integer indices that
+ indicate which elements of the aggregate object to index to.  The actual types
+ of the indices depend on the type of the first pointer argument.  The
+ '<tt>getelementptr</tt>' instruction is used to index down through the type
+ levels of a structure or to a specific index in an array.  When indexing into a
+ structure, only <tt>uint</tt>
+ integer constants are allowed.  When indexing into an array or pointer,
+ <tt>uint</tt>, <tt>int</tt>, <tt>ulong</tt>, and <tt>long</tt> indexes are
+ allowed, and they need not be constants.</p>
+ 
+ <p>For example, let's consider a C code fragment and how it gets
+ compiled to LLVM:</p>
+ 
+ <pre>
+   struct RT {
+     char A;
+     int B[10][20];
+     char C;
+   };
+   struct ST {
+     int X;
+     double Y;
+     struct RT Z;
+   };
+ 
+   int *foo(struct ST *s) {
+     return &s[1].Z.B[5][13];
+   }
+ </pre>
+ 
+ <p>The LLVM code generated by the GCC frontend is:</p>
+ 
+ <pre>
+   %RT = type { sbyte, [10 x [20 x int]], sbyte }
+   %ST = type { int, double, %RT }
+ 
+   implementation
+ 
+   int* %foo(%ST* %s) {
+   entry:
+     %reg = getelementptr %ST* %s, int 1, uint 2, uint 1, int 5, int 13
+     ret int* %reg
+   }
+ </pre>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The index types specified for the '<tt>getelementptr</tt>' instruction depend
+ on the pointer type that is being indexed into. <a href="#t_pointer">Pointer</a>
+ and <a href="#t_array">array</a> types require <tt>uint</tt>, <tt>int</tt>,
+ <tt>ulong</tt>, or <tt>long</tt> values, and <a href="#t_struct">structure</a>
+ types require <tt>uint</tt> <b>constants</b>.</p>
+ 
+ <p>In the example above, the first index is indexing into the '<tt>%ST*</tt>'
+ type, which is a pointer, yielding a '<tt>%ST</tt>' = '<tt>{ int, double, %RT
+ }</tt>' type, a structure.  The second index indexes into the third element of
+ the structure, yielding a '<tt>%RT</tt>' = '<tt>{ sbyte, [10 x [20 x int]],
+ sbyte }</tt>' type, another structure.  The third index indexes into the second
+ element of the structure, yielding a '<tt>[10 x [20 x int]]</tt>' type, an
+ array.  The two dimensions of the array are subscripted into, yielding an
+ '<tt>int</tt>' type.  The '<tt>getelementptr</tt>' instruction returns a pointer
+ to this element, thus computing a value of '<tt>int*</tt>' type.</p>
+ 
+ <p>Note that it is perfectly legal to index partially through a
+ structure, returning a pointer to an inner element.  Because of this,
+ the LLVM code for the given testcase is equivalent to:</p>
+ 
+ <pre>
+   int* %foo(%ST* %s) {
+     %t1 = getelementptr %ST* %s, int 1                        <i>; yields %ST*:%t1</i>
+     %t2 = getelementptr %ST* %t1, int 0, uint 2               <i>; yields %RT*:%t2</i>
+     %t3 = getelementptr %RT* %t2, int 0, uint 1               <i>; yields [10 x [20 x int]]*:%t3</i>
+     %t4 = getelementptr [10 x [20 x int]]* %t3, int 0, int 5  <i>; yields [20 x int]*:%t4</i>
+     %t5 = getelementptr [20 x int]* %t4, int 0, int 13        <i>; yields int*:%t5</i>
+     ret int* %t5
+   }
+ </pre>
+ 
+ <p>Note that it is undefined to access an array out of bounds: array and 
+ pointer indexes must always be within the defined bounds of the array type.
+ The one exception to this rule is zero length arrays.  These arrays are
+ defined to be accessible as variable length arrays, which requires access
+ beyond the zero'th element.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+     <i>; yields [12 x ubyte]*:aptr</i>
+     %aptr = getelementptr {int, [12 x ubyte]}* %sptr, long 0, uint 1
+ </pre>
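+ 
+ <p>Array and pointer indexes need not be constants.  As a sketch reusing the
+ <tt>%RT</tt> and <tt>%ST</tt> types defined earlier in this section, a
+ hypothetical function <tt>%bar</tt> that returns the address of
+ <tt>s->Z.B[i][j]</tt> could be written as:</p>
+ 
+ <pre>
+   int* %bar(%ST* %s, int %i, int %j) {
+   entry:
+     <i>; int 0 steps through the pointer itself; uint 2 and uint 1 are the
+     ; constant structure field numbers; %i and %j index the two array levels.</i>
+     %reg = getelementptr %ST* %s, int 0, uint 2, uint 1, int %i, int %j
+     ret int* %reg
+   }
+ </pre>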
+ 
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="otherops">Other Operations</a> </div>
+ <div class="doc_text">
+ <p>The instructions in this category are the "miscellaneous"
+ instructions, which defy better classification.</p>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection"> <a name="i_phi">'<tt>phi</tt>'
+ Instruction</a> </div>
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  <result> = phi <ty> [ <val0>, <label0>], ...<br></pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>phi</tt>' instruction is used to implement the φ node in
+ the SSA graph representing the function.</p>
+ <h5>Arguments:</h5>
+ <p>The type of the incoming values is specified with the first type
+ field. After this, the '<tt>phi</tt>' instruction takes a list of pairs
+ as arguments, with one pair for each predecessor basic block of the
+ current block.  Only values of <a href="#t_firstclass">first class</a>
+ type may be used as the value arguments to the PHI node.  Only labels
+ may be used as the label arguments.</p>
+ <p>There must be no non-phi instructions between the start of a basic
+ block and the PHI instructions: i.e. PHI instructions must be first in
+ a basic block.</p>
+ <h5>Semantics:</h5>
+ <p>At runtime, the '<tt>phi</tt>' instruction logically takes on the
+ value specified by the parameter, depending on which basic block we
+ came from in the last <a href="#terminators">terminator</a> instruction.</p>
+ <h5>Example:</h5>
+ <pre>Loop:       ; Infinite loop that counts from 0 on up...<br>  %indvar = phi uint [ 0, %LoopHeader ], [ %nextindvar, %Loop ]<br>  %nextindvar = add uint %indvar, 1<br>  br label %Loop<br></pre>
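+ <p>A loop that terminates shows both incoming edges of each PHI node in use.
+ The following sketch, with the hypothetical name <tt>%sum_to</tt>, adds the
+ integers from 0 up to but not including <tt>%n</tt>:</p>
+ <pre>
+   uint %sum_to(uint %n) {
+   entry:
+     br label %Loop
+ 
+   Loop:
+     <i>; On entry from %entry both values are 0; on the back edge from %Body
+     ; they take the values computed in the previous iteration.</i>
+     %i   = phi uint [ 0, %entry ], [ %nexti, %Body ]
+     %sum = phi uint [ 0, %entry ], [ %nextsum, %Body ]
+     %more = setlt uint %i, %n
+     br bool %more, label %Body, label %Exit
+ 
+   Body:
+     %nextsum = add uint %sum, %i
+     %nexti   = add uint %i, 1
+     br label %Loop
+ 
+   Exit:
+     ret uint %sum
+   }
+ </pre>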
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_cast">'<tt>cast .. to</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = cast <ty> <value> to <ty2>             <i>; yields ty2</i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>cast</tt>' instruction is used as the primitive means to convert
+ integers to floating point, change data type sizes, and break type safety (by
+ casting pointers).
+ </p>
+ 
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The '<tt>cast</tt>' instruction takes a value to cast, which must be a first
+ class value, and a type to cast it to, which must also be a <a
+ href="#t_firstclass">first class</a> type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ This instruction follows the C rules for explicit casts when determining how the
+ data being cast must change to fit in its new container.
+ </p>
+ 
+ <p>
+ When casting to bool, any value that would be considered true in the context of
+ a C '<tt>if</tt>' condition is converted to the boolean '<tt>true</tt>' value;
+ all other values are converted to '<tt>false</tt>'.
+ </p>
+ 
+ <p>
+ When extending an integral value from a type of one signedness to another (for
+ example '<tt>sbyte</tt>' to '<tt>ulong</tt>'), the value is sign-extended if the
+ <b>source</b> value is signed, and zero-extended if the source value is
+ unsigned. <tt>bool</tt> values are always zero extended into either zero or
+ one.
+ </p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %X = cast int 257 to ubyte              <i>; yields ubyte:1</i>
+   %Y = cast int 123 to bool               <i>; yields bool:true</i>
+ </pre>
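+ 
+ <p>The extension rule can be checked on a few constants.  In this sketch the
+ choice between sign and zero extension follows the <b>source</b> type, as
+ stated above:</p>
+ 
+ <pre>
+   %A = cast sbyte -1 to ulong             <i>; signed source, sign-extended: yields ulong:18446744073709551615</i>
+   %B = cast ubyte 255 to long             <i>; unsigned source, zero-extended: yields long:255</i>
+   %C = cast sbyte -1 to uint              <i>; signed source, sign-extended: yields uint:4294967295</i>
+   %D = cast int 0 to bool                 <i>; yields bool:false</i>
+ </pre>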
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+    <a name="i_select">'<tt>select</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <result> = select bool <cond>, <ty> <val1>, <ty> <val2>             <i>; yields ty</i>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>select</tt>' instruction is used to choose one value based on a
+ condition, without branching.
+ </p>
+ 
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The '<tt>select</tt>' instruction requires a boolean value indicating the condition, and two values of the same <a href="#t_firstclass">first class</a> type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ If the boolean condition evaluates to true, the instruction returns the first
+ value argument; otherwise, it returns the second value argument.
+ </p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %X = select bool true, ubyte 17, ubyte 42          <i>; yields ubyte:17</i>
+ </pre>
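+ 
+ <p>A common pattern feeds '<tt>select</tt>' from a comparison.  This sketch,
+ using the hypothetical name <tt>%max</tt>, returns the larger of two signed
+ integers without a conditional branch:</p>
+ 
+ <pre>
+   int %max(int %a, int %b) {
+   entry:
+     %agtb = setgt int %a, %b                   <i>; yields {bool}:agtb</i>
+     %m = select bool %agtb, int %a, int %b     <i>; yields int</i>
+     ret int %m
+   }
+ </pre>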
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_call">'<tt>call</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   <result> = [tail] call [<a href="#callingconv">cconv</a>] <ty>* <fnptrval>(<param list>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>call</tt>' instruction represents a simple function call.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>This instruction requires several arguments:</p>
+ 
+ <ol>
+   <li>
+     <p>The optional "tail" marker indicates that the callee function does not
+     access any allocas or varargs in the caller.  If the "tail" marker is
+     present, the function call is eligible for tail call optimization.  Note
+     that calls may be marked "tail" even if they do not occur before a <a
+     href="#i_ret"><tt>ret</tt></a> instruction.</p>
+   </li>
+   <li>
+     <p>The optional "cconv" marker indicates which <a href="#callingconv">calling
+     convention</a> the call should use.  If none is specified, the call defaults
+     to using C calling conventions.</p>
+   </li>
+   <li>
+     <p>'<tt>ty</tt>': shall be the signature of the pointer to function value
+     being invoked.  The argument types must match the types implied by this
+     signature.  This type can be omitted if the function is not varargs and
+     if the function type does not return a pointer to a function.</p>
+   </li>
+   <li>
+     <p>'<tt>fnptrval</tt>': An LLVM value containing a pointer to a function to
+     be invoked. In most cases, this is a direct function invocation, but
+     indirect <tt>call</tt>s are just as possible, calling an arbitrary pointer
+     to function value.</p>
+   </li>
+   <li>
+     <p>'<tt>param list</tt>': argument list whose types match the
+     function signature argument types. All arguments must be of 
+     <a href="#t_firstclass">first class</a> type. If the function signature 
+     indicates the function accepts a variable number of arguments, the extra 
+     arguments can be specified.</p>
+   </li>
+ </ol>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>call</tt>' instruction is used to cause control flow to
+ transfer to a specified function, with its incoming arguments bound to
+ the specified values. Upon a '<tt><a href="#i_ret">ret</a></tt>'
+ instruction in the called function, control flow continues with the
+ instruction after the function call, and the return value of the
+ function is bound to the result argument.  This is a simpler case of
+ the <a href="#i_invoke">invoke</a> instruction.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <pre>
+   %retval = call int %test(int %argc)
+   call int(sbyte*, ...) *%printf(sbyte* %msg, int 12, sbyte 42)
+   %X = tail call int %foo()
+   %Y = tail call <a href="#callingconv">fastcc</a> int %foo()
+ </pre>
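+ 
+ <p>An indirect call goes through a function pointer value instead of a named
+ function.  In this sketch the names <tt>%apply</tt> and <tt>%fp</tt> are
+ hypothetical, and the function type is omitted from the call because the
+ callee is not varargs and does not return a pointer to a function:</p>
+ 
+ <pre>
+   int %apply(int (int)* %fp, int %x) {
+   entry:
+     %r = call int %fp(int %x)          <i>; calls whatever function %fp points to</i>
+     ret int %r
+   }
+ </pre>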
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_va_arg">'<tt>va_arg</tt>' Instruction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   <resultval> = va_arg <va_list*> <arglist>, <argty>
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>va_arg</tt>' instruction is used to access arguments passed through
+ the "variable argument" area of a function call.  It is used to implement the
+ <tt>va_arg</tt> macro in C.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>This instruction takes a <tt>va_list*</tt> value and the type of
+ the argument. It returns a value of the specified argument type and
+ increments the <tt>va_list</tt> to point to the next argument.  Again, the
+ actual type of <tt>va_list</tt> is target specific.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>va_arg</tt>' instruction loads an argument of the specified
+ type from the specified <tt>va_list</tt> and causes the
+ <tt>va_list</tt> to point to the next argument.  For more information,
+ see the variable argument handling <a href="#int_varargs">Intrinsic
+ Functions</a>.</p>
+ 
+ <p>It is legal for this instruction to be used in a function which does not
+ take a variable number of arguments, for example, the <tt>vfprintf</tt>
+ function.</p>
+ 
+ <p><tt>va_arg</tt> is an LLVM instruction instead of an <a
+ href="#intrinsics">intrinsic function</a> because it takes a type as an
+ argument.</p>
+ 
+ <h5>Example:</h5>
+ 
+ <p>See the <a href="#int_varargs">variable argument processing</a> section.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"> <a name="intrinsics">Intrinsic Functions</a> </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM supports the notion of an "intrinsic function".  These functions have
+ well known names and semantics and are required to follow certain
+ restrictions. Overall, these functions represent an extension mechanism for
+ the LLVM language that does not require changing all of the transformations in
+ LLVM to add to the language (or the bytecode reader/writer, the parser,
+ etc...).</p>
+ 
+ <p>Intrinsic function names must all start with an "<tt>llvm.</tt>" prefix. This
+ prefix is reserved in LLVM for intrinsic names; thus, ordinary functions may
+ not use it.  Intrinsic functions must always be external functions: you cannot define
+ the body of intrinsic functions.  Intrinsic functions may only be used in call
+ or invoke instructions: it is illegal to take the address of an intrinsic
+ function.  Additionally, because intrinsic functions are part of the LLVM
+ language, it is required that they all be documented here if any are added.</p>
+ 
+ 
+ <p>To learn how to add an intrinsic function, please see the <a
+ href="ExtendingLLVM.html">Extending LLVM Guide</a>.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="int_varargs">Variable Argument Handling Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Variable argument support is defined in LLVM with the <a
+  href="#i_va_arg"><tt>va_arg</tt></a> instruction and these three
+ intrinsic functions.  These functions are related to the similarly
+ named macros defined in the <tt><stdarg.h></tt> header file.</p>
+ 
+ <p>All of these functions operate on arguments that use a
+ target-specific value type "<tt>va_list</tt>".  The LLVM assembly
+ language reference manual does not define what this type is, so all
+ transformations should be prepared to handle intrinsics with any type
+ used.</p>
+ 
+ <p>This example shows how the <a href="#i_va_arg"><tt>va_arg</tt></a>
+ instruction and the variable argument handling intrinsic functions are
+ used.</p>
+ 
+ <pre>
+ int %test(int %X, ...) {
+   ; Initialize variable argument processing
+   %ap = alloca sbyte*
+   call void %<a href="#i_va_start">llvm.va_start</a>(sbyte** %ap)
+ 
+   ; Read a single integer argument
+   %tmp = va_arg sbyte** %ap, int
+ 
+   ; Demonstrate usage of llvm.va_copy and llvm.va_end
+   %aq = alloca sbyte*
+   call void %<a href="#i_va_copy">llvm.va_copy</a>(sbyte** %aq, sbyte** %ap)
+   call void %<a href="#i_va_end">llvm.va_end</a>(sbyte** %aq)
+ 
+   ; Stop processing of arguments.
+   call void %<a href="#i_va_end">llvm.va_end</a>(sbyte** %ap)
+   ret int %tmp
+ }
+ </pre>
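+ 
+ <p>For comparison, the same sequence can be sketched in C with the
+ <tt>stdarg.h</tt> macros that these intrinsics are related to.  This is only an
+ illustration: the function name <tt>first_vararg</tt> is hypothetical and was
+ chosen to mirror <tt>%test</tt> above.</p>
+ 
```c
#include <stdarg.h>

/* Hypothetical C counterpart of %test: returns the first variable argument. */
int first_vararg(int x, ...) {
    va_list ap, aq;
    int tmp;

    (void)x;                 /* the fixed argument is otherwise unused, as in %test */
    va_start(ap, x);         /* corresponds to llvm.va_start */
    tmp = va_arg(ap, int);   /* corresponds to the va_arg instruction */

    va_copy(aq, ap);         /* corresponds to llvm.va_copy */
    va_end(aq);              /* each va_copy is matched by a va_end */

    va_end(ap);              /* each va_start is matched by a va_end */
    return tmp;
}
```
+ 
+ <p>Note that the C <tt>va_start</tt> macro names the last fixed argument,
+ whereas <tt>llvm.va_start</tt> takes only the pointer to the
+ <tt>va_list</tt>.</p>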
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_va_start">'<tt>llvm.va_start</tt>' Intrinsic</a>
+ </div>
+ 
+ 
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  declare void %llvm.va_start(<va_list>* <arglist>)<br></pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>llvm.va_start</tt>' intrinsic initializes
+ <tt>*<arglist></tt> for subsequent use by <tt><a
+ href="#i_va_arg">va_arg</a></tt>.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The argument is a pointer to a <tt>va_list</tt> element to initialize.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>llvm.va_start</tt>' intrinsic works just like the <tt>va_start</tt>
+ macro available in C.  In a target-dependent way, it initializes the
+ <tt>va_list</tt> element the argument points to, so that the next call to
+ <tt>va_arg</tt> will produce the first variable argument passed to the function.
+ Unlike the C <tt>va_start</tt> macro, this intrinsic does not need to know the
+ last argument of the function; the compiler can figure that out.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+  <a name="i_va_end">'<tt>llvm.va_end</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ <h5>Syntax:</h5>
+ <pre>  declare void %llvm.va_end(<va_list>* <arglist>)<br></pre>
+ <h5>Overview:</h5>
+ <p>The '<tt>llvm.va_end</tt>' intrinsic destroys <tt>*<arglist></tt>,
+ which has been initialized previously with <tt><a href="#i_va_start">llvm.va_start</a></tt>
+ or <tt><a href="#i_va_copy">llvm.va_copy</a></tt>.</p>
+ <h5>Arguments:</h5>
+ <p>The argument is a pointer to a <tt>va_list</tt> to destroy.</p>
+ <h5>Semantics:</h5>
+ <p>The '<tt>llvm.va_end</tt>' intrinsic works just like the <tt>va_end</tt>
+ macro available in C.  In a target-dependent way, it destroys the <tt>va_list</tt>.
+ Calls to <a href="#i_va_start"><tt>llvm.va_start</tt></a> and <a
+  href="#i_va_copy"><tt>llvm.va_copy</tt></a> must be matched exactly
+ with calls to <tt>llvm.va_end</tt>.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_va_copy">'<tt>llvm.va_copy</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   declare void %llvm.va_copy(<va_list>* <destarglist>,
+                                           <va_list>* <srcarglist>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>llvm.va_copy</tt>' intrinsic copies the current argument position from
+ the source argument list to the destination argument list.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The first argument is a pointer to a <tt>va_list</tt> element to initialize.
+ The second argument is a pointer to a <tt>va_list</tt> element to copy from.</p>
+ 
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>llvm.va_copy</tt>' intrinsic works just like the <tt>va_copy</tt> macro
+ available in C.  In a target-dependent way, it copies the source
+ <tt>va_list</tt> element into the destination list.  This intrinsic is necessary
+ because the <tt><a href="#i_va_start">llvm.va_start</a></tt> intrinsic may be
+ arbitrarily complex and require memory allocation, for example.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="int_gc">Accurate Garbage Collection Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ LLVM support for <a href="GarbageCollection.html">Accurate Garbage
+ Collection</a> requires the implementation and generation of these intrinsics.
+ These intrinsics allow identification of <a href="#i_gcroot">GC roots on the
+ stack</a>, as well as garbage collector implementations that require <a
+ href="#i_gcread">read</a> and <a href="#i_gcwrite">write</a> barriers.
+ Front-ends for type-safe garbage collected languages should generate these
+ intrinsics to make use of the LLVM garbage collectors.  For more details, see <a
+ href="GarbageCollection.html">Accurate Garbage Collection with LLVM</a>.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_gcroot">'<tt>llvm.gcroot</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   declare void %llvm.gcroot(<ty>** %ptrloc, <ty2>* %metadata)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>llvm.gcroot</tt>' intrinsic declares the existence of a GC root to
+ the code generator, and allows some metadata to be associated with it.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The first argument specifies the address of a stack object that contains the
+ root pointer.  The second pointer (which must be either a constant or a global
+ value address) contains the meta-data to be associated with the root.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>At runtime, a call to this intrinsic stores a null pointer into the "ptrloc"
+ location.  At compile-time, the code generator generates information to allow
+ the runtime to find the pointer at GC safe points.
+ </p>
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_gcread">'<tt>llvm.gcread</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   declare sbyte* %llvm.gcread(sbyte* %ObjPtr, sbyte** %Ptr)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>llvm.gcread</tt>' intrinsic identifies reads of references from heap
+ locations, allowing garbage collector implementations that require read
+ barriers.</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The second argument is the address to read from, which should be an address
+ allocated from the garbage collector.  The first argument is a pointer to the
+ start of the referenced object, if needed by the language runtime (otherwise
+ null).</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>llvm.gcread</tt>' intrinsic has the same semantics as a load
+ instruction, but may be replaced with substantially more complex code by the
+ garbage collector runtime, as needed.</p>
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_gcwrite">'<tt>llvm.gcwrite</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ 
+ <pre>
+   declare void %llvm.gcwrite(sbyte* %P1, sbyte* %Obj, sbyte** %P2)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>The '<tt>llvm.gcwrite</tt>' intrinsic identifies writes of references to heap
+ locations, allowing garbage collector implementations that require write
+ barriers (such as generational or reference counting collectors).</p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>The first argument is the reference to store, the second is the start of the
+ object to store it to, and the third is the address of the field of Obj to 
+ store to.  If the runtime does not require a pointer to the object, Obj may be
+ null.</p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>The '<tt>llvm.gcwrite</tt>' intrinsic has the same semantics as a store
+ instruction, but may be replaced with substantially more complex code by the
+ garbage collector runtime, as needed.</p>
+ 
+ </div>
+ 
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="int_codegen">Code Generator Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ These intrinsics are provided by LLVM to expose special features that may only
+ be implemented with code generator support.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_returnaddress">'<tt>llvm.returnaddress</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare sbyte *%llvm.returnaddress(uint <level>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.returnaddress</tt>' intrinsic returns a target-specific value
+ indicating the return address of the current function or one of its callers.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The argument to this intrinsic indicates which function to return the address
+ for.  Zero indicates the calling function, one indicates its caller, etc.  The
+ argument is <b>required</b> to be a constant integer value.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.returnaddress</tt>' intrinsic either returns a pointer indicating
+ the return address of the specified call frame, or zero if it cannot be
+ identified.  The value returned by this intrinsic is likely to be incorrect or 0
+ for arguments other than zero, so it should only be used for debugging purposes.
+ </p>
+ 
+ <p>
+ Note that calling this intrinsic does not prevent function inlining or other
+ aggressive transformations, so the value returned may not be that of the obvious
+ source-language caller.
+ </p>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_frameaddress">'<tt>llvm.frameaddress</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare sbyte *%llvm.frameaddress(uint <level>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.frameaddress</tt>' intrinsic returns the target-specific frame
+ pointer value for the specified stack frame.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The argument to this intrinsic indicates which function to return the frame
+ pointer for.  Zero indicates the calling function, one indicates its caller,
+ etc.  The argument is <b>required</b> to be a constant integer value.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.frameaddress</tt>' intrinsic either returns a pointer indicating
+ the frame address of the specified call frame, or zero if it cannot be
+ identified.  The value returned by this intrinsic is likely to be incorrect or 0
+ for arguments other than zero, so it should only be used for debugging purposes.
+ </p>
+ 
+ <p>
+ Note that calling this intrinsic does not prevent function inlining or other
+ aggressive transformations, so the value returned may not be that of the obvious
+ source-language caller.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_stacksave">'<tt>llvm.stacksave</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare sbyte *%llvm.stacksave()
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.stacksave</tt>' intrinsic is used to remember the current state of
+ the function stack, for use with <a href="#i_stackrestore">
+ <tt>llvm.stackrestore</tt></a>.  This is useful for implementing language
+ features like scoped automatic variable sized arrays in C99.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ This intrinsic returns an opaque pointer value that can be passed to <a
+ href="#i_stackrestore"><tt>llvm.stackrestore</tt></a>.  When an
+ <tt>llvm.stackrestore</tt> intrinsic is executed with a value saved from 
+ <tt>llvm.stacksave</tt>, it effectively restores the state of the stack to the
+ state it was in when the <tt>llvm.stacksave</tt> intrinsic executed.  In
+ practice, this pops any <a href="#i_alloca">alloca</a> blocks from the stack
+ that were allocated after the <tt>llvm.stacksave</tt> was executed.
+ </p>
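+ 
+ <p>
+ As an illustration of the source-level feature this supports, the C99 sketch
+ below declares a variable-sized array inside a loop body, so its storage is
+ released at the end of every iteration.  The helper name is hypothetical, and
+ the comments only suggest where a front-end could place the two intrinsics; how
+ any particular front-end actually lowers this is an assumption.
+ </p>
+ 
```c
#include <stddef.h>

/* Hypothetical helper: adds up 1..k for every k in 1..n, using a scoped
   variable-sized array as scratch space. */
long sum_of_prefix_sums(size_t n) {
    long total = 0;
    for (size_t k = 1; k <= n; ++k) {
        /* A front-end could call llvm.stacksave at this point ... */
        int scratch[k];                  /* variable-sized, allocated on the stack */
        for (size_t i = 0; i < k; ++i)
            scratch[i] = (int)(i + 1);
        for (size_t i = 0; i < k; ++i)
            total += scratch[i];
        /* ... and llvm.stackrestore at this point, so the array is popped
           before the next iteration allocates a new one. */
    }
    return total;
}
```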
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_stackrestore">'<tt>llvm.stackrestore</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare void %llvm.stackrestore(sbyte* %ptr)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.stackrestore</tt>' intrinsic is used to restore the state of
+ the function stack to the state it was in when the corresponding <a
+ href="#i_stacksave"><tt>llvm.stacksave</tt></a> intrinsic executed.  This is
+ useful for implementing language features like scoped automatic variable sized
+ arrays in C99.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ See the description for <a href="#i_stacksave"><tt>llvm.stacksave</tt></a>.
+ </p>
+ 
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_prefetch">'<tt>llvm.prefetch</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare void %llvm.prefetch(sbyte * <address>,
+                                 uint <rw>, uint <locality>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ 
+ <p>
+ The '<tt>llvm.prefetch</tt>' intrinsic is a hint to the code generator to insert
+ a prefetch instruction if supported; otherwise, it is a noop.  Prefetches have
+ no effect on the behavior of the program but can change its performance
+ characteristics.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ <tt>address</tt> is the address to be prefetched, <tt>rw</tt> is the specifier
+ determining if the fetch should be for a read (0) or write (1), and
+ <tt>locality</tt> is a temporal locality specifier ranging from 0 (no
+ locality) to 3 (extremely local, keep in cache).  The <tt>rw</tt> and
+ <tt>locality</tt> arguments must be constant integers.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ This intrinsic does not modify the behavior of the program.  In particular,
+ prefetches cannot trap and do not produce a value.  On targets that support this
+ intrinsic, the prefetch can provide hints to the processor cache for better
+ performance.
+ </p>
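+ 
+ <p>
+ The argument layout resembles the <tt>__builtin_prefetch(addr, rw,
+ locality)</tt> extension provided by GCC-compatible C compilers.  The sketch
+ below uses that builtin to show the intended usage pattern; it assumes such a
+ compiler, the function name and the look-ahead distance of 16 elements are
+ arbitrary choices, and whether a given front-end maps the builtin onto
+ <tt>llvm.prefetch</tt> is not stated here.
+ </p>
+ 
```c
#include <stddef.h>

/* Sums an array while hinting that data a few elements ahead will be read.
   __builtin_prefetch is a GCC/Clang extension; its rw and locality arguments
   must be compile-time constants. */
long sum_with_prefetch(const int *data, size_t n) {
    long total = 0;
    for (size_t i = 0; i < n; ++i) {
        if (i + 16 < n)
            __builtin_prefetch(&data[i + 16], 0 /* read */, 3 /* keep in cache */);
        total += data[i];
    }
    return total;
}
```
+ 
+ <p>
+ Removing the prefetch call leaves the result unchanged, which matches the rule
+ that a prefetch may affect performance but not program behavior.
+ </p>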
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_pcmarker">'<tt>llvm.pcmarker</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare void %llvm.pcmarker( uint <id> )
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ 
+ <p>
+ The '<tt>llvm.pcmarker</tt>' intrinsic is a method to export a Program Counter
+ (PC) in a region of 
+ code to simulators and other tools.  The method is target specific, but it is 
+ expected that the marker will use exported symbols to transmit the PC of the marker.
+ The marker makes no guarantees that it will remain with any specific instruction 
+ after optimizations.  It is possible that the presence of a marker will inhibit 
+ optimizations.  The intended use is to be inserted after optimizations to allow
+ correlations of simulation runs.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ <tt>id</tt> is a numerical id identifying the marker.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ This intrinsic does not modify the behavior of the program.  Backends that do not 
+ support this intrinsic may ignore it.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_readcyclecounter">'<tt>llvm.readcyclecounter</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare ulong %llvm.readcyclecounter( )
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ 
+ <p>
+ The '<tt>llvm.readcyclecounter</tt>' intrinsic provides access to the cycle 
+ counter register (or similar low latency, high accuracy clocks) on those targets
+ that support it.  On X86, it should map to RDTSC.  On Alpha, it should map to RPCC.
+ As the backing counters overflow quickly (on the order of 9 seconds on Alpha), this
+ should only be used for small timings.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ When directly supported, reading the cycle counter should not modify any memory.  
+ Implementations are allowed to either return an application-specific value or a
+ system-wide value.  On backends without support, this is lowered to a constant 0.
+ </p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="int_libc">Standard C Library Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ LLVM provides intrinsics for a few important standard C library functions.
+ These intrinsics allow source-language front-ends to pass information about the
+ alignment of the pointer arguments to the code generator, providing opportunity
+ for more efficient code generation.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_memcpy">'<tt>llvm.memcpy</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare void %llvm.memcpy.i32(sbyte* <dest>, sbyte* <src>,
+                                 uint <len>, uint <align>)
+   declare void %llvm.memcpy.i64(sbyte* <dest>, sbyte* <src>,
+                                 ulong <len>, uint <align>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.memcpy.*</tt>' intrinsics copy a block of memory from the source
+ location to the destination location.
+ </p>
+ 
+ <p>
+ Note that, unlike the standard libc function, the <tt>llvm.memcpy.*</tt> 
+ intrinsics do not return a value, and take an extra alignment argument.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The first argument is a pointer to the destination, the second is a pointer to
+ the source.  The third argument is an integer argument
+ specifying the number of bytes to copy, and the fourth argument is the alignment
+ of the source and destination locations.
+ </p>
+ 
+ <p>
+ If the call to this intrinsic has an alignment value that is not 0 or 1, then
+ the caller guarantees that both the source and destination pointers are aligned
+ to that boundary.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.memcpy.*</tt>' intrinsics copy a block of memory from the source
+ location to the destination location, which are not allowed to overlap.  They
+ copy "len" bytes of memory over.  If the arguments are known to be aligned to
+ some boundary, this can be specified as the fourth argument, otherwise it should
+ be set to 0 or 1.
+ </p>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_memmove">'<tt>llvm.memmove</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare void %llvm.memmove.i32(sbyte* <dest>, sbyte* <src>,
+                                  uint <len>, uint <align>)
+   declare void %llvm.memmove.i64(sbyte* <dest>, sbyte* <src>,
+                                  ulong <len>, uint <align>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.memmove.*</tt>' intrinsics move a block of memory from the source
+ location to the destination location. They are similar to the
+ '<tt>llvm.memcpy</tt>' intrinsics but allow the two memory locations to overlap.
+ </p>
+ 
+ <p>
+ Note that, unlike the standard libc function, the <tt>llvm.memmove.*</tt> 
+ intrinsics do not return a value, and take an extra alignment argument.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The first argument is a pointer to the destination, the second is a pointer to
+ the source.  The third argument is an integer argument
+ specifying the number of bytes to copy, and the fourth argument is the alignment
+ of the source and destination locations.
+ </p>
+ 
+ <p>
+ If the call to this intrinsic has an alignment value that is not 0 or 1, then
+ the caller guarantees that the source and destination pointers are aligned to
+ that boundary.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.memmove.*</tt>' intrinsics copy a block of memory from the source
+ location to the destination location, which may overlap.  They
+ copy "len" bytes of memory over.  If the arguments are known to be aligned to
+ some boundary, this can be specified as the fourth argument, otherwise it should
+ be set to 0 or 1.
+ </p>
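+ 
+ <p>
+ The overlap rule is the same one that separates the libc <tt>memmove</tt> and
+ <tt>memcpy</tt> functions.  A minimal C sketch of a copy whose source and
+ destination overlap, and which therefore needs move semantics, is shown below;
+ the helper name is hypothetical.
+ </p>
+ 
```c
#include <string.h>

/* Shifts the first `count` bytes of buf right by one position.
   Source and destination overlap, so memmove (not memcpy) is required.
   The caller must provide room for count + 1 bytes. */
void shift_right_by_one(char *buf, size_t count) {
    memmove(buf + 1, buf, count);
}
```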
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_memset">'<tt>llvm.memset.*</tt>' Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare void %llvm.memset.i32(sbyte* <dest>, ubyte <val>,
+                                 uint <len>, uint <align>)
+   declare void %llvm.memset.i64(sbyte* <dest>, ubyte <val>,
+                                 ulong <len>, uint <align>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.memset.*</tt>' intrinsics fill a block of memory with a particular
+ byte value.
+ </p>
+ 
+ <p>
+ Note that, unlike the standard libc function, the <tt>llvm.memset</tt> intrinsic
+ does not return a value, and takes an extra alignment argument.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The first argument is a pointer to the destination to fill, the second is the
+ byte value to fill it with, the third argument is an integer
+ argument specifying the number of bytes to fill, and the fourth argument is the
+ known alignment of destination location.
+ </p>
+ 
+ <p>
+ If the call to this intrinsic has an alignment value that is not 0 or 1, then
+ the caller guarantees that the destination pointer is aligned to that boundary.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.memset.*</tt>' intrinsics fill "len" bytes of memory starting at
+ the destination location.  If the argument is known to be aligned to some
+ boundary, this can be specified as the fourth argument, otherwise it should be
+ set to 0 or 1.
+ </p>
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_isunordered">'<tt>llvm.isunordered.*</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare bool %llvm.isunordered.f32(float Val1, float  Val2)
+   declare bool %llvm.isunordered.f64(double Val1, double Val2)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.isunordered</tt>' intrinsics return true if either or both of the
+ specified floating point values is a NAN.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The arguments are floating point numbers of the same type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ If either or both of the arguments is a SNAN or QNAN, it returns true, otherwise
+ false.
+ </p>
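+ 
+ <p>
+ C99 provides an <tt>isunordered</tt> macro in <tt>math.h</tt> with the same
+ meaning.  The sketch below wraps it in a hypothetical helper to show the
+ expected results for ordinary values and for NaNs.
+ </p>
+ 
```c
#include <math.h>

/* Returns 1 if either argument is a NaN and 0 otherwise, using the C99
   isunordered macro. */
int either_is_nan(double a, double b) {
    return isunordered(a, b) ? 1 : 0;
}
```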
+ </div>
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_sqrt">'<tt>llvm.sqrt.*</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare float  %llvm.sqrt.f32(float Val)
+   declare double %llvm.sqrt.f64(double Val)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.sqrt</tt>' intrinsics return the sqrt of the specified operand,
+ returning the same value as the libm '<tt>sqrt</tt>' function would.  Unlike
+ <tt>sqrt</tt> in libm, however, <tt>llvm.sqrt</tt> has undefined behavior for
+ negative numbers (which allows for better optimization).
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The argument and return value are floating point numbers of the same type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ This function returns the sqrt of the specified operand if it is a non-negative
+ floating point number.
+ </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="int_manip">Bit Manipulation Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ LLVM provides intrinsics for a few important bit manipulation operations.
+ These allow efficient code generation for some algorithms.
+ </p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="i_bswap">'<tt>llvm.bswap.*</tt>' Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare ushort %llvm.bswap.i16(ushort <id>)
+   declare uint   %llvm.bswap.i32(uint <id>)
+   declare ulong  %llvm.bswap.i64(ulong <id>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.bswap</tt>' family of intrinsics is used to byteswap a 16, 32 or
+ 64 bit quantity.  These are useful for performing operations on data that is not
+ in the target's  native byte order.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The <tt>llvm.bswap.i16</tt> intrinsic returns a ushort value that has the high and low
+ byte of the input ushort swapped.  Similarly, the <tt>llvm.bswap.i32</tt> intrinsic
+ returns a uint value that has the four bytes of the input uint swapped, so that 
+ if the input bytes are numbered 0, 1, 2, 3 then the returned uint will have its
+ bytes in 3, 2, 1, 0 order.  The <tt>llvm.bswap.i64</tt> intrinsic extends this concept
+ to 64 bits.
+ </p>
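+ 
+ <p>
+ The byte reordering can be written out as a portable C reference.  This is a
+ sketch of the semantics only, with hypothetical function names; it says nothing
+ about how a code generator implements the intrinsics.
+ </p>
+ 
```c
#include <stdint.h>

/* Reference byte swap for a 32-bit value: bytes 0,1,2,3 become 3,2,1,0. */
uint32_t bswap32_ref(uint32_t v) {
    return ((v & 0x000000FFu) << 24) |
           ((v & 0x0000FF00u) << 8)  |
           ((v & 0x00FF0000u) >> 8)  |
           ((v & 0xFF000000u) >> 24);
}

/* Reference byte swap for a 16-bit value: high and low bytes trade places. */
uint16_t bswap16_ref(uint16_t v) {
    return (uint16_t)((v << 8) | (v >> 8));
}
```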
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="int_ctpop">'<tt>llvm.ctpop.*</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare ubyte  %llvm.ctpop.i8 (ubyte <src>)
+   declare ushort %llvm.ctpop.i16(ushort <src>)
+   declare uint   %llvm.ctpop.i32(uint <src>)
+   declare ulong  %llvm.ctpop.i64(ulong <src>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.ctpop</tt>' family of intrinsics counts the number of bits set in a 
+ value.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The only argument is the value to be counted.  The argument may be of any
+ unsigned integer type.  The return type must match the argument type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.ctpop</tt>' intrinsic counts the 1's in a variable.
+ </p>
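+ 
+ <p>
+ A portable C reference for the 32-bit case, given as a sketch of the semantics
+ with a hypothetical function name:
+ </p>
+ 
```c
#include <stdint.h>

/* Reference population count: number of bits set in a 32-bit value. */
uint32_t ctpop32_ref(uint32_t v) {
    uint32_t count = 0;
    while (v != 0) {
        v &= v - 1;   /* clear the lowest set bit */
        ++count;
    }
    return count;
}
```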
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="int_ctlz">'<tt>llvm.ctlz.*</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare ubyte  %llvm.ctlz.i8 (ubyte <src>)
+   declare ushort %llvm.ctlz.i16(ushort <src>)
+   declare uint   %llvm.ctlz.i32(uint <src>)
+   declare ulong  %llvm.ctlz.i64(ulong <src>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.ctlz</tt>' family of intrinsic functions counts the number of 
+ leading zeros in a variable.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The only argument is the value to be counted.  The argument may be of any
+ unsigned integer type. The return type must match the argument type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.ctlz</tt>' intrinsic counts the leading (most significant) zeros
+ in a variable.  If the src == 0 then the result is the size in bits of the type
+ of src. For example, <tt>llvm.ctlz(uint 2) = 30</tt>.
+ </p>
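+ 
+ <p>
+ A portable C reference for the 32-bit case, including the zero-input rule, is
+ sketched below.  The function name is hypothetical and the loop is written for
+ clarity rather than speed.
+ </p>
+ 
```c
#include <stdint.h>

/* Reference count of leading (most significant) zero bits in a 32-bit value.
   Returns 32 when the input is zero. */
uint32_t ctlz32_ref(uint32_t v) {
    uint32_t count = 0;
    uint32_t mask = 0x80000000u;
    while (mask != 0 && (v & mask) == 0) {
        ++count;
        mask >>= 1;
    }
    return count;
}
```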
+ </div>
+ 
+ 
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="int_cttz">'<tt>llvm.cttz.*</tt>' Intrinsic</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <h5>Syntax:</h5>
+ <pre>
+   declare ubyte  %llvm.cttz.i8 (ubyte <src>)
+   declare ushort %llvm.cttz.i16(ushort <src>)
+   declare uint   %llvm.cttz.i32(uint <src>)
+   declare ulong  %llvm.cttz.i64(ulong <src>)
+ </pre>
+ 
+ <h5>Overview:</h5>
+ 
+ <p>
+ The '<tt>llvm.cttz</tt>' family of intrinsic functions counts the number of 
+ trailing zeros.
+ </p>
+ 
+ <h5>Arguments:</h5>
+ 
+ <p>
+ The only argument is the value to be counted.  The argument may be of any
+ unsigned integer type.  The return type must match the argument type.
+ </p>
+ 
+ <h5>Semantics:</h5>
+ 
+ <p>
+ The '<tt>llvm.cttz</tt>' intrinsic counts the trailing (least significant) zeros
+ in a variable.  If the src == 0 then the result is the size in bits of the type
+ of src.  For example, <tt>llvm.cttz(2) = 1</tt>.
+ </p>
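+ 
+ <p>
+ A portable C reference for the 32-bit case, including the zero-input rule, is
+ sketched below.  The function name is hypothetical and the loop is written for
+ clarity rather than speed.
+ </p>
+ 
```c
#include <stdint.h>

/* Reference count of trailing (least significant) zero bits in a 32-bit value.
   Returns 32 when the input is zero. */
uint32_t cttz32_ref(uint32_t v) {
    uint32_t count = 0;
    if (v == 0)
        return 32;
    while ((v & 1u) == 0) {
        ++count;
        v >>= 1;
    }
    return count;
}
```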
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="int_debugger">Debugger Intrinsics</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ The LLVM debugger intrinsics (which all start with the <tt>llvm.dbg.</tt> prefix)
+ are described in the <a
+ href="SourceLevelDebugging.html#format_common_intrinsics">LLVM Source Level
+ Debugging</a> document.
+ </p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/Lexicon.html
diff -c /dev/null llvm-www/releases/1.8/docs/Lexicon.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/Lexicon.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,178 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>The LLVM Lexicon</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+   <meta name="author" content="Various">
+   <meta name="description" 
+   content="A glossary of terms used with the LLVM project.">
+ </head>
+ <body>
+ <div class="doc_title">The LLVM Lexicon</div>
+ <p class="doc_warning">NOTE: This document is a work in progress!</p>
+ <!-- *********************************************************************** -->
+ <div class="doc_section">Table Of Contents</div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <table>
+     <tr><th colspan="8"><b>- <a href="#A">A</a> -</b></th></tr>
+     <tr>
+       <td><a href="#ADCE">ADCE</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#B">B</a> -</b></th></tr>
+     <tr>
+       <td><a href="#BURS">BURS</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#C">C</a> -</b></th></tr>
+     <tr>
+       <td><a href="#CSE">CSE</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#D">D</a> -</b></th></tr>
+     <tr>
+       <td><a href="#DSA">DSA</a></td>
+       <td><a href="#DSE">DSE</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#I">I</a> -</b></th></tr>
+     <tr>
+       <td><a href="#IPA">IPA</a></td>
+       <td><a href="#IPO">IPO</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#L">L</a> -</b></th></tr>
+     <tr>
+       <td><a href="#LICM">LICM</a></td>
+       <td><a href="#Load-VN">Load-VN</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#P">P</a> -</b></th></tr>
+     <tr>
+       <td><a href="#PRE">PRE</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#R">R</a> -</b></th></tr>
+     <tr>
+       <td><a href="#Reassociation">Reassociation</a></td>
+     </tr>
+     <tr><th colspan="8"><b>- <a href="#S">S</a> -</b></th></tr>
+     <tr>
+       <td><a href="#SCC">SCC</a></td>
+       <td><a href="#SCCP">SCCP</a></td>
+       <td><a href="#SRoA">SRoA</a></td>
+       <td><a href="#SSA">SSA</a></td>
+     </tr>
+   </table>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">Definitions</div>
+ <!-- *********************************************************************** -->
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="A">- A -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="ADCE"><b>ADCE</b></a></dt>
+     <dd>Aggressive Dead Code Elimination</dd>
+   </dl>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="B">- B -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="BURS"><b>BURS</b></a></dt>
+     <dd>Bottom Up Rewriting System - A method of instruction selection for
+         code generation.  An example is the <a 
+ href="http://www.program-transformation.org/Transform/BURG">BURG</a> tool.</dd>
+   </dl>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="C">- C -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="CSE"><b>CSE</b></a></dt>
+     <dd>Common Subexpression Elimination. An optimization that removes common
+     subexpression computation. For example, <tt>(a+b)*(a+b)</tt> has two
+     subexpressions that are the same: <tt>(a+b)</tt>. This optimization would
+     perform the addition only once and then perform the multiply (but only if
+     it is computationally correct/safe).</dd>
+   </dl>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="D">- D -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="DSA"><b>DSA</b></a></dt>
+     <dd>Data Structure Analysis</dd>
+     <dt><a name="DSE"><b>DSE</b></a></dt>
+     <dd>Dead Store Elimination</dd>
+   </dl>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="I">- I -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="IPA"><b>IPA</b></a></dt>
+     <dd>Inter-Procedural Analysis. Refers to any variety of code analysis that
+     occurs between procedures, functions or compilation units (modules).</dd>
+     <dt><a name="IPO"><b>IPO</b></a></dt>
+     <dd>Inter-Procedural Optimization. Refers to any variety of code
+     optimization that occurs between procedures, functions or compilation units
+     (modules).</dd>
+   </dl>
+ </div>
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="L">- L -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="LICM"><b>LICM</b></a></dt>
+     <dd>Loop Invariant Code Motion</dd>
+     <dt><a name="Load-VN"><b>Load-VN</b></a></dt>
+     <dd>Load Value Numbering</dd>
+   </dl>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="P">- P -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="PRE"><b>PRE</b></a></dt>
+     <dd>Partial Redundancy Elimination</dd>
+   </dl>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="R">- R -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="Reassociation"><b>Reassociation</b></a></dt> <dd>Rearranging
+     associative expressions to promote better redundancy elimination and other
+     optimization.  For example, changing (A+B-A) into (B+A-A), permitting it to
+     be optimized into (B+0) then (B).
+   </dl>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="S">- S -</a></div>
+ <div class="doc_text">
+   <dl>
+     <dt><a name="SCC"><b>SCC</b></a></dt>
+     <dd>Strongly Connected Component</dd>
+     <dt><a name="SCCP"><b>SCCP</b></a></dt>
+     <dd>Sparse Conditional Constant Propagation</dd>
+     <dt><a name="SRoA"><b>SRoA</b></a></dt>
+     <dd>Scalar Replacement of Aggregates</dd>
+     <dt><a name="SSA"><b>SSA</b></a></dt>
+     <dd>Static Single Assignment</dd>
+   </dl>
+ </div>
+ <!-- *********************************************************************** -->
+ <hr>
+ <address> <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+  src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a><a
+  href="http://validator.w3.org/check/referer"><img
+  src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a><a
+  href="http://llvm.org/">The LLVM Team</a><br>
+ <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+ Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ <!-- vim: sw=2
+ -->
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/Makefile
diff -c /dev/null llvm-www/releases/1.8/docs/Makefile:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/Makefile	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,83 ----
+ ##===- docs/Makefile ---------------------------------------*- Makefile -*-===##
+ # 
+ #                     The LLVM Compiler Infrastructure
+ #
+ # This file was developed by the LLVM research group and is distributed under
+ # the University of Illinois Open Source License. See LICENSE.TXT for details.
+ # 
+ ##===----------------------------------------------------------------------===##
+ 
+ LEVEL      := ..
+ DIRS       := CommandGuide
+ 
+ ifdef BUILD_FOR_WEBSITE
+ PROJ_OBJ_DIR = .
+ DOXYGEN = doxygen
+ 
+ doxygen.cfg: doxygen.cfg.in
+ 	cat $< | sed 's/@abs_top_srcdir@/../g' | sed 's/@DOT@/dot/g' | \
+         sed 's/@PACKAGE_VERSION@/CVS/g' | sed 's/@abs_top_builddir@/../g' > $@
+ endif
+ 
+ include $(LEVEL)/Makefile.common
+ 
+ HTML       := $(wildcard $(PROJ_SRC_DIR)/*.html) \
+               $(wildcard $(PROJ_SRC_DIR)/*.css)
+ IMAGES     := $(wildcard $(PROJ_SRC_DIR)/img/*.*)
+ DOXYFILES  := doxygen.cfg.in doxygen.css doxygen.footer doxygen.header \
+               doxygen.intro
+ EXTRA_DIST := $(HTML) $(DOXYFILES) llvm.css CommandGuide img
+ 
+ .PHONY: install-html install-doxygen doxygen
+ 
+ ifeq ($(ENABLE_DOXYGEN),1)
+ install-local:: install-html install-doxygen
+ else
+ install-local:: install-html
+ endif
+ 
+ install-html: $(PROJ_OBJ_DIR)/html.tar.gz
+ 	$(Echo) Installing HTML documentation
+ 	$(Verb) $(MKDIR) $(PROJ_docsdir)/html
+ 	$(Verb) $(MKDIR) $(PROJ_docsdir)/html/img
+ 	$(Verb) $(DataInstall) $(HTML) $(PROJ_docsdir)/html
+ 	$(Verb) $(DataInstall) $(IMAGES) $(PROJ_docsdir)/html/img
+ 	$(Verb) $(DataInstall) $(PROJ_OBJ_DIR)/html.tar.gz $(PROJ_docsdir)
+ 
+ $(PROJ_OBJ_DIR)/html.tar.gz: $(HTML)
+ 	$(Echo) Packaging HTML documentation
+ 	$(Verb) $(RM) -rf $@ $(PROJ_OBJ_DIR)/html.tar
+ 	$(Verb) cd $(PROJ_SRC_DIR) && \
+ 	  $(TAR) cf $(PROJ_OBJ_DIR)/html.tar *.html
+ 	$(Verb) $(GZIP) $(PROJ_OBJ_DIR)/html.tar
+ 
+ install-doxygen: doxygen
+ 	$(Echo) Installing doxygen documentation
+ 	$(Verb) $(MKDIR) $(PROJ_docsdir)/html/doxygen
+ 	$(Verb) $(DataInstall) $(PROJ_OBJ_DIR)/doxygen.tar.gz $(PROJ_docsdir)
+ 	$(Verb) cd $(PROJ_OBJ_DIR)/doxygen && \
+ 	  $(FIND) . -type f -exec \
+ 	    $(DataInstall) {} $(PROJ_docsdir)/html/doxygen \;
+ 
+ doxygen: $(PROJ_OBJ_DIR)/doxygen.tar.gz
+ 
+ $(PROJ_OBJ_DIR)/doxygen.tar.gz: $(DOXYFILES) $(PROJ_OBJ_DIR)/doxygen.cfg
+ 	$(Echo) Building doxygen documentation
+ 	$(Verb) if test -e $(PROJ_OBJ_DIR)/doxygen ; then \
+ 	  $(RM) -rf $(PROJ_OBJ_DIR)/doxygen ; \
+ 	fi
+ 	$(Verb) $(DOXYGEN) $(PROJ_OBJ_DIR)/doxygen.cfg
+ 	$(Echo) Packaging doxygen documentation
+ 	$(Verb) $(RM) -rf $@ $(PROJ_OBJ_DIR)/doxygen.tar
+ 	$(Verb) $(TAR) cf $(PROJ_OBJ_DIR)/doxygen.tar doxygen
+ 	$(Verb) $(GZIP) $(PROJ_OBJ_DIR)/doxygen.tar
+ 	$(Verb) $(CP) $(PROJ_OBJ_DIR)/doxygen.tar.gz $(PROJ_OBJ_DIR)/doxygen/html/
+ 
+ userloc: $(LLVM_SRC_ROOT)/docs/userloc.html
+ 
+ $(LLVM_SRC_ROOT)/docs/userloc.html:
+ 	$(Echo) Making User LOC Table
+ 	$(Verb) cd $(LLVM_SRC_ROOT) ; ./utils/userloc.pl -details -recurse \
+ 	  -html lib include tools runtime utils examples autoconf test > docs/userloc.html
+ 	  

Index: llvm-www/releases/1.8/docs/MakefileGuide.html
diff -c /dev/null llvm-www/releases/1.8/docs/MakefileGuide.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/MakefileGuide.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1010 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>LLVM Makefile Guide</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">LLVM Makefile Guide</div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#general">General Concepts</a>
+     <ol>
+       <li><a href="#projects">Projects</a></li>
+       <li><a href="#varvalues">Variable Values</a></li>
+       <li><a href="#including">Including Makefiles</a>
+         <ol>
+           <li><a href="#Makefile">Makefile</a></li>
+           <li><a href="#Makefile.common">Makefile.common</a></li>
+           <li><a href="#Makefile.config">Makefile.config</a></li>
+           <li><a href="#Makefile.rules">Makefile.rules</a></li>
+         </ol>
+       </li>
+       <li><a href="#Comments">Comments</a></li>
+     </ol>
+   </li>
+   <li><a href="#tutorial">Tutorial</a>
+     <ol>
+       <li><a href="#libraries">Libraries</a>
+         <ol>
+ 	  <li><a href="#Modules">Bytecode Modules</a></li>
+ 	</ol>
+       </li>
+       <li><a href="#tools">Tools</a>
+         <ol>
+ 	  <li><a href="#JIT">JIT Tools</a></li>
+ 	</ol>
+       </li>
+       <li><a href="#projects">Projects</a></li>
+     </ol>
+   </li>
+   <li><a href="#targets">Targets Supported</a>
+     <ol>
+       <li><a href="#all">all</a></li>
+       <li><a href="#all-local">all-local</a></li>
+       <li><a href="#check">check</a></li>
+       <li><a href="#check-local">check-local</a></li>
+       <li><a href="#clean">clean</a></li>
+       <li><a href="#clean-local">clean-local</a></li>
+       <li><a href="#dist">dist</a></li>
+       <li><a href="#dist-check">dist-check</a></li>
+       <li><a href="#dist-clean">dist-clean</a></li>
+       <li><a href="#install">install</a></li>
+       <li><a href="#preconditions">preconditions</a></li>
+       <li><a href="#printvars">printvars</a></li>
+       <li><a href="#reconfigure">reconfigure</a></li>
+       <li><a href="#spotless">spotless</a></li>
+       <li><a href="#tags">tags</a></li>
+       <li><a href="#uninstall">uninstall</a></li>
+     </ol>
+   </li>
+   <li><a href="#variables">Using Variables</a>
+     <ol>
+       <li><a href="#setvars">Control Variables</a></li>
+       <li><a href="#overvars">Override Variables</a></li>
+       <li><a href="#getvars">Readable Variables</a></li>
+       <li><a href="#intvars">Internal Variables</a></li>
+     </ol>
+   </li>
+ </ol>
+ 
+ <div class="doc_author">    
+   <p>Written by <a href="mailto:reid at x10sys.com">Reid Spencer</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="introduction">Introduction </a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+   <p>This document provides <em>usage</em> information about the LLVM makefile 
+   system. While loosely patterned after the BSD makefile system, LLVM has 
+   departed from BSD in order to implement additional features that it needs.
+   Although makefile systems such as automake were attempted at one point, it
+   has become clear that the features needed by LLVM go too far beyond the
+   Makefile norm to use a more limited tool. Consequently, LLVM simply requires
+   GNU Make 3.79, a widely portable makefile processor. LLVM unabashedly makes
+   heavy use of the features of GNU Make, so the dependency on GNU Make is firm.
+   If you're not familiar with <tt>make</tt>, it is recommended that you read
+   the <a href="http://www.gnu.org/software/make/manual/make.html">GNU Make
+   Manual</a>.</p>
+   <p>While this document is rightly part of the 
+   <a href="ProgrammersManual.html">LLVM Programmer's Manual</a>, it is treated
+   separately here because of the volume of content and because it is often an
+   early source of bewilderment for new developers.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="general">General Concepts</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+   <p>The LLVM Makefile System is the component of LLVM that is responsible for
+   building the software, testing it,  generating distributions, checking those
+   distributions, installing and uninstalling, etc. It consists of several
+   files throughout the source tree. These files and other general concepts are
+   described in this section.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="projects">Projects</a></div>
+ <div class="doc_text">
+   <p>The LLVM Makefile System is quite generous. It not only builds its own
+   software, but it can build yours too. Built into the system is knowledge of
+   the <tt>llvm/projects</tt> directory. Any directory under <tt>projects</tt>
+   that has both a <tt>configure</tt> script and a <tt>Makefile</tt> is assumed
+   to be a project that uses the LLVM Makefile system.  Building software that
+   uses LLVM does not require the LLVM Makefile System nor even placement in the
+   <tt>llvm/projects</tt> directory. However, doing so will allow your project
+   to get up and running quickly by utilizing the built-in features that are used
+   to compile LLVM. LLVM compiles itself using the same features of the makefile
+   system as used for projects.</p>
+   <p>To set up your project's configuration, simply mimic the
+   <tt>llvm/projects/sample</tt> project. For further details, consult the
+   <a href="Projects.html">Projects.html</a> page.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="varvalues">Variable Values</a></div>
+ <div class="doc_text">
+   <p>To use the makefile system, you simply create a file named 
+   <tt>Makefile</tt> in your directory and declare values for certain variables. 
+   The variables and values that you select determine what the makefile system
+   will do. These variables enable rules and processing in the makefile system
+   that automatically Do The Right Thing™.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="including">Including Makefiles</a></div>
+ <div class="doc_text">
+   <p>Setting variables alone is not enough. You must include into your Makefile
+   additional files that provide the rules of the LLVM Makefile system. The 
+   various files involved are described in the sections that follow.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection"><a name="Makefile">Makefile</a></div>
+ <div class="doc_text">
+   <p>Each directory that participates in the build needs a file named
+   <tt>Makefile</tt>. This is the file first read by <tt>make</tt>. It has three
+   sections:</p>
+   <ol>
+     <li><a href="#setvars">Settable Variables</a> - Required variables that
+     must be set first.</li>
+     <li><a href="#Makefile.common">include <tt>$(LEVEL)/Makefile.common</tt></a>
+     - Include the LLVM Makefile system.</li>
+     <li><a href="#overvars">Override Variables</a> - Override variables set by
+     the LLVM Makefile system.</li>
+   </ol>
+ </div>
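+ 
+ <div class="doc_text">
+   <p>As a rough sketch of that three-part layout, a minimal <tt>Makefile</tt>
+   might look like the following. The relative path given to <tt>LEVEL</tt> and
+   the library name are assumptions chosen for illustration; adjust them to
+   match your own directory.</p>
+   <pre><tt>
+       # 1. Variables that must be set before the include
+       LEVEL = ../..
+       LIBRARYNAME = mylib
+ 
+       # 2. Pull in the LLVM Makefile system
+       include $(LEVEL)/Makefile.common
+ 
+       # 3. Override variables, if any, go after the include
+   </tt></pre>
+ </div>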
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection"><a name="Makefile.common">Makefile.common</a>
+ </div>
+ <div class="doc_text">
+   <p>Every project must have a <tt>Makefile.common</tt> file at its top source 
+   directory. This file serves three purposes:</p>
+   <ol>
+     <li>It includes the project's configuration makefile to obtain values
+     determined by the <tt>configure</tt> script. This is done by including the
+     <a href="#Makefile.config"><tt>$(LEVEL)/Makefile.config</tt></a> file.</li>
+     <li>It specifies any other (static) values that are needed throughout the
+     project. Only values that are used in all or a large proportion of the
+     project's directories should be placed here.</li>
+     <li>It includes the standard rules for the LLVM Makefile system,
+     <a href="#Makefile.rules"><tt>$(LLVM_SRC_ROOT)/Makefile.rules</tt></a>. 
+     This file is the "guts" of the LLVM Makefile system.</li>
+   </ol>
+ </div>
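+ 
+ <div class="doc_text">
+   <p>Put together, a project's <tt>Makefile.common</tt> could be as short as
+   the sketch below. The variable <tt>MY_PROJECT_OPTION</tt> is a hypothetical
+   placeholder for a project-wide value, not something defined by LLVM.</p>
+   <pre><tt>
+       # 1. Values determined by the configure script
+       include $(LEVEL)/Makefile.config
+ 
+       # 2. Static values shared by most directories (hypothetical example)
+       MY_PROJECT_OPTION = 1
+ 
+       # 3. The standard rules of the LLVM Makefile system
+       include $(LLVM_SRC_ROOT)/Makefile.rules
+   </tt></pre>
+ </div>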
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection"><a name="Makefile.config">Makefile.config</a>
+ </div>
+ <div class="doc_text">
+   <p>Every project must have a <tt>Makefile.config</tt> at the top of its
+   <em>build</em> directory. This file is <b>generated</b> by the
+   <tt>configure</tt> script from the pattern provided by the
+   <tt>Makefile.config.in</tt> file located at the top of the project's
+   <em>source</em> directory. The contents of this file depend largely on what
+   configuration items the project uses; however, most projects can get what
+   they need by just relying on LLVM's configuration found in
+   <tt>$(LLVM_OBJ_ROOT)/Makefile.config</tt>.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection"><a name="Makefile.rules">Makefile.rules</a></div>
+ <div class="doc_text">
+   <p>This file, located at <tt>$(LLVM_SRC_ROOT)/Makefile.rules</tt>, is the
+   heart of the LLVM Makefile System. It provides all the logic, dependencies,
+   and rules for building the targets supported by the system. What it does
+   largely depends on the values of <tt>make</tt>
+   <a href="#variables">variables</a> that have been set <em>before</em>
+   <tt>Makefile.rules</tt> is included.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="Comments">Comments</a></div>
+ <div class="doc_text">
+   <p>User Makefiles need not have comments in them unless the construction is
+   unusual or it does not strictly follow the rules and patterns of the LLVM
+   makefile system. Makefile comments are introduced with the pound (#) character.
+   The # character and any text following it, to the end of the line, are ignored
+   by <tt>make</tt>.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="tutorial">Tutorial</a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <p>This section provides some examples of the different kinds of modules you
+   can build with the LLVM makefile system. In general, each directory you 
+   provide will build a single object although that object may be composed of
+   additionally compiled components.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="libraries">Libraries</a></div>
+ <div class="doc_text">
+   <p>Only a few variable definitions are needed to build a regular library.
+   Normally, the makefile system will build all the software into a single
+   <tt>libname.o</tt> (pre-linked) object. This means the library is not
+   searchable and that the distinction between compilation units has been
+   dissolved. Optionally, you can ask for a shared library (.so), archive library
+   (.a) or to not have the default (relinked) library built. For example:</p>
+   <pre><tt>
+       LIBRARYNAME = mylib
+       SHARED_LIBRARY = 1
+       ARCHIVE_LIBRARY = 1
+       DONT_BUILD_RELINKED = 1
+   </tt></pre>
+   <p>says to build a library named "mylib" with both a shared library 
+   (<tt>mylib.so</tt>) and an archive library (<tt>mylib.a</tt>) version but
+   not to build the relinked object (<tt>mylib.o</tt>). The contents of all the
+   libraries produced will be the same; they are just constructed differently.
+   Note that you normally do not need to specify the sources involved. The LLVM
+   Makefile system will infer the source files from the contents of the source
+   directory.</p>
+   <p>The <tt>LOADABLE_MODULE=1</tt> directive can be used in conjunction with
+   <tt>SHARED_LIBRARY=1</tt> to indicate that the resulting shared library should
+   be openable with the <tt>dlopen</tt> function and searchable with the
+   <tt>dlsym</tt> function (or your operating system's equivalents). While this
+   isn't strictly necessary on Linux and a few other platforms, it is required
+   on systems like HP-UX and Darwin. You should use <tt>LOADABLE_MODULE</tt> for
+   any shared library that you intend to be loaded into a tool via the
+   <tt>-load</tt> option. See the 
+   <a href="WritingAnLLVMPass.html#makefile">WritingAnLLVMPass.html</a> document
+   for an example of why you might want to do this.</p>
+ </div>
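+ 
+ <div class="doc_text">
+   <p>For instance, a library meant to be loaded with <tt>-load</tt> might be
+   declared as follows. The name <tt>mypass</tt> is only a placeholder.</p>
+   <pre><tt>
+       LIBRARYNAME = mypass
+       SHARED_LIBRARY = 1
+       LOADABLE_MODULE = 1
+   </tt></pre>
+ </div>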
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection"><a name="Modules">Bytecode Modules</a></div>
+ <div class="doc_text">
+   <p>In some situations, it is desirable to build a single bytecode module from
+   a variety of sources, instead of an archive, shared library, or bytecode 
+   library. Bytecode modules can be specified in addition to any of the other
+   types of libraries by defining the <a href="#MODULE_NAME">MODULE_NAME</a>
+   variable. For example:</p>
+   <pre><tt>
+       LIBRARYNAME = mylib
+       BYTECODE_LIBRARY = 1
+       MODULE_NAME = mymod
+   </tt></pre>
+   <p>will build a module named <tt>mymod.bc</tt> from the sources in the
+   directory. This module will be an aggregation of all the bytecode modules 
+   derived from the sources. The example will also build a bytecode archive 
+   containing a bytecode module for each compiled source file. The difference is
+   subtle, but important depending on how the module or library is to be linked.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="tools">Tools</a></div>
+ <div class="doc_text">
+   <p>For building executable programs (tools), you must provide the name of the
+   tool and the names of the libraries you wish to link with the tool. For
+   example:</p>
+   <pre><tt>
+       TOOLNAME = mytool
+       USEDLIBS = mylib
+       LLVMLIBS = LLVMSupport.a LLVMSystem.a
+   </tt></pre>
+   <p>says that we are to build a tool named <tt>mytool</tt> and that it requires
+   three libraries: <tt>mylib</tt>, <tt>LLVMSupport.a</tt> and
+   <tt>LLVMSystem.a</tt>.</p>
+   <p>Note that two different variables are used to indicate which libraries are
+   linked: <tt>USEDLIBS</tt> and <tt>LLVMLIBS</tt>. This distinction is necessary
+   to support projects. <tt>LLVMLIBS</tt> refers to the LLVM libraries found in 
+   the LLVM object directory. <tt>USEDLIBS</tt> refers to the libraries built by 
+   your project. In the case of building LLVM tools, <tt>USEDLIBS</tt> and 
+   <tt>LLVMLIBS</tt> can be used interchangeably since the "project" is LLVM 
+   itself and <tt>USEDLIBS</tt> refers to the same place as <tt>LLVMLIBS</tt>.
+   </p>
+   <p>Also note that there are two different ways of specifying a library: with a
+   <tt>.a</tt> suffix and without. Without the suffix, the entry refers to the
+   re-linked (.o) file which will include <em>all</em> symbols of the library.
+   This is useful, for example, to include all passes from a library of passes.
+   If the <tt>.a</tt> suffix is used then the library is linked as a searchable
+   library (with the <tt>-l</tt> option). In this case, only the symbols that are
+   unresolved <em>at that point</em> will be resolved from the library, if they
+   exist. Other (unreferenced) symbols will not be included when the <tt>.a</tt>
+   syntax is used. Note that in order to use the <tt>.a</tt> suffix, the library
+   in question must have been built with the <tt>ARCHIVE_LIBRARY</tt> option set.
+   </p>
+ </div>
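+ 
+ <div class="doc_text">
+   <p>To illustrate the two forms side by side, the sketch below links one
+   library as a relinked object and another as a searchable archive. Both
+   library names are hypothetical, and the second is assumed to have been built
+   with <tt>ARCHIVE_LIBRARY</tt> set.</p>
+   <pre><tt>
+       TOOLNAME = mytool
+       # mypasses  : relinked object, all of its symbols are included
+       # myutils.a : archive, only symbols still unresolved are pulled in
+       USEDLIBS = mypasses myutils.a
+   </tt></pre>
+ </div>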
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection"><a name="JIT">JIT Tools</a></div>
+ <div class="doc_text">
+   <p>Many tools will want to use the JIT features of LLVM. However, getting the
+   right set of libraries to link with is tedious, platform specific, and error 
+   prone. Additionally, the JIT has special linker switch options that it needs.
+   Consequently, to make it easier to build tools that use the JIT, you can 
+   use a special value for the <tt>LLVMLIBS</tt> variable:</p>
+   <pre><tt>
+       TOOLNAME = my_jit_tool
+       USEDLIBS = mylib
+       LLVMLIBS = JIT
+   </tt></pre>
+   <p>Using a value of <tt>JIT</tt> for <tt>LLVMLIBS</tt> tells the makefile
+   system to construct a special value for LLVMLIBS that gives the program all
+   the LLVM libraries needed to run the JIT. Any additional libraries needed can
+   still be specified with <tt>USEDLIBS</tt>. To get a full understanding of how
+   this changes the linker command, it is recommended that you:</p>
+   <pre><tt>
+       cd examples/Fibonacci
+       make VERBOSE=1
+   </tt></pre>
+   <p>By default, using <tt>LLVMLIBS=JIT</tt> will link in enough to support JIT
+   code generation for the architecture on which the tool is linked. If you need
+   additional target architectures linked in, you may specify them on the command
+   line or in your <tt>Makefile</tt>. For example:</p>
+   <pre><tt>
+       ENABLE_X86_JIT=1
+       ENABLE_SPARCV9_JIT=1
+       ENABLE_PPC_JIT=1
+   </tt></pre>
+   <p>will cause the tool to be able to generate code for all three platforms.
+   </p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="targets">Targets Supported</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+   <p>This section describes each of the targets that can be built using the LLVM
+   Makefile system. Any target can be invoked from any directory but not all are
+   applicable to a given directory (e.g. "check", "dist" and "install" will
+   always operate as if invoked from the top level directory).</p>
+ 
+   <table style="text-align:left">
+     <tr>
+       <th>Target Name</th><th>Implied Targets</th><th>Target Description</th>
+     </tr>
+     <tr><td><a href="#all"><tt>all</tt></a></td><td></td>
+       <td>Compile the software recursively. Default target.
+     </td></tr>
+     <tr><td><a href="#all-local"><tt>all-local</tt></a></td><td></td>
+       <td>Compile the software in the local directory only.
+     </td></tr>
+     <tr><td><a href="#check"><tt>check</tt></a></td><td></td>
+       <td>Change to the <tt>test</tt> directory in a project and run the
+       test suite there.
+     </td></tr>
+     <tr><td><a href="#check-local"><tt>check-local</tt></a></td><td></td>
+       <td>Run a local test suite. Generally this is only defined in the 
+         <tt>Makefile</tt> of the project's <tt>test</tt> directory.
+     </td></tr>
+     <tr><td><a href="#clean"><tt>clean</tt></a></td><td></td>
+       <td>Remove built objects recursively.
+     </td></tr>
+     <tr><td><a href="#clean-local"><tt>clean-local</tt></a></td><td></td>
+       <td>Remove built objects from the local directory only.
+     </td></tr>
+     <tr><td><a href="#dist"><tt>dist</tt></a></td><td>all</td>
+       <td>Prepare a source distribution tarball.
+     </td></tr>
+     <tr><td><a href="#dist-check"><tt>dist-check</tt></a></td><td>all</td>
+       <td>Prepare a source distribution tarball and check that it builds.
+     </td></tr>
+     <tr><td><a href="#dist-clean"><tt>dist-clean</tt></a></td><td>clean</td>
+       <td>Clean source distribution tarball temporary files.
+     </td></tr>
+     <tr><td><a href="#install"><tt>install</tt></a></td><td>all</td>
+       <td>Copy built objects to installation directory.
+     </td></tr>
+     <tr><td><a href="#preconditions"><tt>preconditions</tt></a></td><td>all</td>
+       <td>Check to make sure configuration and makefiles are up to date.
+     </td></tr>
+     <tr><td><a href="#printvars"><tt>printvars</tt></a></td><td>all</td>
+       <td>Prints variables defined by the makefile system (for debugging).
+     </td></tr>
+     <tr><td><a href="#tags"><tt>tags</tt></a></td><td></td>
+       <td>Make C and C++ tags files for emacs and vi.
+     </td></tr>
+     <tr><td><a href="#uninstall"><tt>uninstall</tt></a></td><td></td>
+       <td>Remove built objects from installation directory.
+     </td></tr>
+   </table>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="all">all (default)</a></div>
+ <div class="doc_text">
+   <p>When you invoke <tt>make</tt> with no arguments, you are implicitly
+   instructing it to seek the "all" target (goal). This target is used for
+   building the software recursively and will do different things in different 
+   directories.  For example, in a <tt>lib</tt> directory, the "all" target will 
+   compile source files and generate libraries. But, in a <tt>tools</tt> 
+   directory, it will link libraries and generate executables.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="all-local">all-local</a></div>
+ <div class="doc_text">
+   <p>This target is the same as <a href="#all">all</a> but it operates only on
+   the current directory instead of recursively.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="check">check</a></div>
+ <div class="doc_text">
+   <p>This target can be invoked from anywhere within a project's directories
+   but always invokes the <a href="#check-local"><tt>check-local</tt></a> target 
+   in the project's <tt>test</tt> directory, if it exists and has a 
+   <tt>Makefile</tt>. A warning is produced otherwise.  If 
+   <a href="#TESTSUITE"><tt>TESTSUITE</tt></a> is defined on the <tt>make</tt>
+   command line, it will be passed down to the invocation of 
+   <tt>make check-local</tt> in the <tt>test</tt> directory. The intended usage 
+   for this is to assist in running specific suites of tests. If
+   <tt>TESTSUITE</tt> is not set, the implementation of <tt>check-local</tt> 
+   should run all normal tests.  It is up to the project to define what 
+   different values for <tt>TESTSUITE</tt> will do. See the 
+   <a href="TestingGuide.html">TestingGuide</a> for further details.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="check-local">check-local</a></div>
+ <div class="doc_text">
+   <p>This target should be implemented by the <tt>Makefile</tt> in the project's
+   <tt>test</tt> directory. It is invoked by the <tt>check</tt> target elsewhere.
+   Each project is free to define the actions of <tt>check-local</tt> as 
+   appropriate for that project. The LLVM project itself uses dejagnu to run a 
+   suite of feature and regression tests. Other projects may choose to use 
+   dejagnu or any other testing mechanism.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="clean">clean</a></div>
+ <div class="doc_text">
+   <p>This target cleans the build directory, recursively removing all things
+   that the Makefile builds. The cleaning rules are guarded so that they
+   shouldn't go awry (via <tt>rm -f $(UNSET_VARIABLE)/*</tt>, which would
+   attempt to erase the entire directory structure).</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="clean-local">clean-local</a></div>
+ <div class="doc_text">
+   <p>This target does the same thing as <tt>clean</tt> but only for the current
+   (local) directory.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="dist">dist</a></div>
+ <div class="doc_text">
+   <p>This target builds a distribution tarball. It first builds the entire
+   project using the <tt>all</tt> target and then tars up the necessary files and
+   compresses them. The generated tarball is sufficient for a casual source 
+   distribution, but probably not for a release (see <tt>dist-check</tt>).</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="dist-check">dist-check</a></div>
+ <div class="doc_text">
+   <p>This target does the same thing as the <tt>dist</tt> target but also checks
+   the distribution tarball. The check is made by unpacking the tarball to a new
+   directory, configuring it, building it, installing it, and then verifying that
+   the installation results are correct (by comparing to the original build).
+   This target can take a long time to run but should be done before a release
+   goes out to make sure that the distributed tarball can actually be built into
+   a working release.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="dist-clean">dist-clean</a></div>
+ <div class="doc_text">
+   <p>This is a special form of the <tt>clean</tt> target. It performs a
+   normal <tt>clean</tt> but also removes things pertaining to building the
+   distribution.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="install">install</a></div>
+ <div class="doc_text">
+   <p>This target finalizes shared objects and executables and copies all
+   libraries, headers, executables and documentation to the directory given 
+   with the <tt>--prefix</tt> option to <tt>configure</tt>.  When completed, 
+   the prefix directory will have everything needed to <b>use</b> LLVM. </p>
+   <p>The LLVM makefiles can generate complete <b>internal</b> documentation 
+   for all the classes by using <tt>doxygen</tt>. By default, this feature is 
+   <b>not</b> enabled because it takes a long time and generates a massive 
+   amount of data (&gt;100MB). If you want this feature, you must configure LLVM
+   with the <tt>--enable-doxygen</tt> switch and ensure that a modern version of
+   doxygen (1.3.7 or later) is available in your <tt>PATH</tt>. You can download
+   doxygen from 
+   <a href="http://www.stack.nl/~dimitri/doxygen/download.html#latestsrc">
+   here</a>.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="preconditions">preconditions</a></div>
+ <div class="doc_text">
+   <p>This utility target checks to see if the <tt>Makefile</tt> in the object
+   directory is older than the <tt>Makefile</tt> in the source directory and
+   copies it if so. It also reruns the <tt>configure</tt> script if that needs to
+   be done and rebuilds the <tt>Makefile.config</tt> file similarly. Users may
+   overload this target to ensure that sanity checks are run <em>before</em> any
+   building of targets as all the targets depend on <tt>preconditions</tt>.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="printvars">printvars</a></div>
+ <div class="doc_text">
+   <p>This utility target just causes the LLVM makefiles to print out some of 
+   the makefile variables so that you can double check how things are set. </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="reconfigure">reconfigure</a></div>
+ <div class="doc_text">
+   <p>This utility target will force a reconfigure of LLVM or your project. It 
+   simply runs <tt>$(PROJ_OBJ_ROOT)/config.status --recheck</tt> to rerun the
+   configuration tests and rebuild the configured files. This isn't generally
+   useful as the makefiles will reconfigure themselves whenever it's necessary.
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="spotless">spotless</a></div>
+ <div class="doc_text">
+   <p>This utility target, only available when <tt>$(PROJ_OBJ_ROOT)</tt> is not 
+   the same as <tt>$(PROJ_SRC_ROOT)</tt>, will completely clean the
+   <tt>$(PROJ_OBJ_ROOT)</tt> directory by removing its content entirely and 
+   reconfiguring the directory. This returns the <tt>$(PROJ_OBJ_ROOT)</tt> 
+   directory to a completely fresh state. All content in the directory except 
+   configured files and top-level makefiles will be lost.</p>
+   <div class="doc_warning"><p>Use with caution.</p></div>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="tags">tags</a></div>
+ <div class="doc_text">
+   <p>This target will generate a <tt>TAGS</tt> file in the top-level source
+   directory. It is meant for use with emacs, XEmacs, or Vim. The <tt>TAGS</tt>
+   file provides an index of symbol definitions so that the editor can jump to a
+   definition quickly. </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="uninstall">uninstall</a></div>
+ <div class="doc_text">
+   <p>This target is the opposite of the <tt>install</tt> target. It removes the
+   header, library and executable files from the installation directories. Note
+   that the directories themselves are not removed because it is not guaranteed
+   that LLVM is the only thing installing there (e.g. --prefix=/usr).</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="variables">Variables</a></div>
+ <!-- *********************************************************************** -->
+ <div class="doc_text">
+   <p>Variables are used to tell the LLVM Makefile System what to do and to
+   obtain information from it. Variables are also used internally by the LLVM
+   Makefile System. Variable names that contain only the upper case alphabetic
+   letters and underscore are intended for use by the end user. All other
+   variables are internal to the LLVM Makefile System and should not be relied
+   upon nor modified. The sections below describe how to use the LLVM Makefile 
+   variables.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="setvars">Control Variables</a></div>
+ <div class="doc_text">
+   <p>Variables listed in the table below should be set <em>before</em> the 
+   inclusion of <a href="#Makefile.common"><tt>$(LEVEL)/Makefile.common</tt></a>.
+   These variables provide input to the LLVM make system that tells it what to do 
+   for the current directory.</p>
+   <dl>
+     <dt><a name="BUILD_ARCHIVE"><tt>BUILD_ARCHIVE</tt></a></dt>
+     <dd>If set to any value, causes an archive (.a) library to be built.</dd>
+     <dt><a name="BUILT_SOURCES"><tt>BUILT_SOURCES</tt></a></dt>
+     <dd>Specifies a set of source files that are generated from other source
+     files. These sources will be built before any other target processing to 
+     ensure they are present.</dd>
+     <dt><a name="BYTECODE_LIBRARY"><tt>BYTECODE_LIBRARY</tt></a></dt>
+     <dd>If set to any value, causes a bytecode library (.bc) to be built.</dd>
+     <dt><a name="CONFIG_FILES"><tt>CONFIG_FILES</tt></a></dt>
+     <dd>Specifies a set of configuration files to be installed.</dd>
+     <dt><a name="DIRS"><tt>DIRS</tt></a></dt>
+     <dd>Specifies a set of directories, usually children of the current
+     directory, that should also be made using the same goal. These directories 
+     will be built serially.</dd>
+     <dt><a name="DISABLE_AUTO_DEPENDENCIES"><tt>DISABLE_AUTO_DEPENDENCIES</tt></a></dt>
+     <dd>If set to any value, causes the makefiles to <b>not</b> automatically
+     generate dependencies when running the compiler. Use of this feature is
+     discouraged and it may be removed at a later date.</dd>
+     <dt><a name="DONT_BUILD_RELINKED"><tt>DONT_BUILD_RELINKED</tt></a></dt>
+     <dd>If set to any value, causes a relinked library (.o) not to be built. By
+     default, libraries are built as re-linked since most LLVM libraries are
+     needed in their entirety and re-linked libraries will be linked more quickly
+     than equivalent archive libraries.</dd>
+     <dt><a name="ENABLE_OPTIMIZED"><tt>ENABLE_OPTIMIZED</tt></a></dt>
+     <dd>If set to any value, causes the build to generate optimized objects,
+     libraries and executables. This alters the flags specified to the compilers
+     and linkers. Generally debugging won't be a fun experience with an optimized
+     build.</dd>
+     <dt><a name="ENABLE_PROFILING"><tt>ENABLE_PROFILING</tt></a></dt>
+     <dd>If set to any value, causes the build to generate both optimized and 
+     profiled objects, libraries and executables. This alters the flags specified
+     to the compilers and linkers to ensure that profile data can be collected
+     from the tools built. Use the <tt>gprof</tt> tool to analyze the output from
+     the profiled tools (<tt>gmon.out</tt>).</dd>
+     <dt><a name="DISABLE_ASSERTIONS"><tt>DISABLE_ASSERTIONS</tt></a></dt>
+     <dd>If set to any value, causes the build to disable assertions, even if 
+     building a release or profile build.  This will exclude all assertion check
+     code from the build. LLVM will execute faster, but with little help when
+     things go wrong.</dd>
+     <dt><a name="EXPERIMENTAL_DIRS"><tt>EXPERIMENTAL_DIRS</tt></a></dt>
+     <dd>Specify a set of directories that should be built, but if they fail, it
+     should not cause the build to fail. Note that this should only be used 
+     temporarily while code is being written.</dd> 
+     <dt><a name="EXPORTED_SYMBOL_FILE"><tt>EXPORTED_SYMBOL_FILE</tt></a></dt>
+     <dd>Specifies the name of a single file that contains a list of the 
+     symbols to be exported by the linker. One symbol per line.</dd>
+     <dt><a name="EXPORTED_SYMBOL_LIST"><tt>EXPORTED_SYMBOL_LIST</tt></a></dt>
+     <dd>Specifies a set of symbols to be exported by the linker.</dd>
+     <dt><a name="EXTRA_DIST"><tt>EXTRA_DIST</tt></a></dt>
+     <dd>Specifies additional files that should be distributed with LLVM. All
+     source files, all built sources, all Makefiles, and most documentation files
+     will be automatically distributed. Use this variable to distribute any 
+     files that are not automatically distributed.</dd>
+     <dt><a name="KEEP_SYMBOLS"><tt>KEEP_SYMBOLS</tt></a></dt>
+     <dd>If set to any value, specifies that when linking executables the
+     makefiles should retain debug symbols in the executable. Normally, symbols
+     are stripped from the executable.</dd>
+     <dt><a name="LEVEL"><tt>LEVEL</tt></a><small>(required)</small></dt>
+     <dd>Specify the level of nesting from the top level. This variable must be
+     set in each makefile as it is used to find the top level and thus the other
+     makefiles.</dd>
+     <dt><a name="LIBRARYNAME"><tt>LIBRARYNAME</tt></a></dt>
+     <dd>Specify the name of the library to be built. (Required For
+     Libraries)</dd>
+     <dt><a name="LINK_LIBS_IN_SHARED"><tt>LINK_LIBS_IN_SHARED</tt></a></dt>
+     <dd>By default, shared library linking will ignore any libraries specified
+     with the <a href="#LLVMLIBS">LLVMLIBS</a> or <a href="#USEDLIBS">USEDLIBS</a>
+     variables.
+     This prevents shared libs from including things that will be in the LLVM
+     tool the shared library will be loaded into. However, sometimes it is useful
+     to link certain libraries into your shared library and this option enables
+     that feature.</dd>
+     <dt><a name="LLVMLIBS"><tt>LLVMLIBS</tt></a></dt>
+     <dd>Specifies the set of libraries from the LLVM $(ObjDir) that will be
+     linked into the tool or library.</dd>
+     <dt><a name="LOADABLE_MODULE"><tt>LOADABLE_MODULE</tt></a></dt>
+     <dd>If set to any value, causes the shared library being built to also be
+     a loadable module. Loadable modules can be opened with the dlopen() function
+     and searched with dlsym (or the operating system's equivalent). Note that
+     setting this variable without also setting <tt>SHARED_LIBRARY</tt> will have
+     no effect.</dd>
+     <dt><a name="MODULE_NAME"><tt>MODULE_NAME</tt></a></dt>
+     <dd>Specifies the name of a bytecode module to be created. A bytecode 
+     module can be specified in conjunction with other kinds of library builds 
+     or by itself. It constructs from the sources a single linked bytecode 
+     file.</dd>
+     <dt><a name="OPTIONAL_DIRS"><tt>OPTIONAL_DIRS</tt></a></dt>
+     <dd>Specify a set of directories that may be built, if they exist, but it's
+     not an error for them not to exist.</dd>
+     <dt><a name="PARALLEL_DIRS"><tt>PARALLEL_DIRS</tt></a></dt>
+     <dd>Specify a set of directories to build recursively and in parallel if
+     the -j option was used with <tt>make</tt>.</dd>
+     <dt><a name="SHARED_LIBRARY"><tt>SHARED_LIBRARY</tt></a></dt>
+     <dd>If set to any value, causes a shared library (.so) to be built in
+     addition to any other kinds of libraries. Note that this option will cause
+     all source files to be built twice: once with options for position
+     independent code and once without. Use it only where you really need a
+     shared library.</dd>
+     <dt><a name="SOURCES"><tt>SOURCES</tt></a><small>(optional)</small></dt>
+     <dd>Specifies the list of source files in the current directory to be
+     built. Source files of any type may be specified (programs, documentation, 
+     config files, etc.). If not specified, the makefile system will infer the
+     set of source files from the files present in the current directory.</dd>
+     <dt><a name="SUFFIXES"><tt>SUFFIXES</tt></a></dt>
+     <dd>Specifies a set of filename suffixes that occur in suffix match rules.
+     Only set this if your local <tt>Makefile</tt> specifies additional suffix
+     match rules.</dd> 
+     <dt><a name="TARGET"><tt>TARGET</tt></a></dt>
+     <dd>Specifies the name of the LLVM code generation target that the
+     current directory builds. Setting this variable enables additional rules to
+     build <tt>.inc</tt> files from <tt>.td</tt> files. </dd>
+     <dt><a name="TESTSUITE"><tt>TESTSUITE</tt></a></dt>
+     <dd>Specifies the directory of tests to run in <tt>llvm/test</tt>.</dd>
+     <dt><a name="TOOLNAME"><tt>TOOLNAME</tt></a></dt>
+     <dd>Specifies the name of the tool that the current directory should
+     build.</dd>
+     <dt><a name="TOOL_VERBOSE"><tt>TOOL_VERBOSE</tt></a></dt>
+     <dd>Implies VERBOSE and also tells each tool invoked to be verbose. This is
+     handy when you're trying to see the sub-tools invoked by each tool invoked 
+     by the makefile. For example, this will pass <tt>-v</tt> to the GCC 
+     compilers, which causes them to print out the command lines they use to
+     invoke sub-tools (compiler, assembler, linker).</dd>
+     <dt><a name="USEDLIBS"><tt>USEDLIBS</tt></a></dt>
+     <dd>Specifies the list of project libraries that will be linked into the
+     tool or library.</dd>
+     <dt><a name="VERBOSE"><tt>VERBOSE</tt></a></dt>
+     <dd>Tells the Makefile system to produce detailed output of what it is doing
+     instead of just summary comments. This will generate a LOT of output.</dd>
+   </dl>
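+   <p>As an illustration only, a library directory's <tt>Makefile</tt> might set
+   a few of these control variables before the include line. The nesting depth
+   and the library name <tt>mylib</tt> below are hypothetical, not taken from
+   the LLVM tree:</p>
+   <pre><tt>
+ # Hypothetical: this directory sits two levels below the top level.
+ LEVEL = ../..
+ # Hypothetical library name.
+ LIBRARYNAME = mylib
+ # Build an archive (.a) library and skip the re-linked (.o) library.
+ BUILD_ARCHIVE = 1
+ DONT_BUILD_RELINKED = 1
+ # Control variables must be set before this line.
+ include $(LEVEL)/Makefile.common
+   </tt></pre>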
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="overvars">Override Variables</a></div>
+ <div class="doc_text">
+   <p>Override variables can be used to override the default
+   values provided by the LLVM makefile system. These variables can be set in 
+   several ways:</p>
+   <ul>
+     <li>In the environment (e.g. setenv, export) -- not recommended.</li>
+     <li>On the <tt>make</tt> command line -- recommended.</li>
+     <li>On the <tt>configure</tt> command line</li>
+     <li>In the Makefile (only <em>after</em> the inclusion of <a
+     href="#Makefile.common"><tt>$(LEVEL)/Makefile.common</tt></a>).</li>
+   </ul>
+   <p>The override variables are given below:</p>
+   <dl>
+     <dt><a name="AR"><tt>AR</tt></a> <small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>ar</tt> tool.</dd>
+     <dt><a name="BISON"><tt>BISON</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>bison</tt> tool.</dd>
+     <dt><a name="PROJ_OBJ_DIR"><tt>PROJ_OBJ_DIR</tt></a></dt>
+     <dd>The directory into which the products of build rules will be placed.
+     This might be the same as 
+     <a href="#PROJ_SRC_DIR"><tt>PROJ_SRC_DIR</tt></a> but typically is
+     not.</dd>
+     <dt><a name="PROJ_SRC_DIR"><tt>PROJ_SRC_DIR</tt></a></dt>
+     <dd>The directory which contains the source files to be built.</dd>
+     <dt><a name="BZIP2"><tt>BZIP2</tt></a><small>(configured)</small></dt>
+     <dd>The path to the <tt>bzip2</tt> tool.</dd>
+     <dt><a name="CC"><tt>CC</tt></a><small>(configured)</small></dt>
+     <dd>The path to the 'C' compiler.</dd>
+     <dt><a name="CFLAGS"><tt>CFLAGS</tt></a></dt>
+     <dd>Additional flags to be passed to the 'C' compiler.</dd>
+     <dt><a name="CXX"><tt>CXX</tt></a></dt>
+     <dd>Specifies the path to the C++ compiler.</dd>
+     <dt><a name="CXXFLAGS"><tt>CXXFLAGS</tt></a></dt>
+     <dd>Additional flags to be passed to the C++ compiler.</dd>
+     <dt><a name="DATE"><tt>DATE</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>date</tt> program or any program that can
+     generate the current date and time on its standard output.</dd>
+     <dt><a name="DOT"><tt>DOT</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>dot</tt> tool or <tt>false</tt> if there
+     isn't one.</dd>
+     <dt><a name="ECHO"><tt>ECHO</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>echo</tt> tool for printing output.</dd>
+     <dt><a name="ETAGS"><tt>ETAGS</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>etags</tt> tool.</dd>
+     <dt><a name="ETAGSFLAGS"><tt>ETAGSFLAGS</tt></a><small>(configured)</small>
+     </dt>
+     <dd>Provides flags to be passed to the <tt>etags</tt> tool.</dd>
+     <dt><a name="EXEEXT"><tt>EXEEXT</tt></a><small>(configured)</small></dt>
+     <dd>Provides the extension to be used on executables built by the makefiles.
+     The value may be empty on platforms that do not use file extensions for
+     executables (e.g. Unix).</dd>
+     <dt><a name="FLEX"><tt>FLEX</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>flex</tt> tool.</dd>
+     <dt><a name="GCCLD"><tt>GCCLD</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>gccld</tt> tool.</dd>
+     <dt><a name="INSTALL"><tt>INSTALL</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>install</tt> tool.</dd>
+     <dt><a name="LDFLAGS"><tt>LDFLAGS</tt></a><small>(configured)</small></dt>
+     <dd>Allows users to specify additional flags to pass to the linker.</dd>
+     <dt><a name="LIBS"><tt>LIBS</tt></a><small>(configured)</small></dt>
+     <dd>The list of libraries that should be linked with each tool.</dd>
+     <dt><a name="LIBTOOL"><tt>LIBTOOL</tt></a><small>(configured)</small></dt>
+     <dd>Specifies the path to the <tt>libtool</tt> tool. This tool is renamed
+     <tt>mklib</tt> by the <tt>configure</tt> script and always located in the 
+     <dt><a name="LLVMAS"><tt>LLVMAS</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>llvm-as</tt> tool.</dd>
+     <dt><a name="LLVMGCC"><tt>LLVMGCC</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the LLVM version of the GCC 'C' Compiler</dd>
+     <dt><a name="LLVMGXX"><tt>LLVMGXX</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the LLVM version of the GCC C++ Compiler</dd>
+     <dt><a name="LLVM_OBJ_ROOT"><tt>LLVM_OBJ_ROOT</tt></a><small>(configured)
+     </small></dt>
+     <dd>Specifies the top directory into which the output of the build is
+     placed.</dd>
+     <dt><a name="LLVM_SRC_ROOT"><tt>LLVM_SRC_ROOT</tt></a><small>(configured)
+     </small></dt>
+     <dd>Specifies the top directory in which the sources are found.</dd>
+     <dt><a name="LLVM_TARBALL_NAME"><tt>LLVM_TARBALL_NAME</tt></a>
+     <small>(configured)</small></dt>
+     <dd>Specifies the name of the distribution tarball to create. This is
+     configured from the name of the project and its version number.</dd>
+     <dt><a name="MKDIR"><tt>MKDIR</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>mkdir</tt> tool that creates
+     directories.</dd>
+     <dt><a name="PLATFORMSTRIPOPTS"><tt>PLATFORMSTRIPOPTS</tt></a></dt>
+     <dd>The options to provide to the linker to specify that a stripped (no
+     symbols) executable should be built.</dd>
+     <dt><a name="RANLIB"><tt>RANLIB</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>ranlib</tt> tool.</dd>
+     <dt><a name="RM"><tt>RM</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>rm</tt> tool.</dd>
+     <dt><a name="SED"><tt>SED</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>sed</tt> tool.</dd>
+     <dt><a name="SHLIBEXT"><tt>SHLIBEXT</tt></a><small>(configured)</small></dt>
+     <dd>Provides the filename extension to use for shared libraries.</dd>
+     <dt><a name="TBLGEN"><tt>TBLGEN</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>tblgen</tt> tool.</dd>
+     <dt><a name="TAR"><tt>TAR</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>tar</tt> tool.</dd>
+     <dt><a name="ZIP"><tt>ZIP</tt></a><small>(defaulted)</small></dt>
+     <dd>Specifies the path to the <tt>zip</tt> tool.</dd>
+   </dl>
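+   <p>A minimal sketch of the last method in the list above, assuming a
+   hypothetical tool directory: the override is placed after the include line.
+   The tool name <tt>mytool</tt> and the flag <tt>-DMYTOOL_TRACE</tt> are
+   invented for illustration:</p>
+   <pre><tt>
+ # Hypothetical: this directory sits two levels below the top level.
+ LEVEL = ../..
+ # Hypothetical tool name.
+ TOOLNAME = mytool
+ include $(LEVEL)/Makefile.common
+ # Override variables are set in a Makefile only after the include line.
+ CXXFLAGS += -DMYTOOL_TRACE
+   </tt></pre>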
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="getvars">Readable Variables</a></div>
+ <div class="doc_text">
+   <p>Variables listed in the table below can be used by the user's Makefile but
+   should not be changed. Changing the value will generally cause the build to go
+   wrong, so don't do it.</p>
+   <dl>
+     <dt><a name="bindir"><tt>bindir</tt></a></dt>
+     <dd>The directory into which executables will ultimately be installed. This
+     value is derived from the <tt>--prefix</tt> option given to
+     <tt>configure</tt>.</dd>
+     <dt><a name="BuildMode"><tt>BuildMode</tt></a></dt>
+     <dd>The name of the type of build being performed: Debug, Release, or 
+     Profile</dd>
+     <dt><a name="bytecode_libdir"><tt>bytecode_libdir</tt></a></dt>
+     <dd>The directory into which bytecode libraries will ultimately be 
+     installed.  This value is derived from the <tt>--prefix</tt> option given to
+     <tt>configure</tt>.</dd>
+     <dt><a name="ConfigureScriptFLAGS"><tt>ConfigureScriptFLAGS</tt></a></dt>
+     <dd>Additional flags given to the <tt>configure</tt> script when
+     reconfiguring.</dd>
+     <dt><a name="DistDir"><tt>DistDir</tt></a></dt>
+     <dd>The <em>current</em> directory for which a distribution copy is being
+     made.</dd>
+     <dt><a name="Echo"><tt>Echo</tt></a></dt>
+     <dd>The LLVM Makefile System output command. This provides the
+     <tt>llvm[n]</tt> prefix and starts with @ so the command itself is not
+     printed by <tt>make</tt>.</dd>
+     <dt><a name="EchoCmd"><tt>EchoCmd</tt></a></dt>
+     <dd> Same as <a href="#Echo"><tt>Echo</tt></a> but without the leading @.
+     </dd>
+     <dt><a name="includedir"><tt>includedir</tt></a></dt>
+     <dd>The directory into which include files will ultimately be installed. 
+     This value is derived from the <tt>--prefix</tt> option given to
+     <tt>configure</tt>.</dd>
+     <dt><a name="libdir"><tt>libdir</tt></a></dt>
+     <dd>The directory into which native libraries will ultimately be installed. 
+     This value is derived from the <tt>--prefix</tt> option given to
+     <tt>configure</tt>.</dd>
+     <dt><a name="LibDir"><tt>LibDir</tt></a></dt>
+     <dd>The configuration specific directory into which libraries are placed
+     before installation.</dd>
+     <dt><a name="MakefileConfig"><tt>MakefileConfig</tt></a></dt>
+     <dd>Full path of the <tt>Makefile.config</tt> file.</dd>
+     <dt><a name="MakefileConfigIn"><tt>MakefileConfigIn</tt></a></dt>
+     <dd>Full path of the <tt>Makefile.config.in</tt> file.</dd>
+     <dt><a name="ObjDir"><tt>ObjDir</tt></a></dt>
+     <dd>The configuration and directory specific directory where build objects
+     (compilation results) are placed.</dd>
+     <dt><a name="SubDirs"><tt>SubDirs</tt></a></dt>
+     <dd>The complete list of sub-directories of the current directory as
+     specified by other variables.</dd>
+     <dt><a name="Sources"><tt>Sources</tt></a></dt>
+     <dd>The complete list of source files.</dd>
+     <dt><a name="sysconfdir"><tt>sysconfdir</tt></a></dt>
+     <dd>The directory into which configuration files will ultimately be
+     installed. This value is derived from the <tt>--prefix</tt> option given to
+     <tt>configure</tt>.</dd>
+     <dt><a name="ToolDir"><tt>ToolDir</tt></a></dt>
+     <dd>The configuration specific directory into which executables are placed
+     before they are installed.</dd>
+     <dt><a name="TopDistDir"><tt>TopDistDir</tt></a></dt>
+     <dd>The top most directory into which the distribution files are copied.
+     </dd>
+     <dt><a name="Verb"><tt>Verb</tt></a></dt>
+     <dd>Use this as the first thing on your build script lines to enable or
+     disable verbose mode. It expands to either an @ (quiet mode) or nothing
+     (verbose mode). </dd>
+   </dl>
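+   <p>For example, a local rule might read a few of these variables without
+   modifying them. This is only a sketch: the target name <tt>show-dirs</tt> is
+   hypothetical, and each command line must begin with a tab character:</p>
+   <pre><tt>
+ LEVEL = ../..
+ include $(LEVEL)/Makefile.common
+ 
+ # Hypothetical target that reports where build products are placed.
+ show-dirs:
+ 	$(Echo) "Build mode is $(BuildMode)"
+ 	$(Verb) echo "Objects: $(ObjDir)  Libraries: $(LibDir)  Tools: $(ToolDir)"
+   </tt></pre>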
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="intvars">Internal Variables</a></div>
+ <div class="doc_text">
+   <p>Variables listed below are used by the LLVM Makefile System 
+   and considered internal. You should not use these variables under any
+   circumstances.</p>
+   <p><tt>
+     Archive
+     AR.Flags
+     BaseNameSources
+     BCCompile.C
+     BCCompile.CXX
+     BCLinkLib
+     C.Flags
+     Compile.C
+     CompileCommonOpts
+     Compile.CXX
+     ConfigStatusScript
+     ConfigureScript
+     CPP.Flags
+     CXX.Flags
+     DependFiles
+     DestArchiveLib
+     DestBytecodeLib
+     DestModule
+     DestRelinkedLib
+     DestSharedLib
+     DestTool
+     DistAlways
+     DistCheckDir
+     DistCheckTop
+     DistFiles
+     DistName
+     DistOther
+     DistSources
+     DistSubDirs
+     DistTarBZ2
+     DistTarGZip
+     DistZip
+     ExtraLibs
+     FakeSources
+     INCFiles
+     InternalTargets
+     LD.Flags
+     LexFiles
+     LexOutput
+     LibName.A
+     LibName.BC
+     LibName.LA
+     LibName.O
+     LibTool.Flags
+     Link
+     LinkModule
+     LLVMLibDir
+     LLVMLibsOptions
+     LLVMLibsPaths
+     LLVMToolDir
+     LLVMUsedLibs
+     LocalTargets
+     LTCompile.C
+     LTCompile.CXX
+     LTInstall
+     Module
+     ObjectsBC
+     ObjectsLO
+     ObjectsO
+     ObjMakefiles
+     ParallelTargets
+     PreConditions
+     ProjLibsOptions
+     ProjLibsPaths
+     ProjUsedLibs
+     Ranlib
+     RecursiveTargets
+     Relink
+     SrcMakefiles
+     Strip
+     StripWarnMsg
+     TableGen
+     TDFiles
+     ToolBuildPath
+     TopLevelTargets
+     UserTargets
+     YaccFiles
+     YaccOutput
+   </tt></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:rspencer at x10sys.com">Reid Spencer</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/ProgrammersManual.html
diff -c /dev/null llvm-www/releases/1.8/docs/ProgrammersManual.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/ProgrammersManual.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,2288 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>LLVM Programmer's Manual</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   LLVM Programmer's Manual
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#general">General Information</a>
+     <ul>
+       <li><a href="#stl">The C++ Standard Template Library</a></li>
+ <!--
+       <li>The <tt>-time-passes</tt> option</li>
+       <li>How to use the LLVM Makefile system</li>
+       <li>How to write a regression test</li>
+ 
+ --> 
+     </ul>
+   </li>
+   <li><a href="#apis">Important and useful LLVM APIs</a>
+     <ul>
+       <li><a href="#isa">The <tt>isa&lt;&gt;</tt>, <tt>cast&lt;&gt;</tt>
+ and <tt>dyn_cast&lt;&gt;</tt> templates</a> </li>
+       <li><a href="#DEBUG">The <tt>DEBUG()</tt> macro and <tt>-debug</tt>
+ option</a>
+         <ul>
+           <li><a href="#DEBUG_TYPE">Fine grained debug info with <tt>DEBUG_TYPE</tt>
+ and the <tt>-debug-only</tt> option</a> </li>
+         </ul>
+       </li>
+       <li><a href="#Statistic">The <tt>Statistic</tt> template &amp; <tt>-stats</tt>
+ option</a></li>
+ <!--
+       <li>The <tt>InstVisitor</tt> template
+       <li>The general graph API
+ --> 
+       <li><a href="#ViewGraph">Viewing graphs while debugging code</a></li>
+     </ul>
+   </li>
+   <li><a href="#common">Helpful Hints for Common Operations</a>
+     <ul>
+       <li><a href="#inspection">Basic Inspection and Traversal Routines</a>
+         <ul>
+           <li><a href="#iterate_function">Iterating over the <tt>BasicBlock</tt>s
+ in a <tt>Function</tt></a> </li>
+           <li><a href="#iterate_basicblock">Iterating over the <tt>Instruction</tt>s
+ in a <tt>BasicBlock</tt></a> </li>
+           <li><a href="#iterate_institer">Iterating over the <tt>Instruction</tt>s
+ in a <tt>Function</tt></a> </li>
+           <li><a href="#iterate_convert">Turning an iterator into a
+ class pointer</a> </li>
+           <li><a href="#iterate_complex">Finding call sites: a more
+ complex example</a> </li>
+           <li><a href="#calls_and_invokes">Treating calls and invokes
+ the same way</a> </li>
+           <li><a href="#iterate_chains">Iterating over def-use &
+ use-def chains</a> </li>
+         </ul>
+       </li>
+       <li><a href="#simplechanges">Making simple changes</a>
+         <ul>
+           <li><a href="#schanges_creating">Creating and inserting new
+ 		 <tt>Instruction</tt>s</a> </li>
+           <li><a href="#schanges_deleting">Deleting 		 <tt>Instruction</tt>s</a> </li>
+           <li><a href="#schanges_replacing">Replacing an 		 <tt>Instruction</tt>
+ with another <tt>Value</tt></a> </li>
+         </ul>
+       </li>
+ <!--
+     <li>Working with the Control Flow Graph
+     <ul>
+       <li>Accessing predecessors and successors of a <tt>BasicBlock</tt>
+       <li>
+       <li>
+     </ul>
+ --> 
+     </ul>
+   </li>
+ 
+   <li><a href="#advanced">Advanced Topics</a>
+   <ul>
+   <li><a href="#TypeResolve">LLVM Type Resolution</a>
+   <ul>
+     <li><a href="#BuildRecType">Basic Recursive Type Construction</a></li>
+     <li><a href="#refineAbstractTypeTo">The <tt>refineAbstractTypeTo</tt> method</a></li>
+     <li><a href="#PATypeHolder">The PATypeHolder Class</a></li>
+     <li><a href="#AbstractTypeUser">The AbstractTypeUser Class</a></li>
+   </ul></li>
+ 
+   <li><a href="#SymbolTable">The <tt>SymbolTable</tt> class </a></li>
+   </ul></li>
+ 
+   <li><a href="#coreclasses">The Core LLVM Class Hierarchy Reference</a>
+     <ul>
+       <li><a href="#Value">The <tt>Value</tt> class</a>
+         <ul>
+           <li><a href="#User">The <tt>User</tt> class</a>
+             <ul>
+               <li><a href="#Instruction">The <tt>Instruction</tt> class</a>
+                 <ul>
+                   <li><a href="#GetElementPtrInst">The <tt>GetElementPtrInst</tt> class</a></li>
+                 </ul>
+               </li>
+               <li><a href="#Module">The <tt>Module</tt> class</a></li>
+               <li><a href="#Constant">The <tt>Constant</tt> class</a>
+ 	        <ul>
+                   <li><a href="#GlobalValue">The <tt>GlobalValue</tt> class</a>
+                     <ul>
+                       <li><a href="#BasicBlock">The <tt>BasicBlock</tt> class</a></li>
+                       <li><a href="#Function">The <tt>Function</tt> class</a></li>
+                       <li><a href="#GlobalVariable">The <tt>GlobalVariable</tt> class</a></li>
+                     </ul>
+                   </li>
+                 </ul>
+               </li>
+ 	    </ul>
+ 	  </li>
+           <li><a href="#Type">The <tt>Type</tt> class</a> </li>
+           <li><a href="#Argument">The <tt>Argument</tt> class</a></li>
+         </ul>
+       </li>
+     </ul>
+   </li>
+ </ol>
+ 
+ <div class="doc_author">    
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a>, 
+                 <a href="mailto:dhurjati at cs.uiuc.edu">Dinakar Dhurjati</a>, 
+                 <a href="mailto:jstanley at cs.uiuc.edu">Joel Stanley</a>, and
+                 <a href="mailto:rspencer at x10sys.com">Reid Spencer</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction </a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document is meant to highlight some of the important classes and
+ interfaces available in the LLVM source-base.  This manual is not
+ intended to explain what LLVM is, how it works, or what LLVM code looks
+ like.  It assumes that you know the basics of LLVM and are interested
+ in writing transformations or otherwise analyzing or manipulating the
+ code.</p>
+ 
+ <p>This document should get you oriented so that you can find your
+ way in the continuously growing source code that makes up the LLVM
+ infrastructure. Note that this manual is not intended to serve as a
+ replacement for reading the source code, so if you think there should be
+ a method in one of these classes to do something, but it's not listed,
+ check the source.  Links to the <a href="/doxygen/">doxygen</a> sources
+ are provided to make this as easy as possible.</p>
+ 
+ <p>The first section of this document describes general information that is
+ useful to know when working in the LLVM infrastructure, and the second describes
+ the Core LLVM classes.  In the future this manual will be extended with
+ information describing how to use extension libraries, such as dominator
+ information, CFG traversal routines, and useful utilities like the <tt><a
+ href="/doxygen/InstVisitor_8h-source.html">InstVisitor</a></tt> template.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="general">General Information</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section contains general information that is useful if you are working
+ in the LLVM source-base, but that isn't specific to any particular API.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="stl">The C++ Standard Template Library</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM makes heavy use of the C++ Standard Template Library (STL),
+ perhaps much more than you are used to, or have seen before.  Because of
+ this, you might want to do a little background reading in the
+ techniques used and capabilities of the library.  There are many good
+ pages that discuss the STL, and several books on the subject that you
+ can get, so it will not be discussed in this document.</p>
+ 
+ <p>Here are some useful links:</p>
+ 
+ <ol>
+ 
+ <li><a href="http://www.dinkumware.com/refxcpp.html">Dinkumware C++ Library
+ reference</a> - an excellent reference for the STL and other parts of the
+ standard C++ library.</li>
+ 
+ <li><a href="http://www.tempest-sw.com/cpp/">C++ In a Nutshell</a> - An
+ O'Reilly book with a decent Standard Library Reference that rivals
+ Dinkumware's.  Unfortunately, it is no longer free now that the book has been
+ published.</li>
+ 
+ <li><a href="http://www.parashift.com/c++-faq-lite/">C++ Frequently Asked
+ Questions</a></li>
+ 
+ <li><a href="http://www.sgi.com/tech/stl/">SGI's STL Programmer's Guide</a> -
+ Contains a useful <a
+ href="http://www.sgi.com/tech/stl/stl_introduction.html">Introduction to the
+ STL</a>.</li>
+ 
+ <li><a href="http://www.research.att.com/%7Ebs/C++.html">Bjarne Stroustrup's C++
+ Page</a></li>
+ 
+ <li><a href="http://64.78.49.204/">
+ Bruce Eckel's Thinking in C++, 2nd ed. Volume 2 Revision 4.0 (even better, get
+ the book).</a></li>
+ 
+ </ol>
+   
+ <p>You are also encouraged to take a look at the <a
+ href="CodingStandards.html">LLVM Coding Standards</a> guide which focuses on how
+ to write maintainable code more than where to put your curly braces.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="stl">Other useful references</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ol>
+ <li><a href="http://www.psc.edu/%7Esemke/cvs_branches.html">CVS
+ Branch and Tag Primer</a></li>
+ <li><a href="http://www.fortran-2000.com/ArnaudRecipes/sharedlib.html">Using
+ static and shared libraries across platforms</a></li>
+ </ol>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="apis">Important and useful LLVM APIs</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Here we highlight some LLVM APIs that are generally useful and good to
+ know about when writing transformations.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="isa">The <tt>isa<></tt>, <tt>cast<></tt> and
+   <tt>dyn_cast<></tt> templates</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM source-base makes extensive use of a custom form of RTTI.
+ These templates have many similarities to the C++ <tt>dynamic_cast<></tt>
+ operator, but they don't have some drawbacks (primarily stemming from
+ the fact that <tt>dynamic_cast<></tt> only works on classes that
+ have a v-table). Because they are used so often, you must know what they
+ do and how they work. All of these templates are defined in the <a
+  href="/doxygen/Casting_8h-source.html"><tt>llvm/Support/Casting.h</tt></a>
+ file (note that you very rarely have to include this file directly).</p>
+ 
+ <dl>
+   <dt><tt>isa<></tt>: </dt>
+ 
+   <dd>The <tt>isa<></tt> operator works exactly like the Java
+   "<tt>instanceof</tt>" operator.  It returns true or false depending on whether
+   a reference or pointer points to an instance of the specified class.  This can
+   be very useful for constraint checking of various sorts (example below).</dd>
+ 
+   <dt><tt>cast<></tt>: </dt>
+ 
+   <dd>The <tt>cast<></tt> operator is a "checked cast" operation. It
+   converts a pointer or reference from a base class to a derived class, causing
+   an assertion failure if it is not really an instance of the right type.  This
+   should be used in cases where you have some information that makes you believe
+   that something is of the right type.  An example of the <tt>isa<></tt>
+   and <tt>cast<></tt> template is:
+ 
+   <pre>
+   static bool isLoopInvariant(const <a href="#Value">Value</a> *V, const Loop *L) {
+     if (isa<<a href="#Constant">Constant</a>>(V) || isa<<a href="#Argument">Argument</a>>(V) || isa<<a href="#GlobalValue">GlobalValue</a>>(V))
+       return true;
+ 
+     <i>// Otherwise, it must be an instruction...</i>
+     return !L->contains(cast<<a href="#Instruction">Instruction</a>>(V)->getParent());
+   }
+   </pre>
+ 
+   <p>Note that you should <b>not</b> use an <tt>isa<></tt> test followed
+   by a <tt>cast<></tt>; for that, use the <tt>dyn_cast<></tt>
+   operator.</p>
+ 
+   </dd>
+ 
+   <dt><tt>dyn_cast<></tt>:</dt>
+ 
+   <dd>The <tt>dyn_cast<></tt> operator is a "checking cast" operation. It
+   checks to see if the operand is of the specified type, and if so, returns a
+   pointer to it (this operator does not work with references). If the operand is
+   not of the correct type, a null pointer is returned.  Thus, this works very
+   much like the <tt>dynamic_cast<></tt> operator in C++, and should be
+   used in the same circumstances.  Typically, the <tt>dyn_cast<></tt>
+   operator is used in an <tt>if</tt> statement or some other flow control
+   statement like this:
+ 
+   <pre>
+      if (<a href="#AllocationInst">AllocationInst</a> *AI = dyn_cast<<a href="#AllocationInst">AllocationInst</a>>(Val)) {
+        ...
+      }
+   </pre>
+    
+   <p>This form of the <tt>if</tt> statement effectively combines together a call
+   to <tt>isa<></tt> and a call to <tt>cast<></tt> into one
+   statement, which is very convenient.</p>
+ 
+   <p>Note that the <tt>dyn_cast<></tt> operator, like C++'s
+   <tt>dynamic_cast<></tt> or Java's <tt>instanceof</tt> operator, can be
+   abused.  In particular, you should not use big chained <tt>if/then/else</tt>
+   blocks to check for lots of different variants of classes.  If you find
+   yourself wanting to do this, it is much cleaner and more efficient to use the
+   <tt>InstVisitor</tt> class to dispatch over the instruction type directly.</p>
+ 
+   </dd>
+ 
+   <dt><tt>cast_or_null<></tt>: </dt>
+   
+   <dd>The <tt>cast_or_null<></tt> operator works just like the
+   <tt>cast<></tt> operator, except that it allows for a null pointer as an
+   argument (which it then propagates).  This can sometimes be useful, allowing
+   you to combine several null checks into one.</dd>
+ 
+   <dt><tt>dyn_cast_or_null<></tt>: </dt>
+ 
+   <dd>The <tt>dyn_cast_or_null<></tt> operator works just like the
+   <tt>dyn_cast<></tt> operator, except that it allows for a null pointer
+   as an argument (which it then propagates).  This can sometimes be useful,
+   allowing you to combine several null checks into one.</dd>
+ 
+ </dl>
+ 
+ <p>These five templates can be used with any classes, whether they have a
+ v-table or not.  To add support for these templates, you simply need to add
+ <tt>classof</tt> static methods to the class you are interested in casting
+ to. Describing this is currently outside the scope of this document, but there
+ are lots of examples in the LLVM source base.</p>
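+ 
+ <p>As a rough illustration of the <tt>classof</tt> idiom, here is a minimal,
+ standalone sketch.  It is <b>not</b> the code in
+ <tt>llvm/Support/Casting.h</tt>: the <tt>Shape</tt>, <tt>Circle</tt> and
+ <tt>Square</tt> classes and the <tt>countCircles</tt> helper are hypothetical,
+ and the three templates are simplified to handle pointers only.  Each class
+ carries a kind tag instead of a v-table, and <tt>classof</tt> tests that
+ tag.</p>

```cpp
#include <cassert>

// Standalone sketch, NOT LLVM's Casting.h.  Shape, Circle and Square are
// hypothetical classes used only to illustrate the classof idiom.
struct Shape {
  enum Kind { CircleKind, SquareKind };
  explicit Shape(Kind K) : TheKind(K) {}
  Kind getKind() const { return TheKind; }
private:
  Kind TheKind;  // the tag plays the role a v-table would play for RTTI
};

struct Circle : Shape {
  Circle() : Shape(CircleKind) {}
  static bool classof(const Shape *S) { return S->getKind() == CircleKind; }
};

struct Square : Shape {
  Square() : Shape(SquareKind) {}
  static bool classof(const Shape *S) { return S->getKind() == SquareKind; }
};

// isa<To>(V): true if V points to an instance of To.
template <class To, class From>
bool isa(const From *V) { return To::classof(V); }

// cast<To>(V): checked cast; asserts if V is not really a To.
template <class To, class From>
To *cast(From *V) {
  assert(isa<To>(V) && "cast<>() argument of incompatible type!");
  return static_cast<To *>(V);
}

// dyn_cast<To>(V): checking cast; returns a null pointer if V is not a To.
template <class To, class From>
To *dyn_cast(From *V) { return isa<To>(V) ? static_cast<To *>(V) : 0; }

// Hypothetical helper: counts the circles in an array of N shape pointers,
// using dyn_cast<> in an if statement as described above.
inline unsigned countCircles(Shape *const *Shapes, unsigned N) {
  unsigned Count = 0;
  for (unsigned i = 0; i != N; ++i)
    if (Circle *C = dyn_cast<Circle>(Shapes[i])) {
      (void)C;  // a real client would use C here
      ++Count;
    }
  return Count;
}
```

+ <p>In this sketch, adding a new subclass only requires a new tag value and a
+ matching <tt>classof</tt> method; the templates themselves do not change.</p>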
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="DEBUG">The <tt>DEBUG()</tt> macro and <tt>-debug</tt> option</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Often when working on your pass you will put a bunch of debugging printouts
+ and other code into your pass.  After you get it working, you want to remove
+ it... but you may need it again in the future (to work out new bugs that you run
+ across).</p>
+ 
+ <p> Naturally, because of this, you don't want to delete the debug printouts,
+ but you don't want them to always be noisy.  A standard compromise is to comment
+ them out, allowing you to enable them if you need them in the future.</p>
+ 
+ <p>The "<tt><a href="/doxygen/Debug_8h-source.html">llvm/Support/Debug.h</a></tt>"
+ file provides a macro named <tt>DEBUG()</tt> that is a much nicer solution to
+ this problem.  Basically, you can put arbitrary code into the argument of the
+ <tt>DEBUG</tt> macro, and it is only executed if '<tt>opt</tt>' (or any other
+ tool) is run with the '<tt>-debug</tt>' command line argument:</p>
+ 
+   <pre>     ... <br>     DEBUG(std::cerr << "I am here!\n");<br>     ...<br></pre>
+ 
+ <p>Then you can run your pass like this:</p>
+ 
+   <pre>  $ opt < a.bc > /dev/null -mypass<br>    <no output><br>  $ opt < a.bc > /dev/null -mypass -debug<br>    I am here!<br>  $<br></pre>
+ 
+ <p>Using the <tt>DEBUG()</tt> macro instead of a home-brewed solution allows you
+ to not have to create "yet another" command line option for the debug output for
+ your pass.  Note that <tt>DEBUG()</tt> macros are disabled for optimized builds,
+ so they do not cause a performance impact at all (for the same reason, they
+ should also not contain side-effects!).</p>
+ 
+ <p>One additional nice thing about the <tt>DEBUG()</tt> macro is that you can
+ enable or disable it directly in gdb.  Just use "<tt>set DebugFlag=0</tt>" or
+ "<tt>set DebugFlag=1</tt>" from gdb if the program is running.  If the
+ program hasn't been started yet, you can always just run it with
+ <tt>-debug</tt>.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="DEBUG_TYPE">Fine grained debug info with <tt>DEBUG_TYPE</tt> and
+   the <tt>-debug-only</tt> option</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Sometimes you may find yourself in a situation where enabling <tt>-debug</tt>
+ just turns on <b>too much</b> information (such as when working on the code
+ generator).  If you want to enable debug information with more fine-grained
+ control, you define the <tt>DEBUG_TYPE</tt> macro and use the
+ <tt>-debug-only</tt> option as follows:</p>
+ 
+   <pre>     ...<br>     DEBUG(std::cerr << "No debug type\n");<br>     #undef  DEBUG_TYPE<br>     #define DEBUG_TYPE "foo"<br>     DEBUG(std::cerr << "'foo' debug type\n");<br>     #undef  DEBUG_TYPE<br>     #define DEBUG_TYPE "bar"<br>     DEBUG(std::cerr << "'bar' debug type\n");<br>     #undef  DEBUG_TYPE<br>     #define DEBUG_TYPE ""<br>     DEBUG(std::cerr << "No debug type (2)\n");<br>     ...<br></pre>
+ 
+ <p>Then you can run your pass like this:</p>
+ 
+   <pre>  $ opt < a.bc > /dev/null -mypass<br>    <no output><br>  $ opt < a.bc > /dev/null -mypass -debug<br>    No debug type<br>    'foo' debug type<br>    'bar' debug type<br>    No debug type (2)<br>  $ opt < a.bc > /dev/null -mypass -debug-only=foo<br>    'foo' debug type<br>  $ opt < a.bc > /dev/null -mypass -debug-only=bar<br>    'bar' debug type<br>  $<br></pre>
+ 
+ <p>Of course, in practice, you should only set <tt>DEBUG_TYPE</tt> at the top of
+ a file, to specify the debug type for the entire module (if you do this before
+ you <tt>#include "llvm/Support/Debug.h"</tt>, you don't have to insert the ugly
+ <tt>#undef</tt>'s).  Also, you should use names more meaningful than "foo" and
+ "bar", because there is no system in place to ensure that names do not
+ conflict. If two different modules use the same string, they will all be turned
+ on when the name is specified. This allows, for example, all debug information
+ for instruction scheduling to be enabled with <tt>-debug-only=InstrSched</tt>,
+ even if the source lives in multiple files.</p>
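+ 
+ <p>To make the filtering behaviour concrete, the following standalone sketch
+ mimics it with a home-grown macro.  It is an illustration under stated
+ assumptions, not the contents of <tt>llvm/Support/Debug.h</tt>:
+ <tt>MY_DEBUG</tt>, <tt>CurrentDebugOnly</tt>, <tt>isCurrentDebugType</tt> and
+ <tt>runPass</tt> are hypothetical names, and output goes to a string stream so
+ that the result can be inspected.</p>

```cpp
#include <cassert>
#include <cstring>
#include <sstream>
#include <string>

// Standalone sketch, not llvm/Support/Debug.h.  DebugFlag stands in for the
// -debug option; CurrentDebugOnly (hypothetical) for -debug-only=<name>.
static bool DebugFlag = false;
static const char *CurrentDebugOnly = "";

// An empty filter accepts every debug type; otherwise the names must match.
static bool isCurrentDebugType(const char *Type) {
  return CurrentDebugOnly[0] == '\0' ||
         std::strcmp(CurrentDebugOnly, Type) == 0;
}

// MY_DEBUG (hypothetical) runs X only when debugging is on and the
// DEBUG_TYPE in effect at the point of use passes the filter.
#define MY_DEBUG(X) \
  do { if (DebugFlag && isCurrentDebugType(DEBUG_TYPE)) { X; } } while (0)

// Hypothetical "pass" that emits one message per debug type.
static std::string runPass(bool Debug, const char *Only) {
  DebugFlag = Debug;
  CurrentDebugOnly = Only;
  std::ostringstream OS;
#define DEBUG_TYPE "foo"
  MY_DEBUG(OS << "'foo' debug type\n");
#undef DEBUG_TYPE
#define DEBUG_TYPE "bar"
  MY_DEBUG(OS << "'bar' debug type\n");
#undef DEBUG_TYPE
  return OS.str();
}
```

+ <p>Because <tt>DEBUG_TYPE</tt> is expanded where the macro is used, each
+ message picks up whichever definition is in effect at that line.</p>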
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Statistic">The <tt>Statistic</tt> template & <tt>-stats</tt>
+   option</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The "<tt><a
+ href="/doxygen/Statistic_8h-source.html">llvm/ADT/Statistic.h</a></tt>" file
+ provides a template named <tt>Statistic</tt> that is used as a unified way to
+ keep track of what the LLVM compiler is doing and how effective various
+ optimizations are.  It is useful to see what optimizations are contributing to
+ making a particular program run faster.</p>
+ 
+ <p>Often you may run your pass on some big program, and you're interested to see
+ how many times it makes a certain transformation.  Although you can do this with
+ hand inspection, or some ad-hoc method, this is a real pain and not very useful
+ for big programs.  Using the <tt>Statistic</tt> template makes it very easy to
+ keep track of this information, and the calculated information is presented in a
+ uniform manner with the rest of the passes being executed.</p>
+ 
+ <p>There are many examples of <tt>Statistic</tt> uses, but the basics of using
+ it are as follows:</p>
+ 
+ <ol>
+     <li>Define your statistic like this:
+       <pre>static Statistic<> NumXForms("mypassname", "The # of times I did stuff");<br></pre>
+ 
+       <p>The <tt>Statistic</tt> template can emulate just about any data-type,
+       but if you do not specify a template argument, it defaults to acting like
+       an unsigned int counter (this is usually what you want).</p></li>
+ 
+     <li>Whenever you make a transformation, bump the counter:
+       <pre>   ++NumXForms;   // I did stuff<br></pre>
+     </li>
+   </ol>
+ 
+   <p>That's all you have to do.  To get '<tt>opt</tt>' to print out the
+   statistics gathered, use the '<tt>-stats</tt>' option:</p>
+ 
+   <pre>   $ opt -stats -mypassname < program.bc > /dev/null<br>    ... statistic output ...<br></pre>
+ 
+   <p> When running <tt>gccas</tt> on a C file from the SPEC benchmark
+ suite, it gives a report that looks like this:</p>
+ 
+   <pre>   7646 bytecodewriter  - Number of normal instructions<br>    725 bytecodewriter  - Number of oversized instructions<br> 129996 bytecodewriter  - Number of bytecode bytes written<br>   2817 raise           - Number of insts DCEd or constprop'd<br>   3213 raise           - Number of cast-of-self removed<br>   5046 raise           - Number of expression trees converted<br>     75 raise           - Number of other getelementptr's formed<br>    138 raise           - Number of load/store peepholes<br>     42 deadtypeelim    - Number of unused typenames removed from symtab<br>    392 funcresolve     - Number of varargs functions resolved<br>     27 globaldce       - Number of global variables removed<br>      2 adce            - Number of basic blocks removed<br>    134 cee             - Number of branches revectored<br>     49 cee             - Number of setcc instruction eliminated<br>    532 gcse            - Number of loads removed<br>   2919 gcse            - Number of instructions removed<br>     86 indvars         - Number of canonical indvars added<br>     87 indvars         - Number of aux indvars removed<br>     25 instcombine     - Number of dead inst eliminate<br>    434 instcombine     - Number of insts combined<br>    248 licm            - Number of load insts hoisted<br>   1298 licm            - Number of insts hoisted to a loop pre-header<br>      3 licm            - Number of insts hoisted to multiple loop preds (bad, no loop pre-header)<br>     75 mem2reg         - Number of alloca's promoted<br>   1444 cfgsimplify     - Number of blocks simplified<br></pre>
+ 
+ <p>Obviously, with so many optimizations, having a unified framework for this
+ stuff is very nice.  Making your pass fit well into the framework makes it more
+ maintainable and useful.</p>
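+ 
+ <p>The general idea of a self-registering counter can be sketched in a few
+ lines.  The code below is a standalone illustration, not the
+ <tt>Statistic</tt> template itself: the <tt>Counter</tt> class, its
+ <tt>printAll</tt> method and the <tt>dropEvens</tt> "transformation" are
+ hypothetical, and the report format is only loosely modelled on the output
+ shown above.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Standalone sketch, not llvm/ADT/Statistic.h.  Counter is a hypothetical
// class: each instance registers itself so one call can report them all.
class Counter {
  const char *Name, *Desc;
  unsigned Value;
  static std::vector<const Counter *> &registry() {
    static std::vector<const Counter *> R;  // created on first use
    return R;
  }
public:
  Counter(const char *N, const char *D) : Name(N), Desc(D), Value(0) {
    registry().push_back(this);
  }
  Counter &operator++() { ++Value; return *this; }
  unsigned get() const { return Value; }

  // Print every counter that was bumped at least once.
  static void printAll(std::ostream &OS) {
    const std::vector<const Counter *> &R = registry();
    for (std::size_t i = 0; i != R.size(); ++i)
      if (R[i]->Value != 0)
        OS << R[i]->Value << " " << R[i]->Name << " - " << R[i]->Desc << "\n";
  }
};

static Counter NumXForms("mypassname", "The # of times I did stuff");

// Hypothetical transformation: drops even numbers, bumping the counter
// once per element removed.
static std::vector<int> dropEvens(const std::vector<int> &In) {
  std::vector<int> Out;
  for (std::size_t i = 0; i != In.size(); ++i) {
    if (In[i] % 2 == 0) {
      ++NumXForms;   // I did stuff
      continue;
    }
    Out.push_back(In[i]);
  }
  return Out;
}
```

+ <p>The transformation code only ever touches <tt>++NumXForms</tt>; collecting
+ and printing the totals is left entirely to the shared reporting routine.</p>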
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ViewGraph">Viewing graphs while debugging code</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Several of the important data structures in LLVM are graphs: for example
+ CFGs made out of LLVM <a href="#BasicBlock">BasicBlock</a>s, CFGs made out of
+ LLVM <a href="CodeGenerator.html#machinebasicblock">MachineBasicBlock</a>s, and
+ <a href="CodeGenerator.html#selectiondag_intro">Instruction Selection
+ DAGs</a>.  In many cases, while debugging various parts of the compiler, it is
+ nice to instantly visualize these graphs.</p>
+ 
+ <p>LLVM provides several callbacks that are available in a debug build to do
+ exactly that.  If you call the <tt>Function::viewCFG()</tt> method, for example,
+ the current LLVM tool will pop up a window containing the CFG for the function
+ where each basic block is a node in the graph, and each node contains the
+ instructions in the block.  Similarly, there also exists 
+ <tt>Function::viewCFGOnly()</tt> (does not include the instructions), the
+ <tt>MachineFunction::viewCFG()</tt> and <tt>MachineFunction::viewCFGOnly()</tt>,
+ and the <tt>SelectionDAG::viewGraph()</tt> methods.  Within GDB, for example,
+ you can usually use something like "<tt>call DAG.viewGraph()</tt>" to pop
+ up a window.  Alternatively, you can sprinkle calls to these functions in your
+ code in places you want to debug.</p>
+ 
+ <p>Getting this to work requires a small amount of configuration.  On Unix
+ systems with X11, install the <a href="http://www.graphviz.org">graphviz</a>
+ toolkit, and make sure 'dot' and 'gv' are in your path.  If you are running on
+ Mac OS/X, download and install the Mac OS/X <a 
+ href="http://www.pixelglow.com/graphviz/">Graphviz program</a>, and add
+ <tt>/Applications/Graphviz.app/Contents/MacOS/</tt> (or wherever you install
+ it) to your path.  Once your system and path are set up, rerun the LLVM
+ configure script and rebuild LLVM to enable this functionality.</p>
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="common">Helpful Hints for Common Operations</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section describes how to perform some very simple transformations of
+ LLVM code.  This is meant to give examples of common idioms used, showing the
+ practical side of LLVM transformations.</p>
+ 
+ <p>Because this is a "how-to" section,
+ you should also read about the main classes that you will be working with.  The
+ <a href="#coreclasses">Core LLVM Class Hierarchy Reference</a> contains details
+ and descriptions of the main classes that you should know about.</p>
+ 
+ </div>
+ 
+ <!-- NOTE: this section should be heavy on example code -->
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="inspection">Basic Inspection and Traversal Routines</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM compiler infrastructure has many different data structures that may
+ be traversed.  Following the example of the C++ standard template library, the
+ techniques used to traverse these various data structures are all basically the
+ same.  For an enumerable sequence of values, the <tt>XXXbegin()</tt> function (or
+ method) returns an iterator to the start of the sequence, the <tt>XXXend()</tt>
+ function returns an iterator pointing to one past the last valid element of the
+ sequence, and there is some <tt>XXXiterator</tt> data type that is common
+ between the two operations.</p>
+ 
+ <p>Because the pattern for iteration is common across many different aspects of
+ the program representation, the standard template library algorithms may be used
+ on them, and it is easier to remember how to iterate. First we show a few common
+ examples of the data structures that need to be traversed.  Other data
+ structures are traversed in very similar ways.</p>
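+ 
+ <p>The payoff of the common <tt>begin()</tt>/<tt>end()</tt> pattern is that
+ standard algorithms work unchanged.  The sketch below shows this with
+ hypothetical stand-ins (<tt>Block</tt> and <tt>Func</tt>) rather than the real
+ <tt>BasicBlock</tt> and <tt>Function</tt> classes, so that it compiles on its
+ own; <tt>countEmptyBlocks</tt> is likewise an invented helper.</p>

```cpp
#include <algorithm>
#include <cassert>
#include <list>
#include <string>

// Hypothetical stand-in for a basic block: a name and an instruction count.
struct Block {
  std::string Name;
  unsigned NumInsts;
  Block(const std::string &N, unsigned C) : Name(N), NumInsts(C) {}
};

// Hypothetical stand-in for a function: a sequence of blocks that exposes
// the usual begin()/end() pair and an iterator typedef.
class Func {
  std::list<Block> Blocks;
public:
  typedef std::list<Block>::const_iterator const_iterator;
  const_iterator begin() const { return Blocks.begin(); }
  const_iterator end() const { return Blocks.end(); }
  void addBlock(const Block &B) { Blocks.push_back(B); }
};

static bool isEmptyBlock(const Block &B) { return B.NumInsts == 0; }

// Because Func follows the iterator pattern, std::count_if applies directly.
static unsigned countEmptyBlocks(const Func &F) {
  return static_cast<unsigned>(
      std::count_if(F.begin(), F.end(), isEmptyBlock));
}
```
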
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="iterate_function">Iterating over the </a><a
+   href="#BasicBlock"><tt>BasicBlock</tt></a>s in a <a
+   href="#Function"><tt>Function</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>It's quite common to have a <tt>Function</tt> instance that you'd like to
+ transform in some way; in particular, you'd like to manipulate its
+ <tt>BasicBlock</tt>s.  To facilitate this, you'll need to iterate over all of
+ the <tt>BasicBlock</tt>s that constitute the <tt>Function</tt>. The following is
+ an example that prints the name of a <tt>BasicBlock</tt> and the number of
+ <tt>Instruction</tt>s it contains:</p>
+ 
+   <pre>  // func is a pointer to a Function instance<br>  for (Function::iterator i = func->begin(), e = func->end(); i != e; ++i) {<br><br>      // print out the name of the basic block if it has one, and then the<br>      // number of instructions that it contains<br><br>      std::cerr << "Basic block (name=" << i->getName() << ") has " <br>           << i->size() << " instructions.\n";<br>  }<br></pre>
+ 
+ <p>Note that <tt>i</tt> can be used as if it were a pointer for the purposes of
+ invoking member functions of the <tt>BasicBlock</tt> class.  This is
+ because the indirection operator is overloaded for the iterator
+ classes.  In the above code, the expression <tt>i->size()</tt> is
+ exactly equivalent to <tt>(*i).size()</tt> just like you'd expect.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="iterate_basicblock">Iterating over the </a><a
+   href="#Instruction"><tt>Instruction</tt></a>s in a <a
+   href="#BasicBlock"><tt>BasicBlock</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Just like when dealing with <tt>BasicBlock</tt>s in <tt>Function</tt>s, it's
+ easy to iterate over the individual instructions that make up
+ <tt>BasicBlock</tt>s. Here's a code snippet that prints out each instruction in
+ a <tt>BasicBlock</tt>:</p>
+ 
+ <pre>
+   // blk is a pointer to a BasicBlock instance
+   for (BasicBlock::iterator i = blk->begin(), e = blk->end(); i != e; ++i)
+      // the next statement works since operator<<(ostream&,...)
+      // is overloaded for Instruction&
+      std::cerr << *i << "\n";
+ </pre>
+ 
+ <p>However, this isn't really the best way to print out the contents of a
+ <tt>BasicBlock</tt>!  Since the ostream operators are overloaded for virtually
+ anything you'll care about, you could have just invoked the print routine on the
+ basic block itself: <tt>std::cerr << *blk << "\n";</tt>.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="iterate_institer">Iterating over the </a><a
+   href="#Instruction"><tt>Instruction</tt></a>s in a <a
+   href="#Function"><tt>Function</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>If you're finding that you commonly iterate over a <tt>Function</tt>'s
+ <tt>BasicBlock</tt>s and then that <tt>BasicBlock</tt>'s <tt>Instruction</tt>s,
+ <tt>InstIterator</tt> should be used instead. You'll need to include <a
+ href="/doxygen/InstIterator_8h-source.html"><tt>llvm/Support/InstIterator.h</tt></a>,
+ and then instantiate <tt>InstIterator</tt>s explicitly in your code.  Here's a
+ small example that shows how to dump all instructions in a function to the standard error stream:</p>
+ 
+   <pre>#include "<a href="/doxygen/InstIterator_8h-source.html">llvm/Support/InstIterator.h</a>"<br>...<br>// Suppose F is a ptr to a function<br>for (inst_iterator i = inst_begin(F), e = inst_end(F); i != e; ++i)<br>  std::cerr << *i << "\n";<br></pre>
+ Easy, isn't it?  You can also use <tt>InstIterator</tt>s to fill a
+ worklist with its initial contents.  For example, if you wanted to
+ initialize a worklist to contain all instructions in a <tt>Function</tt>
+ F, all you would need to do is something like:
+   <pre>std::set<Instruction*> worklist;<br>worklist.insert(inst_begin(F), inst_end(F));<br></pre>
+ 
+ <p>The STL set <tt>worklist</tt> would now contain all instructions in the
+ <tt>Function</tt> pointed to by F.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="iterate_convert">Turning an iterator into a class pointer (and
+   vice-versa)</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Sometimes, it'll be useful to grab a reference (or pointer) to a class
+ instance when all you've got at hand is an iterator.  Well, extracting
+ a reference or a pointer from an iterator is very straight-forward.
+ Assuming that <tt>i</tt> is a <tt>BasicBlock::iterator</tt> and <tt>j</tt>
+ is a <tt>BasicBlock::const_iterator</tt>:</p>
+ 
+   <pre>    Instruction& inst = *i;   // grab a reference to the instruction<br>    Instruction* pinst = &*i; // grab a pointer to the instruction<br>    const Instruction& cinst = *j; // a const_iterator yields a const reference<br></pre>
+ 
+ <p>However, the iterators you'll be working with in the LLVM framework are
+ special: they will automatically convert to a ptr-to-instance type whenever they
+ need to.  Instead of dereferencing the iterator and then taking the address of
+ the result, you can simply assign the iterator to the proper pointer type and
+ you get the dereference and address-of operation as a result of the assignment
+ (behind the scenes, this is a result of overloading casting mechanisms).  Thus
+ the second line of the last example,</p>
+ 
+   <pre>Instruction* pinst = &*i;</pre>
+ 
+ <p>is semantically equivalent to</p>
+ 
+   <pre>Instruction* pinst = i;</pre>
+ 
+ <p>It's also possible to turn a class pointer into the corresponding iterator,
+ and this is a constant time operation (very efficient).  The following code
+ snippet illustrates use of the conversion constructors provided by LLVM
+ iterators.  By using these, you can explicitly grab the iterator of something
+ without actually obtaining it via iteration over some structure:</p>
+ 
+   <pre>void printNextInstruction(Instruction* inst) {<br>    BasicBlock::iterator it(inst);<br>    ++it; // after this line, it refers to the instruction after *inst.<br>    if (it != inst->getParent()->end()) std::cerr << *it << "\n";<br>}<br></pre>
+ 
+ </div>
+ 
+ <!--_______________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="iterate_complex">Finding call sites: a slightly more complex
+   example</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Say that you're writing a FunctionPass and would like to count all the
+ locations in the entire module (that is, across every <tt>Function</tt>) where a
+ certain function (i.e., some <tt>Function</tt>* that is already in scope) is called.  As you'll
+ learn later, you may want to use an <tt>InstVisitor</tt> to accomplish this in a
+ much more straight-forward manner, but this example will allow us to explore how
+ you'd do it if you didn't have <tt>InstVisitor</tt> around. In pseudocode, this
+ is what we want to do:</p>
+ 
+   <pre>initialize callCounter to zero<br>for each Function f in the Module<br>    for each BasicBlock b in f<br>      for each Instruction i in b<br>        if (i is a CallInst and calls the given function)<br>          increment callCounter<br></pre>
+ 
+ <p>And the actual code is (remember, since we're writing a
+ <tt>FunctionPass</tt>, our <tt>FunctionPass</tt>-derived class simply has to
+ override the <tt>runOnFunction</tt> method...):</p>
+ 
+   <pre>Function* targetFunc = ...;<br><br>class OurFunctionPass : public FunctionPass {<br>  public:<br>    OurFunctionPass(): callCounter(0) { }<br><br>    virtual bool runOnFunction(Function& F) {<br>      for (Function::iterator b = F.begin(), be = F.end(); b != be; ++b) {<br>        for (BasicBlock::iterator i = b->begin(), ie = b->end(); i != ie; ++i) {<br>          if (<a
+  href="#CallInst">CallInst</a>* callInst = <a href="#isa">dyn_cast</a><<a
+  href="#CallInst">CallInst</a>>(&*i)) {<br>            // we know we've encountered a call instruction, so we<br>            // need to determine if it's a call to the<br>            // function pointed to by targetFunc or not.<br><br>            if (callInst->getCalledFunction() == targetFunc)<br>              ++callCounter;<br>          }<br>        }<br>      }<br>      return false; // this pass only counts; it does not modify F<br>    }<br><br>  private:<br>    unsigned callCounter;<br>};<br></pre>
+ 
+ </div>
+ 
+ <!--_______________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="calls_and_invokes">Treating calls and invokes the same way</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>You may have noticed that the previous example was a bit oversimplified in
+ that it did not deal with call sites generated by 'invoke' instructions. In
+ this, and in other situations, you may find that you want to treat
+ <tt>CallInst</tt>s and <tt>InvokeInst</tt>s the same way, even though their
+ most-specific common base class is <tt>Instruction</tt>, which includes lots of
+ less closely-related things. For these cases, LLVM provides a handy wrapper
+ class called <a
+ href="http://llvm.org/doxygen/classllvm_1_1CallSite.html"><tt>CallSite</tt></a>.
+ It is essentially a wrapper around an <tt>Instruction</tt> pointer, with some
+ methods that provide functionality common to <tt>CallInst</tt>s and
+ <tt>InvokeInst</tt>s.</p>
+ 
+ <p>This class has "value semantics": it should be passed by value, not by
+ reference, and it should not be dynamically allocated or deallocated using
+ <tt>operator new</tt> or <tt>operator delete</tt>. It is efficiently copyable,
+ assignable and constructible, with a cost equivalent to that of a bare pointer.
+ If you look at its definition, it has only a single pointer member.</p>
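+ 
+ <p>To make the "value semantics" point concrete, here is a minimal,
+ self-contained sketch of a one-pointer wrapper in the same spirit. It is not
+ the real <tt>CallSite</tt> class: <tt>Inst</tt>, <tt>SiteRef</tt>,
+ <tt>callsNamed</tt> and <tt>siteRefExample</tt> are hypothetical names invented
+ for this illustration.</p>

```cpp
#include <cassert>
#include <cstring>

// Hypothetical stand-in for an instruction; NOT the LLVM Instruction class.
struct Inst {
  enum Kind { Call, Invoke, Other };
  Kind kind;
  const char *callee;  // name of the called function, or 0 if none
  Inst(Kind k, const char *c) : kind(k), callee(c) {}
};

// Hypothetical wrapper with value semantics: its only data member is a pointer.
class SiteRef {
  Inst *I;  // 0 when the wrapped instruction is neither a call nor an invoke
public:
  SiteRef() : I(0) {}
  explicit SiteRef(Inst *inst)
    : I(inst && inst->kind != Inst::Other ? inst : 0) {}
  bool isValid() const { return I != 0; }
  const char *getCallee() const { return I ? I->callee : 0; }
};

// Takes the wrapper by value: copying it copies exactly one pointer.
inline bool callsNamed(SiteRef s, const char *name) {
  return s.isValid() && s.getCallee() != 0 &&
         std::strcmp(s.getCallee(), name) == 0;
}

// Small usage example: calls and invokes are handled by the same code path.
inline void siteRefExample() {
  Inst call(Inst::Call, "foo"), invoke(Inst::Invoke, "foo"), other(Inst::Other, 0);
  assert(callsNamed(SiteRef(&call), "foo"));
  assert(callsNamed(SiteRef(&invoke), "foo"));
  assert(!SiteRef(&other).isValid());
}
```

+ <p>Because the wrapper holds nothing but a pointer, passing it by value is as
+ cheap as passing the pointer itself, which is why there is no reason to
+ allocate such an object on the heap.</p>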
+ 
+ </div>
+ 
+ <!--_______________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="iterate_chains">Iterating over def-use & use-def chains</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Frequently, we might have an instance of the <a
+ href="/doxygen/structllvm_1_1Value.html">Value Class</a> and we want to
+ determine which <tt>User</tt>s use the <tt>Value</tt>.  The list of all
+ <tt>User</tt>s of a particular <tt>Value</tt> is called a <i>def-use</i> chain.
+ For example, let's say we have a <tt>Function*</tt> named <tt>F</tt> that points to a
+ particular function <tt>foo</tt>. Finding all of the instructions that
+ <i>use</i> <tt>foo</tt> is as simple as iterating over the <i>def-use</i> chain
+ of <tt>F</tt>:</p>
+ 
+   <pre>Function* F = ...;<br><br>for (Value::use_iterator i = F->use_begin(), e = F->use_end(); i != e; ++i) {<br>    if (Instruction *Inst = dyn_cast<Instruction>(*i)) {<br>        std::cerr << "F is used in instruction:\n";<br>        std::cerr << *Inst << "\n";<br>    }<br>}<br></pre>
+ 
+ <p>Alternatively, it's common to have an instance of the <a
+ href="/doxygen/classllvm_1_1User.html">User Class</a> and need to know what
+ <tt>Value</tt>s are used by it.  The list of all <tt>Value</tt>s used by a
+ <tt>User</tt> is known as a <i>use-def</i> chain.  Instances of class
+ <tt>Instruction</tt> are common <tt>User</tt>s, so we might want to iterate over
+ all of the values that a particular instruction uses (that is, the operands of
+ the particular <tt>Instruction</tt>):</p>
+ 
+   <pre>Instruction* pi = ...;<br><br>for (User::op_iterator i = pi->op_begin(), e = pi->op_end(); i != e; ++i) {<br>    Value* v = *i;<br>    ...<br>}<br></pre>
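+ 
+ <p>The relationship between the two chains can be sketched with a toy model.
+ The types below (<tt>ToyValue</tt>, <tt>ToyUser</tt>) and the helper
+ <tt>countUses</tt> are hypothetical and far simpler than the real
+ <tt>Value</tt> and <tt>User</tt> classes; the sketch only shows that adding an
+ operand updates both directions at once.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

struct ToyUser;

// Hypothetical miniature of Value: it remembers every user that refers to it.
struct ToyValue {
  std::vector<ToyUser*> users;     // the def-use chain of this value
  virtual ~ToyValue() {}
};

// Hypothetical miniature of User: a value that also has operands.
struct ToyUser : ToyValue {
  std::vector<ToyValue*> operands; // the use-def chain of this user
  void addOperand(ToyValue *v) {
    operands.push_back(v);
    v->users.push_back(this);      // keep the def-use side in sync
  }
};

// Count how many times 'user' appears in the def-use chain of 'v'.
inline std::size_t countUses(const ToyValue &v, const ToyUser *user) {
  std::size_t n = 0;
  for (std::vector<ToyUser*>::const_iterator i = v.users.begin(),
       e = v.users.end(); i != e; ++i)
    if (*i == user) ++n;
  return n;
}

// Small usage example: add = a + b; mul = a * add.
inline void chainExample() {
  ToyValue a, b;
  ToyUser add, mul;
  add.addOperand(&a); add.addOperand(&b);
  mul.addOperand(&a); mul.addOperand(&add);
  assert(a.users.size() == 2);       // a is used by add and mul
  assert(mul.operands.size() == 2);  // mul uses a and add
}
```

+ <p>Walking <tt>users</tt> answers "who uses this value?", while walking
+ <tt>operands</tt> answers "what does this user depend on?"; these correspond
+ to the <tt>use_begin/use_end</tt> and <tt>op_begin/op_end</tt> loops shown
+ above.</p>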
+ 
+ <!--
+   def-use chains ("finding all users of"): Value::use_begin/use_end
+   use-def chains ("finding all values used"): User::op_begin/op_end [op=operand]
+ -->
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="simplechanges">Making simple changes</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>There are some primitive transformation operations present in the LLVM
+ infrastructure that are worth knowing about.  When performing
+ transformations, it's fairly common to manipulate the contents of basic
+ blocks. This section describes some of the common methods for doing so
+ and gives example code.</p>
+ 
+ </div>
+ 
+ <!--_______________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="schanges_creating">Creating and inserting new
+   <tt>Instruction</tt>s</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><i>Instantiating Instructions</i></p>
+ 
+ <p>Creation of <tt>Instruction</tt>s is straight-forward: simply call the
+ constructor for the kind of instruction to instantiate and provide the necessary
+ parameters. For example, an <tt>AllocaInst</tt> only <i>requires</i> a
+ (const-ptr-to) <tt>Type</tt>. Thus:</p> 
+ 
+ <pre>AllocaInst* ai = new AllocaInst(Type::IntTy);</pre>
+ 
+ <p>will create an <tt>AllocaInst</tt> instance that represents the allocation of
+ one integer in the current stack frame, at runtime. Each <tt>Instruction</tt>
+ subclass is likely to have varying default parameters which change the semantics
+ of the instruction, so refer to the <a
+ href="/doxygen/classllvm_1_1Instruction.html">doxygen documentation for the subclass of
+ Instruction</a> that you're interested in instantiating.</p>
+ 
+ <p><i>Naming values</i></p>
+ 
+ <p>It is very useful to name the values of instructions when you're able to, as
+ this facilitates the debugging of your transformations.  If you end up looking
+ at generated LLVM machine code, you definitely want to have logical names
+ associated with the results of instructions!  By supplying a value for the
+ <tt>Name</tt> (default) parameter of the <tt>Instruction</tt> constructor, you
+ associate a logical name with the result of the instruction's execution at
+ runtime.  For example, say that I'm writing a transformation that dynamically
+ allocates space for an integer on the stack, and that integer is going to be
+ used as some kind of index by some other code.  To accomplish this, I place an
+ <tt>AllocaInst</tt> at the first point in the first <tt>BasicBlock</tt> of some
+ <tt>Function</tt>, and I'm intending to use it within the same
+ <tt>Function</tt>. I might do:</p>
+ 
+   <pre>AllocaInst* pa = new AllocaInst(Type::IntTy, 0, "indexLoc");</pre>
+ 
+ <p>where <tt>indexLoc</tt> is now the logical name of the instruction's
+ execution value, which is a pointer to an integer on the runtime stack.</p>
+ 
+ <p><i>Inserting instructions</i></p>
+ 
+ <p>There are essentially two ways to insert an <tt>Instruction</tt>
+ into an existing sequence of instructions that form a <tt>BasicBlock</tt>:</p>
+ 
+ <ul>
+   <li>Insertion into an explicit instruction list
+ 
+     <p>Given a <tt>BasicBlock* pb</tt>, an <tt>Instruction* pi</tt> within that
+     <tt>BasicBlock</tt>, and a newly-created instruction we wish to insert
+     before <tt>*pi</tt>, we do the following: </p>
+ 
+       <pre>  BasicBlock *pb = ...;<br>  Instruction *pi = ...;<br>  Instruction *newInst = new Instruction(...);<br>  pb->getInstList().insert(pi, newInst); // inserts newInst before pi in pb<br></pre>
+ 
+     <p>Appending to the end of a <tt>BasicBlock</tt> is so common that
+     the <tt>Instruction</tt> class and <tt>Instruction</tt>-derived
+     classes provide constructors which take a pointer to a
+     <tt>BasicBlock</tt> to be appended to. For example, code that
+     looked like: </p>
+ 
+       <pre>  BasicBlock *pb = ...;<br>  Instruction *newInst = new Instruction(...);<br>  pb->getInstList().push_back(newInst); // appends newInst to pb<br></pre>
+ 
+     <p>becomes: </p>
+ 
+       <pre>  BasicBlock *pb = ...;<br>  Instruction *newInst = new Instruction(..., pb);<br></pre>
+ 
+     <p>which is much cleaner, especially if you are creating
+     long instruction streams.</p></li>
+ 
+   <li>Insertion into an implicit instruction list
+ 
+     <p><tt>Instruction</tt> instances that are already in <tt>BasicBlock</tt>s
+     are implicitly associated with an existing instruction list: the instruction
+     list of the enclosing basic block. Thus, we could have accomplished the same
+     thing as the above code without being given a <tt>BasicBlock</tt> by doing:
+     </p>
+ 
+       <pre>  Instruction *pi = ...;<br>  Instruction *newInst = new Instruction(...);<br>  pi->getParent()->getInstList().insert(pi, newInst);<br></pre>
+ 
+     <p>In fact, this sequence of steps occurs so frequently that the
+     <tt>Instruction</tt> class and <tt>Instruction</tt>-derived classes provide
+     constructors which take (as a default parameter) a pointer to an
+     <tt>Instruction</tt> which the newly-created <tt>Instruction</tt> should
+     precede.  That is, <tt>Instruction</tt> constructors are capable of
+     inserting the newly-created instance into the <tt>BasicBlock</tt> of a
+     provided instruction, immediately before that instruction.  Using an
+     <tt>Instruction</tt> constructor with a <tt>insertBefore</tt> (default)
+     parameter, the above code becomes:</p>
+ 
+       <pre>Instruction* pi = ...;<br>Instruction* newInst = new Instruction(..., pi);<br></pre>
+ 
+     <p>which is much cleaner, especially if you're creating a lot of
+ instructions and adding them to <tt>BasicBlock</tt>s.</p></li>
+ </ul>
+ 
+ </div>
+ 
+ <!--_______________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="schanges_deleting">Deleting <tt>Instruction</tt>s</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Deleting an instruction from an existing sequence of instructions that form a
+ <a href="#BasicBlock"><tt>BasicBlock</tt></a> is very straight-forward. First,
+ you must have a pointer to the instruction that you wish to delete.  Second, you
+ need to obtain the pointer to that instruction's basic block. You use the
+ pointer to the basic block to get its list of instructions and then use the
+ erase function to remove your instruction. For example:</p>
+ 
+   <pre>  <a href="#Instruction">Instruction</a> *I = .. ;<br>  <a
+  href="#BasicBlock">BasicBlock</a> *BB = I->getParent();<br>  BB->getInstList().erase(I);<br></pre>
+ 
+ </div>
+ 
+ <!--_______________________________________________________________________-->
+ <div class="doc_subsubsection">
+   <a name="schanges_replacing">Replacing an <tt>Instruction</tt> with another
+   <tt>Value</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><i>Replacing individual instructions</i></p>
+ 
+ <p>Including "<a href="/doxygen/BasicBlockUtils_8h-source.html">llvm/Transforms/Utils/BasicBlockUtils.h</a>"
+ permits use of two very useful replace functions: <tt>ReplaceInstWithValue</tt>
+ and <tt>ReplaceInstWithInst</tt>.</p>
+ 
+ 
+ <ul>
+   <li><tt>ReplaceInstWithValue</tt>
+ 
+     <p>This function replaces all uses (within a basic block) of a given
+     instruction with a value, and then removes the original instruction. The
+     following example illustrates the replacement of the result of a particular
+     <tt>AllocaInst</tt> that allocates memory for a single integer with a null
+     pointer to an integer.</p>
+ 
+       <pre>AllocaInst* instToReplace = ...;<br>BasicBlock::iterator ii(instToReplace);<br>ReplaceInstWithValue(instToReplace->getParent()->getInstList(), ii,<br>                     Constant::getNullValue(PointerType::get(Type::IntTy)));<br></pre></li>
+ 
+   <li><tt>ReplaceInstWithInst</tt> 
+ 
+     <p>This function replaces a particular instruction with another
+     instruction. The following example illustrates the replacement of one
+     <tt>AllocaInst</tt> with another.</p>
+ 
+       <pre>AllocaInst* instToReplace = ...;<br>BasicBlock::iterator ii(instToReplace);<br>ReplaceInstWithInst(instToReplace->getParent()->getInstList(), ii,<br>                    new AllocaInst(Type::IntTy, 0, "ptrToReplacedInt"));<br></pre></li>
+ </ul>
+ 
+ <p><i>Replacing multiple uses of <tt>User</tt>s and <tt>Value</tt>s</i></p>
+ 
+ <p>You can use <tt>Value::replaceAllUsesWith</tt> and
+ <tt>User::replaceUsesOfWith</tt> to change more than one use at a time.  See the
+ doxygen documentation for the <a href="/doxygen/structllvm_1_1Value.html">Value Class</a>
+ and <a href="/doxygen/classllvm_1_1User.html">User Class</a>, respectively, for more
+ information.</p>
+ 
+ <!-- Value::replaceAllUsesWith User::replaceUsesOfWith Point out:
+ include/llvm/Transforms/Utils/ especially BasicBlockUtils.h with:
+ ReplaceInstWithValue, ReplaceInstWithInst -->
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="advanced">Advanced Topics</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>
+ This section describes some of the advanced or obscure APIs that most clients
+ do not need to be aware of.  These APIs tend to manage the inner workings of the
+ LLVM system, and only need to be accessed in unusual circumstances.
+ </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="TypeResolve">LLVM Type Resolution</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ The LLVM type system has a very simple goal: allow clients to compare types for
+ structural equality with a simple pointer comparison (aka a shallow compare).
+ This goal makes clients much simpler and faster, and is used throughout the LLVM
+ system.
+ </p>
+ 
+ <p>
+ Unfortunately, achieving this goal is not a simple matter.  In particular,
+ recursive types and late resolution of opaque types make the situation very
+ difficult to handle.  Fortunately, for the most part, our implementation lets
+ most clients remain completely unaware of the nasty internal details.  The
+ primary case where clients are exposed to its inner workings is when
+ building a recursive type.  In addition to this case, the LLVM bytecode reader,
+ assembly parser, and linker also have to be aware of the inner workings of this
+ system.
+ </p>
+ 
+ <p>
+ For our purposes below, we need three concepts.  First, an "Opaque Type" is 
+ exactly as defined in the <a href="LangRef.html#t_opaque">language 
+ reference</a>.  Second, an "Abstract Type" is any type which includes an 
+ opaque type as part of its type graph (for example "<tt>{ opaque, int }</tt>").
+ Third, a "Concrete Type" is a type that is not an abstract type (e.g. "<tt>{ int, 
+ float }</tt>").
+ </p>
+ 
+ </div>
+ 
+ <!-- ______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="BuildRecType">Basic Recursive Type Construction</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ Because the most common question is "how do I build a recursive type with LLVM",
+ we answer it now and explain it as we go.  Here we include enough to cause this
+ to be emitted to an output .ll file:
+ </p>
+ 
+ <pre>
+    %mylist = type { %mylist*, int }
+ </pre>
+ 
+ <p>
+ To build this, use the following LLVM APIs:
+ </p>
+ 
+ <pre>
+   //<i> Create the initial outer struct.</i>
+   <a href="#PATypeHolder">PATypeHolder</a> StructTy = OpaqueType::get();
+   std::vector<const Type*> Elts;
+   Elts.push_back(PointerType::get(StructTy));
+   Elts.push_back(Type::IntTy);
+   StructType *NewSTy = StructType::get(Elts);
+ 
+   //<i> At this point, NewSTy = "{ opaque*, int }". Tell VMCore that</i>
+   //<i> the struct and the opaque type are actually the same.</i>
+   cast<OpaqueType>(StructTy.get())-><a href="#refineAbstractTypeTo">refineAbstractTypeTo</a>(NewSTy);
+ 
+   // <i>NewSTy is potentially invalidated, but StructTy (a <a href="#PATypeHolder">PATypeHolder</a>) is</i>
+   // <i>kept up-to-date.</i>
+   NewSTy = cast<StructType>(StructTy.get());
+ 
+   // <i>Add a name for the type to the module symbol table (optional).</i>
+   MyModule->addTypeName("mylist", NewSTy);
+ </pre>
+ 
+ <p>
+ This code shows the basic approach used to build recursive types: build a
+ non-recursive type using 'opaque', then use type unification to close the cycle.
+ The type unification step is performed by the <tt><a
+ href="#refineAbstractTypeTo">refineAbstractTypeTo</a></tt> method, which is
+ described next.  After that, we describe the <a
+ href="#PATypeHolder">PATypeHolder class</a>.
+ </p>
+ 
+ </div>
+ 
+ <!-- ______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="refineAbstractTypeTo">The <tt>refineAbstractTypeTo</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ The <tt>refineAbstractTypeTo</tt> method starts the type unification process.
+ While this method is actually a member of the DerivedType class, it is most
+ often used on OpaqueType instances.  Type unification is actually a recursive
+ process.  After unification, types can become structurally isomorphic to
+ existing types, and all duplicates are deleted (to preserve pointer equality).
+ </p>
+ 
+ <p>
+ In the example above, the OpaqueType object is definitely deleted.
+ Additionally, if there is an "{ \2*, int}" type already created in the system,
+ the pointer and struct type created are <b>also</b> deleted.  Obviously whenever
+ a type is deleted, any "Type*" pointers in the program are invalidated.  As
+ such, it is safest to avoid having <i>any</i> "Type*" pointers to abstract types
+ live across a call to <tt>refineAbstractTypeTo</tt> (note that non-abstract
+ types can never move or be deleted).  To deal with this, the <a
+ href="#PATypeHolder">PATypeHolder</a> class is used to maintain a stable
+ reference to a possibly refined type, and the <a
+ href="#AbstractTypeUser">AbstractTypeUser</a> class is used to update more
+ complex data structures.
+ </p>
+ 
+ </div>
+ 
+ <!-- ______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="PATypeHolder">The PATypeHolder Class</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ PATypeHolder is a form of a "smart pointer" for Type objects.  When VMCore
+ happily goes about nuking types that become isomorphic to existing types, it
+ automatically updates all PATypeHolder objects to point to the new type.  In the
+ example above, this allows the code to maintain a pointer to the resultant
+ resolved recursive type, even though the Type*'s are potentially invalidated.
+ </p>
+ 
+ <p>
+ PATypeHolder is an extremely light-weight object that uses a lazy union-find
+ implementation to update pointers.  For example the pointer from a Value to its
+ Type is maintained by PATypeHolder objects.
+ </p>
+ 
+ </div>
+ 
+ <!-- ______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="AbstractTypeUser">The AbstractTypeUser Class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ Some data structures need to perform more complex updates when types get
+ resolved.  The <a href="#SymbolTable">SymbolTable</a> class, for example, needs
+ to move and potentially merge type planes in its representation when a pointer
+ changes.</p>
+ 
+ <p>
+ To support this, a class can derive from the AbstractTypeUser class.  This class
+ allows it to get callbacks when certain types are resolved.  To register to get
+ callbacks for a particular type, the DerivedType::{add/remove}AbstractTypeUser
+ methods can be called on a type.  Note that these methods only work for <i>
+ abstract</i> types.  Concrete types (those that do not include an opaque object
+ somewhere) can never be refined.
+ </p>
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="SymbolTable">The <tt>SymbolTable</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>This class provides a symbol table that the <a
+ href="#Function"><tt>Function</tt></a> and <a href="#Module">
+ <tt>Module</tt></a> classes use for naming definitions. The symbol table can
+ provide a name for any <a href="#Value"><tt>Value</tt></a> or <a
+ href="#Type"><tt>Type</tt></a>.  <tt>SymbolTable</tt> is an abstract data
+ type. It hides the data it contains and provides access to it through a
+ controlled interface.</p>
+ 
+ <p>Note that the symbol table class should not be directly accessed by most
+ clients.  It should only be used when iteration over the symbol table names
+ themselves is required, which is very special purpose.  Note that not all LLVM
+ <a href="#Value">Value</a>s have names, and those without names (i.e. they have
+ an empty name) do not exist in the symbol table.
+ </p>
+ 
+ <p>To use the <tt>SymbolTable</tt> well, you need to understand the 
+ structure of the information it holds. The class contains two 
+ <tt>std::map</tt> objects. The first, <tt>pmap</tt>, is a map of 
+ <tt>Type*</tt> to maps of name (<tt>std::string</tt>) to <tt>Value*</tt>. 
+ The second, <tt>tmap</tt>, is a map of names to <tt>Type*</tt>. Thus, Values
+ are stored in two-dimensions and accessed by <tt>Type</tt> and name. Types,
+ however, are stored in a single dimension and accessed only by name.</p>
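+ 
+ <p>The two-map layout described above can be sketched as follows. This is a
+ simplified illustration, not the real class: <tt>ToyType</tt>,
+ <tt>ToyValue</tt>, <tt>ToySymbolTable</tt> and <tt>symtabExample</tt> are
+ hypothetical, and only the shape of <tt>pmap</tt> and <tt>tmap</tt> follows
+ the description.</p>

```cpp
#include <cassert>
#include <map>
#include <string>

struct ToyType {};   // hypothetical stand-in for Type
struct ToyValue {};  // hypothetical stand-in for Value

class ToySymbolTable {
  typedef std::map<std::string, ToyValue*> ValueMap;
  typedef std::map<const ToyType*, ValueMap> PlaneMap;
  typedef std::map<std::string, const ToyType*> TypeMap;
  PlaneMap pmap;  // Type* -> (name -> Value*): one "plane" per type
  TypeMap tmap;   // name -> Type*
public:
  void insert(const ToyType *ty, const std::string &name, ToyValue *v) {
    pmap[ty][name] = v;
  }
  void insert(const std::string &name, const ToyType *ty) { tmap[name] = ty; }

  // Values need both coordinates: the type plane and the name.
  ToyValue *lookup(const ToyType *ty, const std::string &name) const {
    PlaneMap::const_iterator p = pmap.find(ty);
    if (p == pmap.end()) return 0;
    ValueMap::const_iterator v = p->second.find(name);
    return v == p->second.end() ? 0 : v->second;
  }
  // Types need only the name.
  const ToyType *lookupType(const std::string &name) const {
    TypeMap::const_iterator t = tmap.find(name);
    return t == tmap.end() ? 0 : t->second;
  }
};

// Small usage example: the same name can live in two different type planes.
inline void symtabExample() {
  ToyType intTy, floatTy;
  ToyValue a, b;
  ToySymbolTable st;
  st.insert(&intTy, "x", &a);
  st.insert(&floatTy, "x", &b);
  assert(st.lookup(&intTy, "x") == &a);
  assert(st.lookup(&floatTy, "x") == &b);
}
```

+ <p>In this sketch a lookup that misses returns a null pointer, matching the
+ behaviour documented for <tt>lookup</tt> and <tt>lookupType</tt> below.</p>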
+ 
+ <p>The interface of this class provides three basic types of operations:
+ <ol>
+   <li><em>Accessors</em>. Accessors provide read-only access to information
+   such as finding a value for a name with the 
+   <a href="#SymbolTable_lookup">lookup</a> method.</li> 
+   <li><em>Mutators</em>. Mutators allow the user to add information to the
+   <tt>SymbolTable</tt> with methods like 
+   <a href="#SymbolTable_insert"><tt>insert</tt></a>.</li>
+   <li><em>Iterators</em>. Iterators allow the user to traverse the content
+   of the symbol table in well defined ways, such as the method
+   <a href="#SymbolTable_type_begin"><tt>type_begin</tt></a>.</li>
+ </ol>
+ 
+ <h3>Accessors</h3>
+ <dl>
+   <dt><tt>Value* lookup(const Type* Ty, const std::string& name) const</tt>:
+   </dt>
+   <dd>The <tt>lookup</tt> method searches the type plane given by the
+   <tt>Ty</tt> parameter for a <tt>Value</tt> with the provided <tt>name</tt>.
+   If a suitable <tt>Value</tt> is not found, null is returned.</dd>
+ 
+   <dt><tt>Type* lookupType( const std::string& name) const</tt>:</dt>
+   <dd>The <tt>lookupType</tt> method searches through the types for a
+   <tt>Type</tt> with the provided <tt>name</tt>. If a suitable <tt>Type</tt>
+   is not found, null is returned.</dd>
+ 
+   <dt><tt>bool hasTypes() const</tt>:</dt>
+   <dd>This function returns true if an entry has been made into the type
+   map.</dd>
+ 
+   <dt><tt>bool isEmpty() const</tt>:</dt>
+   <dd>This function returns true if both the value and type maps are
+   empty.</dd>
+ </dl>
+ 
+ <h3>Mutators</h3>
+ <dl>
+   <dt><tt>void insert(Value *Val)</tt>:</dt>
+   <dd>This method adds the provided value to the symbol table.  The Value must
+   have both a name and a type which are extracted and used to place the value
+   in the correct type plane under the value's name.</dd>
+ 
+   <dt><tt>void insert(const std::string& Name, Value *Val)</tt>:</dt>
+   <dd> Inserts a constant or type into the symbol table with the specified
+   name. There can be a many to one mapping between names and constants
+   or types.</dd>
+ 
+   <dt><tt>void insert(const std::string& Name, Type *Typ)</tt>:</dt>
+   <dd> Inserts a type into the symbol table with the specified name. There
+   can be a many-to-one mapping between names and types. This method
+   allows a type with an existing entry in the symbol table to get
+   a new name.</dd>
+ 
+   <dt><tt>void remove(Value* Val)</tt>:</dt>
+  <dd> This method removes a named value from the symbol table. The
+   type and name of the Value are extracted from <tt>Val</tt> and used to
+   look up the Value in the correct type plane. If the Value is
+   not in the symbol table, this method silently ignores the
+   request.</dd>
+ 
+   <dt><tt>void remove(Type* Typ)</tt>:</dt>
+   <dd> This method removes a named type from the symbol table. The
+   name of the type is extracted from <tt>Typ</tt> and used to look up
+   the Type in the type map. If the Type is not in the symbol
+   table, this method silently ignores the request.</dd>
+ 
+   <dt><tt>Value* remove(const std::string& Name, Value *Val)</tt>:</dt>
+   <dd> Remove a constant or type with the specified name from the 
+   symbol table.</dd>
+ 
+   <dt><tt>Type* remove(const std::string& Name, Type* T)</tt>:</dt>
+   <dd> Remove a type with the specified name from the symbol table.
+   Returns the removed Type.</dd>
+ 
+   <dt><tt>Value *value_remove(const value_iterator& It)</tt>:</dt>
+   <dd> Removes a specific value from the symbol table. 
+   Returns the removed value.</dd>
+ 
+   <dt><tt>bool strip()</tt>:</dt>
+   <dd> This method will strip the symbol table of its names leaving
+   the type and values. </dd>
+ 
+   <dt><tt>void clear()</tt>:</dt>
+   <dd>Empty the symbol table completely.</dd>
+ </dl>
+ 
+ <h3>Iteration</h3>
+ <p>The following functions return the beginning or end of a sequence for
+ three kinds of iterators, each available in const and non-const form. It is
+ important to keep track of the different kinds of iterators. There are
+ three idioms worth pointing out:</p>
+ <table>
+   <tr><th>Units</th><th>Iterator</th><th>Idiom</th></tr>
+   <tr>
+     <td align="left">Planes Of name/Value maps</td><td>PI</td>
+     <td align="left"><pre><tt>
+ for (SymbolTable::plane_const_iterator PI = ST.plane_begin(),
+      PE = ST.plane_end(); PI != PE; ++PI )
+   PI->first // This is the Type* of the plane
+   PI->second // This is the SymbolTable::ValueMap of name/Value pairs
+     </tt></pre></td>
+   </tr>
+   <tr>
+     <td align="left">All name/Type Pairs</td><td>TI</td>
+     <td align="left"><pre><tt>
+ for (SymbolTable::type_const_iterator TI = ST.type_begin(),
+      TE = ST.type_end(); TI != TE; ++TI )
+   TI->first  // This is the name of the type
+   TI->second // This is the Type* value associated with the name
+     </tt></pre></td>
+   </tr>
+   <tr>
+     <td align="left">name/Value pairs in a plane</td><td>VI</td>
+     <td align="left"><pre><tt>
+ for (SymbolTable::value_const_iterator VI = ST.value_begin(SomeType),
+      VE = ST.value_end(SomeType); VI != VE; ++VI )
+   VI->first  // This is the name of the Value
+   VI->second // This is the Value* value associated with the name
+     </tt></pre></td>
+   </tr>
+ </table>
+ 
+ <p>Using the recommended iterator names and idioms will help you avoid
+ making mistakes. Of particular note, whenever you use
+ value_begin(SomeType), make sure you always compare the resulting iterator
+ with value_end(SomeType), not value_end(SomeOtherType), or else you 
+ will loop infinitely.</p>
+ 
+ <dl>
+ 
+   <dt><tt>plane_iterator plane_begin()</tt>:</dt>
+   <dd>Get an iterator that starts at the beginning of the type planes.
+   The iterator will iterate over the Type/ValueMap pairs in the
+   type planes. </dd>
+ 
+   <dt><tt>plane_const_iterator plane_begin() const</tt>:</dt>
+   <dd>Get a const_iterator that starts at the beginning of the type 
+   planes.  The iterator will iterate over the Type/ValueMap pairs 
+   in the type planes. </dd>
+ 
+   <dt><tt>plane_iterator plane_end()</tt>:</dt>
+   <dd>Get an iterator at the end of the type planes. This serves as
+   the marker for end of iteration over the type planes.</dd>
+ 
+   <dt><tt>plane_const_iterator plane_end() const</tt>:</dt>
+   <dd>Get a const_iterator at the end of the type planes. This serves as
+   the marker for end of iteration over the type planes.</dd>
+ 
+   <dt><tt>value_iterator value_begin(const Type *Typ)</tt>:</dt>
+   <dd>Get an iterator that starts at the beginning of a type plane.
+   The iterator will iterate over the name/value pairs in the type plane.
+   Note: The type plane must already exist before using this.</dd>
+ 
+   <dt><tt>value_const_iterator value_begin(const Type *Typ) const</tt>:</dt>
+   <dd>Get a const_iterator that starts at the beginning of a type plane.
+   The iterator will iterate over the name/value pairs in the type plane.
+   Note: The type plane must already exist before using this.</dd>
+ 
+   <dt><tt>value_iterator value_end(const Type *Typ)</tt>:</dt>
+   <dd>Get an iterator to the end of a type plane. This serves as the marker
+   for end of iteration of the type plane.
+   Note: The type plane must already exist before using this.</dd>
+ 
+   <dt><tt>value_const_iterator value_end(const Type *Typ) const</tt>:</dt>
+   <dd>Get a const_iterator to the end of a type plane. This serves as the
+   marker for end of iteration of the type plane.
+   Note: the type plane must already exist before using this.</dd>
+ 
+   <dt><tt>type_iterator type_begin()</tt>:</dt>
+   <dd>Get an iterator to the start of the name/Type map.</dd>
+ 
+   <dt><tt>type_const_iterator type_begin() const</tt>:</dt>
+   <dd> Get a const_iterator to the start of the name/Type map.</dd>
+ 
+   <dt><tt>type_iterator type_end()</tt>:</dt>
+   <dd>Get an iterator to the end of the name/Type map. This serves as the
+   marker for end of iteration of the types.</dd>
+ 
+   <dt><tt>type_const_iterator type_end() const</tt>:</dt>
+   <dd>Get a const-iterator to the end of the name/Type map. This serves 
+   as the marker for end of iteration of the types.</dd>
+ 
+   <dt><tt>plane_const_iterator find(const Type* Typ) const</tt>:</dt>
+   <dd>This method returns a plane_const_iterator for iteration over
+   the type planes starting at a specific plane, given by <tt>Typ</tt>.</dd>
+ 
+   <dt><tt>plane_iterator find(const Type* Typ)</tt>:</dt>
+   <dd>This method returns a plane_iterator for iteration over the
+   type planes starting at a specific plane, given by <tt>Typ</tt>.</dd>
+ 
+ </dl>
+ </div>
+ 
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="coreclasses">The Core LLVM Class Hierarchy Reference </a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The Core LLVM classes are the primary means of representing the program
+ being inspected or transformed.  The core LLVM classes are defined in
+ header files in the <tt>include/llvm/</tt> directory, and implemented in
+ the <tt>lib/VMCore</tt> directory.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Value">The <tt>Value</tt> class</a>
+ </div>
+ 
+ <div>
+ 
+ <p><tt>#include "<a href="/doxygen/Value_8h-source.html">llvm/Value.h</a>"</tt>
+ <br> 
+ doxygen info: <a href="/doxygen/structllvm_1_1Value.html">Value Class</a></p>
+ 
+ <p>The <tt>Value</tt> class is the most important class in the LLVM source
+ base.  It represents a typed value that may be used (among other things) as an
+ operand to an instruction.  There are many different types of <tt>Value</tt>s,
+ such as <a href="#Constant"><tt>Constant</tt></a>s and <a
+ href="#Argument"><tt>Argument</tt></a>s. Even <a
+ href="#Instruction"><tt>Instruction</tt></a>s and <a
+ href="#Function"><tt>Function</tt></a>s are <tt>Value</tt>s.</p>
+ 
+ <p>A particular <tt>Value</tt> may be used many times in the LLVM representation
+ for a program.  For example, an incoming argument to a function (represented
+ with an instance of the <a href="#Argument">Argument</a> class) is "used" by
+ every instruction in the function that references the argument.  To keep track
+ of this relationship, the <tt>Value</tt> class keeps a list of all of the <a
+ href="#User"><tt>User</tt></a>s that are using it (the <a
+ href="#User"><tt>User</tt></a> class is a base class for all nodes in the LLVM
+ graph that can refer to <tt>Value</tt>s).  This use list is how LLVM represents
+ def-use information in the program, and is accessible through the <tt>use_</tt>*
+ methods, shown below.</p>
+ 
+ <p>Because LLVM is a typed representation, every LLVM <tt>Value</tt> is typed,
+ and this <a href="#Type">Type</a> is available through the <tt>getType()</tt>
+ method. In addition, all LLVM values can be named.  The "name" of the
+ <tt>Value</tt> is a symbolic string printed in the LLVM code:</p>
+ 
+   <pre>   %<b>foo</b> = add int 1, 2<br></pre>
+ 
+ <p><a name="#nameWarning">The name of this instruction is "foo".</a> <b>NOTE</b>
+ that the name of any value may be missing (an empty string), so names should
+ <b>ONLY</b> be used for debugging (making the source code easier to read,
+ debugging printouts); they should not be used to keep track of values or to map
+ between them.  For this purpose, use a <tt>std::map</tt> of pointers to the
+ <tt>Value</tt> itself instead.</p>
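+ <p>As a rough illustration of this advice, the following self-contained sketch
+ keys a <tt>std::map</tt> on pointers rather than on names.  The <tt>Value</tt>
+ struct here is only a stand-in (not the real LLVM class), and
+ <tt>numberValues</tt> is a hypothetical helper invented for the example:</p>

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Stand-in for llvm::Value; only the (possibly empty) name matters here.
struct Value {
  std::string Name;
};

// Hypothetical helper: give each value a dense number, keyed by its address.
// Keying by Name instead would merge all unnamed values into one entry.
std::map<const Value*, unsigned> numberValues(const std::vector<Value*> &Vals) {
  std::map<const Value*, unsigned> Numbering;
  for (const Value *V : Vals) {
    assert(V && "cannot number a null value");
    unsigned Next = static_cast<unsigned>(Numbering.size());
    Numbering.insert(std::make_pair(V, Next));  // no-op if V is already numbered
  }
  return Numbering;
}
```

+ <p>Two unnamed values receive distinct numbers in this sketch, which a map
+ keyed on the name string could not guarantee.</p>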
+ 
+ <p>One important aspect of LLVM is that there is no distinction between an SSA
+ variable and the operation that produces it.  Because of this, any reference to
+ the value produced by an instruction (or the value available as an incoming
+ argument, for example) is represented as a direct pointer to the instance of
+ the class that
+ represents this value.  Although this may take some getting used to, it
+ simplifies the representation and makes it easier to manipulate.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Value">Important Public Members of the <tt>Value</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li><tt>Value::use_iterator</tt> - Typedef for iterator over the
+ use-list<br>
+     <tt>Value::use_const_iterator</tt> - Typedef for const_iterator over
+ the use-list<br>
+     <tt>unsigned use_size()</tt> - Returns the number of users of the
+ value.<br>
+     <tt>bool use_empty()</tt> - Returns true if there are no users.<br>
+     <tt>use_iterator use_begin()</tt> - Get an iterator to the start of
+ the use-list.<br>
+     <tt>use_iterator use_end()</tt> - Get an iterator to the end of the
+ use-list.<br>
+     <tt><a href="#User">User</a> *use_back()</tt> - Returns the last
+ element in the list.
+     <p> These methods are the interface to access the def-use
+ information in LLVM.  As with all other iterators in LLVM, the naming
+ conventions follow the conventions defined by the <a href="#stl">STL</a>.</p>
+   </li>
+   <li><tt><a href="#Type">Type</a> *getType() const</tt>
+     <p>This method returns the Type of the Value.</p>
+   </li>
+   <li><tt>bool hasName() const</tt><br>
+     <tt>std::string getName() const</tt><br>
+     <tt>void setName(const std::string &Name)</tt>
+     <p> This family of methods is used to access and assign a name to a <tt>Value</tt>.
+ Be aware of the <a href="#nameWarning">precaution above</a>.</p>
+   </li>
+   <li><tt>void replaceAllUsesWith(Value *V)</tt>
+ 
+     <p>This method traverses the use list of a <tt>Value</tt> changing all <a
+     href="#User"><tt>User</tt>s</a> of the current value to refer to
+     "<tt>V</tt>" instead.  For example, if you detect that an instruction always
+     produces a constant value (for example through constant folding), you can
+     replace all uses of the instruction with the constant like this:</p>
+ 
+     <pre>  Inst->replaceAllUsesWith(ConstVal);<br></pre>
+ </ul>
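+ <p>To make the use-list bookkeeping concrete, here is a minimal toy model of
+ the idea, assuming simplified <tt>Value</tt> and <tt>User</tt> structs built on
+ <tt>std::vector</tt>.  It is not how LLVM stores its use lists; the
+ <tt>addOperand</tt> helper and the public data members are assumptions made
+ for the sketch:</p>

```cpp
#include <cassert>
#include <vector>

struct User;

// Toy stand-in for llvm::Value: it records every User that refers to it.
struct Value {
  std::vector<User*> Uses;  // the "use list" (def-use edges)
  virtual ~Value() {}
  bool use_empty() const { return Uses.empty(); }
  unsigned use_size() const { return static_cast<unsigned>(Uses.size()); }
  void replaceAllUsesWith(Value *New);
};

// Toy stand-in for llvm::User: a Value that also has operands.
struct User : Value {
  std::vector<Value*> Operands;  // use-def edges
  // Hypothetical helper that keeps both directions in sync.
  void addOperand(Value *V) {
    assert(V && "operand must not be null");
    Operands.push_back(V);
    V->Uses.push_back(this);
  }
};

// Redirect every operand slot that points at this value to New instead.
void Value::replaceAllUsesWith(Value *New) {
  assert(New && New != this && "need a distinct replacement value");
  for (User *U : Uses)
    for (Value *&Op : U->Operands)
      if (Op == this) {
        Op = New;
        New->Uses.push_back(U);
      }
  Uses.clear();
}
```

+ <p>After the call, the old value has an empty use list and every former user
+ refers to the replacement, which mirrors the behaviour described for
+ <tt>replaceAllUsesWith</tt> above.</p>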
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="User">The <tt>User</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+   
+ <p>
+ <tt>#include "<a href="/doxygen/User_8h-source.html">llvm/User.h</a>"</tt><br>
+ doxygen info: <a href="/doxygen/classllvm_1_1User.html">User Class</a><br>
+ Superclass: <a href="#Value"><tt>Value</tt></a></p>
+ 
+ <p>The <tt>User</tt> class is the common base class of all LLVM nodes that may
+ refer to <a href="#Value"><tt>Value</tt></a>s.  It exposes a list of "Operands"
+ that are all of the <a href="#Value"><tt>Value</tt></a>s that the User is
+ referring to.  The <tt>User</tt> class itself is a subclass of
+ <tt>Value</tt>.</p>
+ 
+ <p>The operands of a <tt>User</tt> point directly to the LLVM <a
+ href="#Value"><tt>Value</tt></a> that it refers to.  Because LLVM uses Static
+ Single Assignment (SSA) form, there can only be one definition referred to,
+ allowing this direct connection.  This connection provides the use-def
+ information in LLVM.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_User">Important Public Members of the <tt>User</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>User</tt> class exposes the operand list in two ways: through
+ an index access interface and through an iterator based interface.</p>
+ 
+ <ul>
+   <li><tt>Value *getOperand(unsigned i)</tt><br>
+     <tt>unsigned getNumOperands()</tt>
+     <p> These two methods expose the operands of the <tt>User</tt> in a
+ convenient form for direct access.</p></li>
+ 
+   <li><tt>User::op_iterator</tt> - Typedef for iterator over the operand
+ list<br>
+     <tt>op_iterator op_begin()</tt> - Get an iterator to the start of 
+ the operand list.<br>
+     <tt>op_iterator op_end()</tt> - Get an iterator to the end of the
+ operand list.
+     <p> Together, these methods make up the iterator based interface to
+ the operands of a <tt>User</tt>.</p></li>
+ </ul>
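+ <p>The two access styles can be compared side by side in a small toy model.
+ The <tt>User</tt> class below is a simplified stand-in that stores its operands
+ in a <tt>std::vector</tt>; <tt>addOperand</tt>, <tt>sumByIndex</tt>,
+ <tt>sumByIterator</tt> and the <tt>Id</tt> field are hypothetical names used
+ only for this sketch:</p>

```cpp
#include <cassert>
#include <vector>

struct Value { int Id; };

// Toy stand-in for llvm::User exposing both operand interfaces.
class User {
  std::vector<Value*> Operands;

public:
  typedef std::vector<Value*>::iterator op_iterator;

  void addOperand(Value *V) { Operands.push_back(V); }

  // Index-based interface.
  unsigned getNumOperands() const {
    return static_cast<unsigned>(Operands.size());
  }
  Value *getOperand(unsigned i) const {
    assert(i < Operands.size() && "operand index out of range");
    return Operands[i];
  }

  // Iterator-based interface.
  op_iterator op_begin() { return Operands.begin(); }
  op_iterator op_end() { return Operands.end(); }
};

// Visit the operands through the index interface.
int sumByIndex(const User &U) {
  int Sum = 0;
  for (unsigned i = 0, e = U.getNumOperands(); i != e; ++i)
    Sum += U.getOperand(i)->Id;
  return Sum;
}

// Visit the same operands through the iterator interface.
int sumByIterator(User &U) {
  int Sum = 0;
  for (User::op_iterator I = U.op_begin(), E = U.op_end(); I != E; ++I)
    Sum += (*I)->Id;
  return Sum;
}
```

+ <p>Both loops visit the same operands in the same order; which one to use is
+ largely a matter of convenience.</p>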
+ 
+ </div>    
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Instruction">The <tt>Instruction</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include "</tt><tt><a
+ href="/doxygen/Instruction_8h-source.html">llvm/Instruction.h</a>"</tt><br>
+ doxygen info: <a href="/doxygen/classllvm_1_1Instruction.html">Instruction Class</a><br>
+ Superclasses: <a href="#User"><tt>User</tt></a>, <a
+ href="#Value"><tt>Value</tt></a></p>
+ 
+ <p>The <tt>Instruction</tt> class is the common base class for all LLVM
+ instructions.  It provides only a few methods, but is a very commonly used
+ class.  The primary data tracked by the <tt>Instruction</tt> class itself is the
+ opcode (instruction type) and the parent <a
+ href="#BasicBlock"><tt>BasicBlock</tt></a> the <tt>Instruction</tt> is embedded
+ into.  To represent a specific type of instruction, one of many subclasses of
+ <tt>Instruction</tt> is used.</p>
+ 
+ <p> Because the <tt>Instruction</tt> class subclasses the <a
+ href="#User"><tt>User</tt></a> class, its operands can be accessed in the same
+ way as for other <a href="#User"><tt>User</tt></a>s (with the
+ <tt>getOperand()</tt>/<tt>getNumOperands()</tt> and
+ <tt>op_begin()</tt>/<tt>op_end()</tt> methods).</p>
+ 
+ <p>An important file for
+ the <tt>Instruction</tt> class is the <tt>llvm/Instruction.def</tt> file. This
+ file contains some meta-data about the various different types of instructions
+ in LLVM.  It describes the enum values that are used as opcodes (for example
+ <tt>Instruction::Add</tt> and <tt>Instruction::SetLE</tt>), as well as the
+ concrete sub-classes of <tt>Instruction</tt> that implement the instruction (for
+ example <tt><a href="#BinaryOperator">BinaryOperator</a></tt> and <tt><a
+ href="#SetCondInst">SetCondInst</a></tt>).  Unfortunately, the use of macros in
+ this file confuses doxygen, so these enum values don't show up correctly in the
+ <a href="/doxygen/classllvm_1_1Instruction.html">doxygen output</a>.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Instruction">Important Public Members of the <tt>Instruction</tt>
+   class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li><tt><a href="#BasicBlock">BasicBlock</a> *getParent()</tt>
+     <p>Returns the <a href="#BasicBlock"><tt>BasicBlock</tt></a> that
+ this  <tt>Instruction</tt> is embedded into.</p></li>
+   <li><tt>bool mayWriteToMemory()</tt>
+     <p>Returns true if the instruction writes to memory, i.e. it is a
+       <tt>call</tt>, <tt>free</tt>, <tt>invoke</tt>, or <tt>store</tt>.</p></li>
+   <li><tt>unsigned getOpcode()</tt>
+     <p>Returns the opcode for the <tt>Instruction</tt>.</p></li>
+   <li><tt><a href="#Instruction">Instruction</a> *clone() const</tt>
+     <p>Returns another instance of the specified instruction, identical
+ in all ways to the original except that the instruction has no parent
+ (i.e. it is not embedded into a <a href="#BasicBlock"><tt>BasicBlock</tt></a>),
+ and it has no name.</p></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="BasicBlock">The <tt>BasicBlock</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include "<a
+ href="/doxygen/BasicBlock_8h-source.html">llvm/BasicBlock.h</a>"</tt><br>
+ doxygen info: <a href="/doxygen/structllvm_1_1BasicBlock.html">BasicBlock
+ Class</a><br>
+ Superclass: <a href="#Value"><tt>Value</tt></a></p>
+ 
+ <p>This class represents a single-entry, single-exit section of the code,
+ commonly known as a basic block by the compiler community.  The
+ <tt>BasicBlock</tt> class maintains a list of <a
+ href="#Instruction"><tt>Instruction</tt></a>s, which form the body of the block.
+ Matching the language definition, the last element of this list of instructions
+ is always a terminator instruction (a subclass of the <a
+ href="#TerminatorInst"><tt>TerminatorInst</tt></a> class).</p>
+ 
+ <p>In addition to tracking the list of instructions that make up the block, the
+ <tt>BasicBlock</tt> class also keeps track of the <a
+ href="#Function"><tt>Function</tt></a> that it is embedded into.</p>
+ 
+ <p>Note that <tt>BasicBlock</tt>s themselves are <a
+ href="#Value"><tt>Value</tt></a>s, because they are referenced by instructions
+ like branches and can go in the switch tables. <tt>BasicBlock</tt>s have type
+ <tt>label</tt>.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_BasicBlock">Important Public Members of the <tt>BasicBlock</tt>
+   class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li><tt>BasicBlock(const std::string &Name = "", </tt><tt><a
+  href="#Function">Function</a> *Parent = 0)</tt>
+ 
+ <p>The <tt>BasicBlock</tt> constructor is used to create new basic blocks for
+ insertion into a function.  The constructor optionally takes a name for the new
+ block, and a <a href="#Function"><tt>Function</tt></a> to insert it into.  If
+ the <tt>Parent</tt> parameter is specified, the new <tt>BasicBlock</tt> is
+ automatically inserted at the end of the specified <a
+ href="#Function"><tt>Function</tt></a>; if not specified, the BasicBlock must be
+ manually inserted into the <a href="#Function"><tt>Function</tt></a>.</p></li>
+ 
+ <li><tt>BasicBlock::iterator</tt> - Typedef for instruction list iterator<br>
+ <tt>BasicBlock::const_iterator</tt> - Typedef for const_iterator.<br>
+ <tt>begin()</tt>, <tt>end()</tt>, <tt>front()</tt>, <tt>back()</tt>,
+ <tt>size()</tt>, <tt>empty()</tt> - STL-style functions for accessing the
+ instruction list.
+ 
+ <p>These methods and typedefs are forwarding functions that have the same
+ semantics as the standard library methods of the same names.  These methods
+ expose the underlying instruction list of a basic block in a way that is easy to
+ manipulate.  To get the full complement of container operations (including
+ operations to update the list), you must use the <tt>getInstList()</tt>
+ method.</p></li>
+ 
+ <li><tt>BasicBlock::InstListType &getInstList()</tt>
+ 
+ <p>This method is used to get access to the underlying container that actually
+ holds the Instructions.  This method must be used when there isn't a forwarding
+ function in the <tt>BasicBlock</tt> class for the operation that you would like
+ to perform.  Because there are no forwarding functions for "updating"
+ operations, you need to use this if you want to update the contents of a
+ <tt>BasicBlock</tt>.</p></li>
+ 
+ <li><tt><a href="#Function">Function</a> *getParent()</tt>
+ 
+ <p> Returns a pointer to the <a href="#Function"><tt>Function</tt></a> the block is
+ embedded into, or a null pointer if it is homeless.</p></li>
+ 
+ <li><tt><a href="#TerminatorInst">TerminatorInst</a> *getTerminator()</tt>
+ 
+ <p> Returns a pointer to the terminator instruction that appears at the end of
+ the <tt>BasicBlock</tt>.  If there is no terminator instruction, or if the last
+ instruction in the block is not a terminator, then a null pointer is
+ returned.</p></li>
+ 
+ </ul>
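+ <p>The split between forwarding functions, <tt>getInstList()</tt>, and
+ <tt>getTerminator()</tt> can be sketched with a toy model.  The classes below
+ are simplified stand-ins built on <tt>std::list</tt>; the
+ <tt>IsTerminator</tt> flag and the <tt>appendInst</tt> helper are assumptions
+ made for the example and are not part of LLVM:</p>

```cpp
#include <cassert>
#include <list>

// Toy instruction: a flag stands in for "is a subclass of TerminatorInst".
struct Instruction {
  bool IsTerminator;
  explicit Instruction(bool T) : IsTerminator(T) {}
};

// Toy stand-in for llvm::BasicBlock.
class BasicBlock {
public:
  typedef std::list<Instruction*> InstListType;
  typedef InstListType::iterator iterator;

  // Forwarding functions cover iteration and simple queries...
  iterator begin() { return InstList.begin(); }
  iterator end() { return InstList.end(); }
  bool empty() const { return InstList.empty(); }

  // ...while updates go through the underlying container.
  InstListType &getInstList() { return InstList; }

  // Null if the block is empty or does not end in a terminator.
  Instruction *getTerminator() {
    if (InstList.empty() || !InstList.back()->IsTerminator)
      return 0;
    return InstList.back();
  }

private:
  InstListType InstList;
};

// Hypothetical helper: append via getInstList(), because the toy block has
// no forwarding function for updates.
void appendInst(BasicBlock &BB, Instruction *I) {
  assert(I && "cannot append a null instruction");
  BB.getInstList().push_back(I);
}
```

+ <p>In this model a block that is still under construction simply reports a
+ null terminator until a terminator is appended as its last instruction.</p>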
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="GlobalValue">The <tt>GlobalValue</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include "<a
+ href="/doxygen/GlobalValue_8h-source.html">llvm/GlobalValue.h</a>"</tt><br>
+ doxygen info: <a href="/doxygen/classllvm_1_1GlobalValue.html">GlobalValue
+ Class</a><br>
+ Superclasses: <a href="#Constant"><tt>Constant</tt></a>, 
+ <a href="#User"><tt>User</tt></a>, <a href="#Value"><tt>Value</tt></a></p>
+ 
+ <p>Global values (<a href="#GlobalVariable"><tt>GlobalVariable</tt></a>s or <a
+ href="#Function"><tt>Function</tt></a>s) are the only LLVM values that are
+ visible in the bodies of all <a href="#Function"><tt>Function</tt></a>s.
+ Because they are visible at global scope, they are also subject to linking with
+ other globals defined in different translation units.  To control the linking
+ process, <tt>GlobalValue</tt>s know their linkage rules. Specifically,
+ <tt>GlobalValue</tt>s know whether they have internal or external linkage, as
+ defined by the <tt>LinkageTypes</tt> enumeration.</p>
+ 
+ <p>If a <tt>GlobalValue</tt> has internal linkage (equivalent to being
+ <tt>static</tt> in C), it is not visible to code outside the current translation
+ unit, and does not participate in linking.  If it has external linkage, it is
+ visible to external code, and does participate in linking.  In addition to
+ linkage information, <tt>GlobalValue</tt>s keep track of which <a
+ href="#Module"><tt>Module</tt></a> they are currently part of.</p>
+ 
+ <p>Because <tt>GlobalValue</tt>s are memory objects, they are always referred to
+ by their <b>address</b>. As such, the <a href="#Type"><tt>Type</tt></a> of a
+ global is always a pointer to its contents. It is important to remember this
+ when using the <tt>GetElementPtrInst</tt> instruction because this pointer must
+ be dereferenced first. For example, if you have a <tt>GlobalVariable</tt> (a
+ subclass of <tt>GlobalValue</tt>) that is an array of 24 ints, type <tt>[24 x
+ int]</tt>, then the <tt>GlobalVariable</tt> is a pointer to that array. Although
+ the address of the first element of this array and the value of the
+ <tt>GlobalVariable</tt> are the same, they have different types. The
+ <tt>GlobalVariable</tt>'s type is a pointer to <tt>[24 x int]</tt>, while the
+ address of the first element is a pointer to <tt>int</tt>. Because of this,
+ accessing a global value requires you to
+ dereference the pointer with <tt>GetElementPtrInst</tt> first; then its elements
+ can be accessed. This is explained in the <a href="LangRef.html#globalvars">LLVM
+ Language Reference Manual</a>.</p>
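+ <p>The same distinction exists in plain C++, which may make it easier to
+ picture.  The sketch below is only an analogy and does not use the LLVM API;
+ <tt>Table</tt>, <tt>globalAddress</tt> and <tt>elementAddress</tt> are
+ hypothetical names:</p>

```cpp
#include <cassert>
#include <type_traits>

// A global array of 24 ints, playing the role of a [24 x int] global.
int Table[24];

typedef int (*ArrayPtr)[24];  // pointer to the whole array
typedef int *ElementPtr;      // pointer to a single element

// Referring to the global by address yields a pointer to the whole array.
ArrayPtr globalAddress() { return &Table; }

// One dereference of that pointer, then an index, reaches an element.
ElementPtr elementAddress(unsigned i) {
  assert(i < 24 && "index out of range");
  return &(*globalAddress())[i];
}

// Same numeric address for element 0, but distinct static types.
static_assert(!std::is_same<ArrayPtr, ElementPtr>::value,
              "array pointer and element pointer are different types");
```

+ <p>The extra dereference in <tt>elementAddress</tt> plays roughly the role of
+ the leading dereference step that <tt>GetElementPtrInst</tt> needs when
+ indexing into a global.</p>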
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_GlobalValue">Important Public Members of the <tt>GlobalValue</tt>
+   class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li><tt>bool hasInternalLinkage() const</tt><br>
+     <tt>bool hasExternalLinkage() const</tt><br>
+     <tt>void setInternalLinkage(bool HasInternalLinkage)</tt>
+     <p> These methods manipulate the linkage characteristics of the <tt>GlobalValue</tt>.</p>
+   </li>
+   <li><tt><a href="#Module">Module</a> *getParent()</tt>
+     <p> This returns the <a href="#Module"><tt>Module</tt></a> that the
+ GlobalValue is currently embedded into.</p></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Function">The <tt>Function</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include "<a
+ href="/doxygen/Function_8h-source.html">llvm/Function.h</a>"</tt><br> doxygen
+ info: <a href="/doxygen/classllvm_1_1Function.html">Function Class</a><br>
+ Superclasses: <a href="#GlobalValue"><tt>GlobalValue</tt></a>, 
+ <a href="#Constant"><tt>Constant</tt></a>, 
+ <a href="#User"><tt>User</tt></a>, 
+ <a href="#Value"><tt>Value</tt></a></p>
+ 
+ <p>The <tt>Function</tt> class represents a single procedure in LLVM.  It is
+ actually one of the more complex classes in the LLVM hierarchy because it must
+ keep track of a large amount of data.  The <tt>Function</tt> class keeps track
+ of a list of <a href="#BasicBlock"><tt>BasicBlock</tt></a>s, a list of formal 
+ <a href="#Argument"><tt>Argument</tt></a>s, and a 
+ <a href="#SymbolTable"><tt>SymbolTable</tt></a>.</p>
+ 
+ <p>The list of <a href="#BasicBlock"><tt>BasicBlock</tt></a>s is the most
+ commonly used part of <tt>Function</tt> objects.  The list imposes an implicit
+ ordering of the blocks in the function, which indicates how the code will be
+ laid out by the backend.  Additionally, the first <a
+ href="#BasicBlock"><tt>BasicBlock</tt></a> is the implicit entry node for the
+ <tt>Function</tt>.  It is not legal in LLVM to explicitly branch to this initial
+ block.  There are no implicit exit nodes, and in fact there may be multiple exit
+ nodes from a single <tt>Function</tt>.  If the <a
+ href="#BasicBlock"><tt>BasicBlock</tt></a> list is empty, this indicates that
+ the <tt>Function</tt> is actually a function declaration: the actual body of the
+ function hasn't been linked in yet.</p>
+ 
+ <p>In addition to a list of <a href="#BasicBlock"><tt>BasicBlock</tt></a>s, the
+ <tt>Function</tt> class also keeps track of the list of formal <a
+ href="#Argument"><tt>Argument</tt></a>s that the function receives.  This
+ container manages the lifetime of the <a href="#Argument"><tt>Argument</tt></a>
+ nodes, just like the <a href="#BasicBlock"><tt>BasicBlock</tt></a> list does for
+ the <a href="#BasicBlock"><tt>BasicBlock</tt></a>s.</p>
+ 
+ <p>The <a href="#SymbolTable"><tt>SymbolTable</tt></a> is a very rarely used
+ LLVM feature that is only used when you have to look up a value by name.  Aside
+ from that, the <a href="#SymbolTable"><tt>SymbolTable</tt></a> is used
+ internally to make sure that there are no conflicts between the names of <a
+ href="#Instruction"><tt>Instruction</tt></a>s, <a
+ href="#BasicBlock"><tt>BasicBlock</tt></a>s, or <a
+ href="#Argument"><tt>Argument</tt></a>s in the function body.</p>
+ 
+ <p>Note that <tt>Function</tt> is a <a href="#GlobalValue">GlobalValue</a>
+ and therefore also a <a href="#Constant">Constant</a>. The value of the function
+ is its address (after linking) which is guaranteed to be constant.</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Function">Important Public Members of the <tt>Function</tt>
+   class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li><tt>Function(const </tt><tt><a href="#FunctionType">FunctionType</a>
+   *Ty, LinkageTypes Linkage, const std::string &N = "", Module* Parent = 0)</tt>
+ 
+     <p>Constructor used when you need to create new <tt>Function</tt>s to add
+     to the program.  The constructor must specify the type of the function to
+     create and what type of linkage the function should have. The <a 
+     href="#FunctionType"><tt>FunctionType</tt></a> argument
+     specifies the formal arguments and return value for the function. The same
+     <a href="#FunctionType"><tt>FunctionType</tt></a> value can be used to
+     create multiple functions. The <tt>Parent</tt> argument specifies the Module
+     in which the function is defined. If this argument is provided, the function
+     will automatically be inserted into that module's list of
+     functions.</p></li>
+ 
+   <li><tt>bool isExternal()</tt>
+ 
+     <p>Return whether or not the <tt>Function</tt> has a body defined.  If the
+     function is "external", it does not have a body, and thus must be resolved
+     by linking with a function defined in a different translation unit.</p></li>
+ 
+   <li><tt>Function::iterator</tt> - Typedef for basic block list iterator<br>
+     <tt>Function::const_iterator</tt> - Typedef for const_iterator.<br>
+ 
+     <tt>begin()</tt>, <tt>end()</tt>,
+     <tt>size()</tt>, <tt>empty()</tt>
+ 
+     <p>These are forwarding methods that make it easy to access the contents of
+     a <tt>Function</tt> object's <a href="#BasicBlock"><tt>BasicBlock</tt></a>
+     list.</p></li>
+ 
+   <li><tt>Function::BasicBlockListType &getBasicBlockList()</tt>
+ 
+     <p>Returns the list of <a href="#BasicBlock"><tt>BasicBlock</tt></a>s.  This
+     is necessary to use when you need to update the list or perform a complex
+     action that doesn't have a forwarding method.</p></li>
+ 
+   <li><tt>Function::arg_iterator</tt> - Typedef for the argument list
+ iterator<br>
+     <tt>Function::const_arg_iterator</tt> - Typedef for const_iterator.<br>
+ 
+     <tt>arg_begin()</tt>, <tt>arg_end()</tt>,
+     <tt>arg_size()</tt>, <tt>arg_empty()</tt>
+ 
+     <p>These are forwarding methods that make it easy to access the contents of
+     a <tt>Function</tt> object's <a href="#Argument"><tt>Argument</tt></a>
+     list.</p></li>
+ 
+   <li><tt>Function::ArgumentListType &getArgumentList()</tt>
+ 
+     <p>Returns the list of <a href="#Argument"><tt>Argument</tt></a>s.  This is
+     necessary to use when you need to update the list or perform a complex
+     action that doesn't have a forwarding method.</p></li>
+ 
+   <li><tt><a href="#BasicBlock">BasicBlock</a> &getEntryBlock()</tt>
+ 
+     <p>Returns the entry <a href="#BasicBlock"><tt>BasicBlock</tt></a> for the
+     function.  Because the entry block for the function is always the first
+     block, this returns the first block of the <tt>Function</tt>.</p></li>
+ 
+   <li><tt><a href="#Type">Type</a> *getReturnType()</tt><br>
+     <tt><a href="#FunctionType">FunctionType</a> *getFunctionType()</tt>
+ 
+     <p>This traverses the <a href="#Type"><tt>Type</tt></a> of the
+     <tt>Function</tt> and returns the return type of the function, or the <a
+     href="#FunctionType"><tt>FunctionType</tt></a> of the actual
+     function.</p></li>
+ 
+   <li><tt><a href="#SymbolTable">SymbolTable</a> *getSymbolTable()</tt>
+ 
+     <p> Return a pointer to the <a href="#SymbolTable"><tt>SymbolTable</tt></a>
+     for this <tt>Function</tt>.</p></li>
+ </ul>
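+ <p>The relationship between the block list, <tt>isExternal()</tt>, and
+ <tt>getEntryBlock()</tt> can be sketched with a toy model.  The
+ <tt>Function</tt> class below is a simplified stand-in built on
+ <tt>std::list</tt>, assuming nothing about LLVM's real implementation:</p>

```cpp
#include <cassert>
#include <list>
#include <string>

struct BasicBlock { std::string Name; };

// Toy stand-in for llvm::Function.
class Function {
public:
  typedef std::list<BasicBlock*> BasicBlockListType;

  // Updates to the block list go through the container itself.
  BasicBlockListType &getBasicBlockList() { return Blocks; }

  // Forwarding queries.
  bool empty() const { return Blocks.empty(); }
  unsigned size() const { return static_cast<unsigned>(Blocks.size()); }

  // A function with no blocks is only a declaration.
  bool isExternal() const { return Blocks.empty(); }

  // The entry block is always the first block in the list.
  BasicBlock &getEntryBlock() {
    assert(!Blocks.empty() && "a declaration has no entry block");
    return *Blocks.front();
  }

private:
  BasicBlockListType Blocks;
};
```

+ <p>In this model a freshly created function counts as external until its
+ first block is added, and that first block then acts as the entry node.</p>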
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="GlobalVariable">The <tt>GlobalVariable</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include "<a
+ href="/doxygen/GlobalVariable_8h-source.html">llvm/GlobalVariable.h</a>"</tt>
+ <br>
+ doxygen info: <a href="/doxygen/classllvm_1_1GlobalVariable.html">GlobalVariable
+  Class</a><br>
+ Superclasses: <a href="#GlobalValue"><tt>GlobalValue</tt></a>, 
+ <a href="#Constant"><tt>Constant</tt></a>,
+ <a href="#User"><tt>User</tt></a>,
+ <a href="#Value"><tt>Value</tt></a></p>
+ 
+ <p>Global variables are represented with the (surprise, surprise)
+ <tt>GlobalVariable</tt> class. Like functions, <tt>GlobalVariable</tt>s are also
+ subclasses of <a href="#GlobalValue"><tt>GlobalValue</tt></a>, and as such are
+ always referenced by their address (global values must live in memory, so their
+ "name" refers to their constant address). See 
+ <a href="#GlobalValue"><tt>GlobalValue</tt></a> for more on this.  Global 
+ variables may have an initial value (which must be a 
+ <a href="#Constant"><tt>Constant</tt></a>), and if they have an initializer, 
+ they may be marked as "constant" themselves (indicating that their contents 
+ never change at runtime).</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_GlobalVariable">Important Public Members of the
+   <tt>GlobalVariable</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li><tt>GlobalVariable(const </tt><tt><a href="#Type">Type</a> *Ty, bool
+   isConstant, LinkageTypes& Linkage, <a href="#Constant">Constant</a>
+   *Initializer = 0, const std::string &Name = "", Module* Parent = 0)</tt>
+ 
+     <p>Create a new global variable of the specified type. If
+     <tt>isConstant</tt> is true then the global variable will be marked as
+     unchanging for the program. The Linkage parameter specifies the type of
+     linkage (internal, external, weak, linkonce, appending) for the variable. If
+     the linkage is InternalLinkage, WeakLinkage, or LinkOnceLinkage,  then
+     the resultant global variable will have internal linkage.  AppendingLinkage
+     concatenates together all instances (in different translation units) of the
+     variable into a single variable but is only applicable to arrays.   See
+     the <a href="LangRef.html#modulestructure">LLVM Language Reference</a> for
+     further details on linkage types. Optionally an initializer, a name, and the
+     module to put the variable into may be specified for the global variable as
+     well.</p></li>
+ 
+   <li><tt>bool isConstant() const</tt>
+ 
+     <p>Returns true if this is a global variable that is known not to
+     be modified at runtime.</p></li>
+ 
+   <li><tt>bool hasInitializer()</tt>
+ 
+     <p>Returns true if this <tt>GlobalVariable</tt> has an initializer.</p></li>
+ 
+   <li><tt><a href="#Constant">Constant</a> *getInitializer()</tt>
+ 
+     <p>Returns the initial value for a <tt>GlobalVariable</tt>.  It is not legal
+     to call this method if there is no initializer.</p></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Module">The <tt>Module</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>#include "<a
+ href="/doxygen/Module_8h-source.html">llvm/Module.h</a>"</tt><br> doxygen info:
+ <a href="/doxygen/classllvm_1_1Module.html">Module Class</a></p>
+ 
+ <p>The <tt>Module</tt> class represents the top level structure present in LLVM
+ programs.  An LLVM module is effectively either a translation unit of the
+ original program or a combination of several translation units merged by the
+ linker.  The <tt>Module</tt> class keeps track of a list of <a
+ href="#Function"><tt>Function</tt></a>s, a list of <a
+ href="#GlobalVariable"><tt>GlobalVariable</tt></a>s, and a <a
+ href="#SymbolTable"><tt>SymbolTable</tt></a>.  Additionally, it contains a few
+ helpful member functions that try to make common operations easy.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Module">Important Public Members of the <tt>Module</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li><tt>Module::Module(std::string name = "")</tt></li>
+ </ul>
+ 
+ <p>Constructing a <a href="#Module">Module</a> is easy. You can optionally
+ provide a name for it (probably based on the name of the translation unit).</p>
+ 
+ <ul>
+   <li><tt>Module::iterator</tt> - Typedef for function list iterator<br>
+     <tt>Module::const_iterator</tt> - Typedef for const_iterator.<br>
+ 
+     <tt>begin()</tt>, <tt>end()</tt>,
+     <tt>size()</tt>, <tt>empty()</tt>
+ 
+     <p>These are forwarding methods that make it easy to access the contents of
+     a <tt>Module</tt> object's <a href="#Function"><tt>Function</tt></a>
+     list.</p></li>
+ 
+   <li><tt>Module::FunctionListType &getFunctionList()</tt>
+ 
+     <p> Returns the list of <a href="#Function"><tt>Function</tt></a>s.  This is
+     necessary to use when you need to update the list or perform a complex
+     action that doesn't have a forwarding method.</p>
+ 
+     <p><!--  Global Variable --></p></li> 
+ </ul>
+ 
+ <hr>
+ 
+ <ul>
+   <li><tt>Module::global_iterator</tt> - Typedef for global variable list iterator<br>
+ 
+     <tt>Module::const_global_iterator</tt> - Typedef for const_iterator.<br>
+ 
+     <tt>global_begin()</tt>, <tt>global_end()</tt>,
+     <tt>global_size()</tt>, <tt>global_empty()</tt>
+ 
+     <p> These are forwarding methods that make it easy to access the contents of
+     a <tt>Module</tt> object's <a
+     href="#GlobalVariable"><tt>GlobalVariable</tt></a> list.</p></li>
+ 
+   <li><tt>Module::GlobalListType &getGlobalList()</tt>
+ 
+     <p>Returns the list of <a
+     href="#GlobalVariable"><tt>GlobalVariable</tt></a>s.  This is necessary to
+     use when you need to update the list or perform a complex action that
+     doesn't have a forwarding method.</p>
+ 
+     <p><!--  Symbol table stuff --> </p></li>
+ </ul>
+ 
+ <hr>
+ 
+ <ul>
+   <li><tt><a href="#SymbolTable">SymbolTable</a> *getSymbolTable()</tt>
+ 
+     <p>Return a pointer to the <a href="#SymbolTable"><tt>SymbolTable</tt></a>
+     for this <tt>Module</tt>.</p>
+ 
+     <p><!--  Convenience methods --></p></li>
+ </ul>
+ 
+ <hr>
+ 
+ <ul>
+   <li><tt><a href="#Function">Function</a> *getFunction(const std::string
+   &Name, const <a href="#FunctionType">FunctionType</a> *Ty)</tt>
+ 
+     <p>Look up the specified function in the <tt>Module</tt> <a
+     href="#SymbolTable"><tt>SymbolTable</tt></a>. If it does not exist, return
+     <tt>null</tt>.</p></li>
+ 
+   <li><tt><a href="#Function">Function</a> *getOrInsertFunction(const
+   std::string &Name, const <a href="#FunctionType">FunctionType</a> *T)</tt>
+ 
+     <p>Look up the specified function in the <tt>Module</tt> <a
+     href="#SymbolTable"><tt>SymbolTable</tt></a>. If it does not exist, add an
+     external declaration for the function and return it.</p></li>
+ 
+   <li><tt>std::string getTypeName(const <a href="#Type">Type</a> *Ty)</tt>
+ 
+     <p>If there is at least one entry in the <a
+     href="#SymbolTable"><tt>SymbolTable</tt></a> for the specified <a
+     href="#Type"><tt>Type</tt></a>, return it.  Otherwise return the empty
+     string.</p></li>
+ 
+   <li><tt>bool addTypeName(const std::string &Name, const <a
+   href="#Type">Type</a> *Ty)</tt>
+ 
+     <p>Insert an entry in the <a href="#SymbolTable"><tt>SymbolTable</tt></a>
+     mapping <tt>Name</tt> to <tt>Ty</tt>. If there is already an entry for this
+     name, true is returned and the <a
+     href="#SymbolTable"><tt>SymbolTable</tt></a> is not modified.</p></li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Constant">The <tt>Constant</tt> class and subclasses</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>Constant</tt> is the base class for the different kinds of constants. It
+ is subclassed by ConstantBool, ConstantInt, ConstantSInt, ConstantUInt,
+ ConstantArray, etc., which represent the various types of constants.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Constant">Important Public Methods</a>
+ </div>
+ <div class="doc_text">
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Important Subclasses of Constant </div>
+ <div class="doc_text">
+ <ul>
+   <li>ConstantSInt : This subclass of Constant represents a signed integer 
+   constant.
+     <ul>
+       <li><tt>int64_t getValue() const</tt>: Returns the underlying value of
+       this constant. </li>
+     </ul>
+   </li>
+   <li>ConstantUInt : This class represents an unsigned integer.
+     <ul>
+       <li><tt>uint64_t getValue() const</tt>: Returns the underlying value of 
+       this constant. </li>
+     </ul>
+   </li>
+   <li>ConstantFP : This class represents a floating point constant.
+     <ul>
+       <li><tt>double getValue() const</tt>: Returns the underlying value of 
+       this constant. </li>
+     </ul>
+   </li>
+   <li>ConstantBool : This represents a boolean constant.
+     <ul>
+       <li><tt>bool getValue() const</tt>: Returns the underlying value of this 
+       constant. </li>
+     </ul>
+   </li>
+   <li>ConstantArray : This represents a constant array.
+     <ul>
+       <li><tt>const std::vector&lt;Use&gt; &amp;getValues() const</tt>: Returns 
+       a vector of component constants that make up this array. </li>
+     </ul>
+   </li>
+   <li>ConstantStruct : This represents a constant struct.
+     <ul>
+       <li><tt>const std::vector&lt;Use&gt; &amp;getValues() const</tt>: Returns 
+       a vector of component constants that make up this struct. </li>
+     </ul>
+   </li>
+   <li>GlobalValue : This represents either a global variable or a function. In 
+   either case, the value is a constant fixed address (after linking). 
+   </li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Type">The <tt>Type</tt> class and Derived Types</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Type, as noted earlier, is also a subclass of the Value class.  Any primitive
+ type (like int, short, etc.) in LLVM is an instance of the Type class.  All other
+ types are instances of subclasses of Type, like FunctionType, ArrayType,
+ etc. DerivedType is the interface for all such derived types, including
+ FunctionType, ArrayType, PointerType, and StructType. Types can have names. They
+ can be recursive (StructType).  There exists exactly one instance of any type
+ structure at a time. This allows using pointer equality of Type *s for comparing
+ types.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Value">Important Public Methods</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+   <li><tt>bool isSigned() const</tt>: Returns whether an integral numeric type
+   is signed. This is true for SByteTy, ShortTy, IntTy, LongTy. Note that this is
+   not true for Float and Double. </li>
+ 
+   <li><tt>bool isUnsigned() const</tt>: Returns whether a numeric type is
+   unsigned. This is not quite the complement of isSigned... nonnumeric types
+   return false as they do with isSigned. This returns true for UByteTy,
+   UShortTy, UIntTy, and ULongTy. </li>
+ 
+   <li><tt>bool isInteger() const</tt>: Equivalent to isSigned() || isUnsigned().</li>
+ 
+   <li><tt>bool isIntegral() const</tt>: Returns true if this is an integral
+   type, which is either Bool type or one of the Integer types.</li>
+ 
+   <li><tt>bool isFloatingPoint()</tt>: Return true if this is one of the two
+   floating point types.</li>
+ 
+   <li><tt>isLosslesslyConvertableTo (const Type *Ty) const</tt>: Return true if
+   this type can be converted to 'Ty' without any reinterpretation of bits. For
+   example, uint to int or one pointer type to another.</li>
+ </ul>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="m_Value">Important Derived Types</a>
+ </div>
+ <div class="doc_text">
+ <ul>
+   <li>SequentialType : This is subclassed by ArrayType and PointerType
+     <ul>
+       <li><tt>const Type * getElementType() const</tt>: Returns the type of each
+       of the elements in the sequential type. </li>
+     </ul>
+   </li>
+   <li>ArrayType : This is a subclass of SequentialType and defines interface for
+   array types.
+     <ul>
+       <li><tt>unsigned getNumElements() const</tt>: Returns the number of 
+       elements in the array. </li>
+     </ul>
+   </li>
+   <li>PointerType : Subclass of SequentialType for  pointer types. </li>
+   <li>StructType : Subclass of DerivedType for struct types. </li>
+   <li>FunctionType : Subclass of DerivedType for function types.
+     <ul>
+       <li><tt>bool isVarArg() const</tt>: Returns true if it's a vararg
+       function.</li>
+       <li><tt> const Type * getReturnType() const</tt>: Returns the
+       return type of the function.</li>
+       <li><tt>const Type * getParamType (unsigned i)</tt>: Returns
+       the type of the ith parameter.</li>
+       <li><tt> const unsigned getNumParams() const</tt>: Returns the
+       number of formal parameters.</li>
+     </ul>
+   </li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="Argument">The <tt>Argument</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>This subclass of Value defines the interface for incoming formal
+ arguments to a function. A Function maintains a list of its formal
+ arguments. An argument has a pointer to the parent Function.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:dhurjati at cs.uiuc.edu">Dinakar Dhurjati</a> and
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/Projects.html
diff -c /dev/null llvm-www/releases/1.8/docs/Projects.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/Projects.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,460 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Creating an LLVM Project</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">Creating an LLVM Project</div>
+ 
+ <ol>
+ <li><a href="#overview">Overview</a></li>
+ <li><a href="#create">Create a project from the Sample Project</a></li>
+ <li><a href="#source">Source tree layout</a></li>
+ <li><a href="#makefiles">Writing LLVM-style Makefiles</a>
+   <ol>
+   <li><a href="#reqVars">Required Variables</a></li>
+   <li><a href="#varsBuildDir">Variables for Building Subdirectories</a></li>
+   <li><a href="#varsBuildLib">Variables for Building Libraries</a></li>
+   <li><a href="#varsBuildProg">Variables for Building Programs</a></li>
+   <li><a href="#miscVars">Miscellaneous Variables</a></li>
+   </ol></li>
+ <li><a href="#objcode">Placement of object code</a></li>
+ <li><a href="#help">Further help</a></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by John Criswell</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="overview">Overview</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM build system is designed to facilitate the building of third party
+ projects that use LLVM header files, libraries, and tools.  In order to use
+ these facilities, a Makefile from a project must do the following things:</p>
+ 
+ <ol>
+   <li>Set <tt>make</tt> variables. There are several variables that a Makefile
+   needs to set to use the LLVM build system:
+   <ul>
+     <li><tt>PROJECT_NAME</tt> - The name by which your project is known.</li>
+     <li><tt>LLVM_SRC_ROOT</tt> - The root of the LLVM source tree.</li>
+     <li><tt>LLVM_OBJ_ROOT</tt> - The root of the LLVM object tree.</li>
+     <li><tt>PROJ_SRC_ROOT</tt> - The root of the project's source tree.</li>
+     <li><tt>PROJ_OBJ_ROOT</tt> - The root of the project's object tree.</li>
+     <li><tt>PROJ_INSTALL_ROOT</tt> - The root installation directory.</li>
+     <li><tt>LEVEL</tt> - The relative path from the current directory to the 
+     project's root ($PROJ_OBJ_ROOT).</li>
+   </ul></li>
+   <li>Include <tt>Makefile.config</tt> from <tt>$(LLVM_OBJ_ROOT)</tt>.</li>
+   <li>Include <tt>Makefile.rules</tt> from <tt>$(LLVM_SRC_ROOT)</tt>.</li>
+ </ol>
+ 
+ <p>There are two ways that you can set all of these variables:</p>
+ <ol>
+   <li>You can write your own Makefiles which hard-code these values.</li>
+   <li>You can use the pre-made LLVM sample project. This sample project 
+   includes Makefiles, a configure script that can be used to configure the 
+   location of LLVM, and the ability to support multiple object directories 
+   from a single source directory.</li>
+ </ol>
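+ 
+ <p>As a rough sketch of the first approach, a hand-written Makefile could
+ hard-code the variables and then include the two LLVM makefiles named in the
+ list above.  The project name and every path shown here are hypothetical
+ placeholders, not values taken from LLVM:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ # Hypothetical project name and locations; adjust to your own setup.
+ PROJECT_NAME      = jazz
+ LLVM_SRC_ROOT     = /home/user/llvm
+ LLVM_OBJ_ROOT     = /home/user/llvm-obj
+ PROJ_SRC_ROOT     = /home/user/jazz
+ PROJ_OBJ_ROOT     = /home/user/jazz-obj
+ PROJ_INSTALL_ROOT = /usr/local
+ 
+ # This Makefile sits in the project's root, so LEVEL is the current directory.
+ LEVEL = .
+ 
+ # Makefile.config comes from the LLVM object tree, Makefile.rules from the
+ # LLVM source tree.
+ include $(LLVM_OBJ_ROOT)/Makefile.config
+ include $(LLVM_SRC_ROOT)/Makefile.rules
+ </pre>
+ </div>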
+ 
+ <p>This document assumes that you will base your project on the LLVM sample
+ project found in <tt>llvm/projects/sample</tt>.  If you want to devise your own
+ build system, studying the sample project and LLVM Makefiles will probably
+ provide enough information on how to write your own Makefiles.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="create">Create a Project from the Sample Project</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Follow these simple steps to start your project:</p>
+ 
+ <ol>
+ <li>Copy the <tt>llvm/projects/sample</tt> directory to any place of your
+ choosing, then rename the directory to match the name of your project.</li>
+ 
+ <li>
+ If you downloaded LLVM using CVS, remove all the directories named CVS (and all
+ the files therein) from your project's new source tree.  This will keep CVS
+ from thinking that your project is inside <tt>llvm/projects/sample</tt>.
+ </li>
+ 
+ <li>Add your source code and Makefiles to your source tree.</li>
+ 
+ <li>If you want your project to be configured with the <tt>configure</tt> script
+ then you need to edit <tt>autoconf/configure.ac</tt> as follows:
+   <ul>
+     <li><b>AC_INIT</b>. Place the name of your project, its version number and
+     a contact email address for your project as the arguments to this macro</li>
+     <li><b>AC_CONFIG_AUX_DIR</b>. If your project isn't in the
+     <tt>llvm/projects</tt> directory then you might need to adjust this so that
+     it specifies a relative path to the <tt>llvm/autoconf</tt> directory.</li>
+     <li><b>LLVM_CONFIG_PROJECT</b>. Just leave this alone.</li>
+     <li><b>AC_CONFIG_SRCDIR</b>. Specify a path to a file name that identifies
+     your project; or just leave it at <tt>Makefile.common.in</tt></li>
+     <li><b>AC_CONFIG_FILES</b>. Do not change.</li>
+     <li><b>AC_CONFIG_MAKEFILE</b>. Use one of these macros for each Makefile
+     that your project uses. This macro arranges for your makefiles to be copied
+     from the source directory, unmodified, to the build directory.</li>
+   </ul>
+ </li>
+ 
+ <li>After updating <tt>autoconf/configure.ac</tt>, regenerate the
+ configure script with these commands:
+ 
+ <div class="doc_code">
+ <p><tt>% cd autoconf<br>
+        % AutoRegen.sh</tt></p>
+ </div>
+ 
+ <p>You must be using Autoconf version 2.59 or later and your aclocal version 
+ should be 1.9 or later.</p></li>
+ 
+ <li>Run <tt>configure</tt> in the directory in which you want to place
+ object code.  Use the following options to tell your project where it
+ can find LLVM:
+ 
+   <dl>
+     <dt><tt>--with-llvmsrc=&lt;directory&gt;</tt></dt>
+     <dd>Tell your project where the LLVM source tree is located.</dd>
+     <dt><br/><tt>--with-llvmobj=&lt;directory&gt;</tt></dt>
+     <dd>Tell your project where the LLVM object tree is located.</dd>
+     <dt><br/><tt>--prefix=&lt;directory&gt;</tt></dt>
+     <dd>Tell your project where it should get installed.</dd>
+   </dl>
+ </ol>
+ 
+ <p>That's it!  Now all you have to do is type <tt>gmake</tt> (or <tt>make</tt>
+ if you're on a GNU/Linux system) in the root of your object directory, and your 
+ project should build.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="source">Source Tree Layout</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>In order to use the LLVM build system, you will want to organize your
+ source code so that it can benefit from the build system's features.
+ Mainly, you want your source tree layout to look similar to the LLVM
+ source tree layout.  The best way to do this is to just copy the
+ project tree from <tt>llvm/projects/sample</tt> and modify it to meet
+ your needs, but you can certainly add to it if you want.</p>
+ 
+ <p>Underneath your top level directory, you should have the following
+ directories:</p>
+ 
+ <dl>
+   <dt><b>lib</b>
+   <dd>
+   This subdirectory should contain all of your library source
+   code.  For each library that you build, you will have one
+   directory in <b>lib</b> that will contain that library's source
+   code.
+ 
+   <p>
+   Libraries can be object files, archives, or dynamic libraries.
+   The <b>lib</b> directory is just a convenient place for libraries
+   as it places them all in a directory from which they can be linked
+   later.
+ 
+   <dt><b>include</b>
+   <dd>
+   This subdirectory should contain any header files that are
+   global to your project.  By global, we mean that they are used
+   by more than one library or executable of your project.
+   <p>
+   By placing your header files in <b>include</b>, they will be
+   found automatically by the LLVM build system.  For example, if
+   you have a file <b>include/jazz/note.h</b>, then your source
+   files can include it simply with <b>#include "jazz/note.h"</b>.
+ 
+   <dt><b>tools</b>
+   <dd>
+   This subdirectory should contain all of your source
+   code for executables.  For each program that you build, you
+   will have one directory in <b>tools</b> that will contain that
+   program's source code.
+   <p>
+ 
+   <dt><b>test</b>
+   <dd>
+   This subdirectory should contain tests that verify that your code
+   works correctly.  Automated tests are especially useful.
+   <p>
+   Currently, the LLVM build system provides basic support for tests.
+   The LLVM system provides the following:
+   <ul>
+     <li>
+     LLVM provides a tcl procedure that is used by Dejagnu to run
+     tests.  It can be found in <tt>llvm/lib/llvm-dg.exp</tt>.  This
+     test procedure uses RUN lines in the actual test case to determine
+     how to run the test.  See the <a
+     href="TestingGuide.html">TestingGuide</a> for more details. You
+     can easily write Makefile support similar to the Makefiles in 
+     <tt>llvm/test</tt> to use Dejagnu to run your project's tests.<br/></li>
+     <li>
+     LLVM contains an optional package called <tt>llvm-test</tt>
+     which provides benchmarks and programs that are known to compile with the
+     LLVM GCC front ends.  You can use these
+     programs to test your code, gather statistics information, and
+     compare it to the current LLVM performance statistics.
+     <br/>Currently, there is no way to hook your tests directly into the
+     <tt>llvm/test</tt> testing harness.  You will simply
+     need to find a way to use the source provided within that directory
+     on your own.
+   </ul>
+ </dl>
+ 
+ <p>Typically, you will want to build your <b>lib</b> directory first followed by
+ your <b>tools</b> directory.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="makefiles">Writing LLVM Style Makefiles</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM build system provides a convenient way to build libraries and
+ executables.  Most of your project Makefiles will only need to define a few
+ variables.  Below is a list of the variables one can set and what they can
+ do:</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="reqVars">Required Variables</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <dl>
+   <dt>LEVEL
+   <dd>
+   This variable is the relative path from this Makefile to the
+   top directory of your project's source code.  For example, if
+   your source code is in <tt>/tmp/src</tt>, then the Makefile in
+   <tt>/tmp/src/jump/high</tt> would set <tt>LEVEL</tt> to <tt>"../.."</tt>.
+ </dl>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="varsBuildDir">Variables for Building Subdirectories</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <dl>
+   <dt>DIRS
+   <dd>
+   This is a space separated list of subdirectories that should be
+   built.  They will be built, one at a time, in the order
+   specified.
+   <p>
+ 
+   <dt>PARALLEL_DIRS
+   <dd>
+   This is a list of directories that can be built in parallel.
+   These will be built after the directories in DIRS have been
+   built.
+   <p>
+ 
+   <dt>OPTIONAL_DIRS
+   <dd>
+   This is a list of directories that can be built if they exist,
+   but will not cause an error if they do not exist.  They are
+   built serially in the order in which they are listed.
+ </dl>
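+ 
+ <p>For illustration, a top-level Makefile for a project laid out as described
+ earlier might look like the sketch below.  It assumes the project was copied
+ from the sample project, which supplies a <tt>Makefile.common</tt> to include;
+ the directory names are only examples:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ # Top-level Makefile: this directory is the project root.
+ LEVEL = .
+ 
+ # Built one at a time, in this order: libraries first, then tools.
+ DIRS = lib tools
+ 
+ # Assumed to exist because the project is based on the sample project.
+ include $(LEVEL)/Makefile.common
+ </pre>
+ </div>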
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="varsBuildLib">Variables for Building Libraries</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <dl>
+   <dt>LIBRARYNAME
+   <dd>
+   This variable contains the base name of the library that will
+   be built.  For example, to build a library named
+   <tt>libsample.a</tt>, LIBRARYNAME should be set to
+   <tt>sample</tt>.
+   <p>
+ 
+   <dt>BUILD_ARCHIVE
+   <dd>
+   By default, a library is a <tt>.o</tt> file that is linked
+   directly into a program.  To build an archive (also known as
+   a static library), set the BUILD_ARCHIVE variable.
+   <p>
+ 
+   <dt>SHARED_LIBRARY
+   <dd>
+   If SHARED_LIBRARY is defined in your Makefile, a shared
+   (or dynamic) library will be built.
+ </dl>
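+ 
+ <p>A minimal sketch of a library Makefile, assuming it lives in a hypothetical
+ <tt>lib/sample</tt> directory of a project based on the sample project.  The
+ value <tt>1</tt> is an arbitrary choice made here; the text above only
+ requires that the variable be set:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ # lib/sample/Makefile: two levels below the project root.
+ LEVEL = ../..
+ 
+ # Base name of the library; the result is libsample.a.
+ LIBRARYNAME = sample
+ 
+ # Build an archive (static library) instead of a single .o file.
+ BUILD_ARCHIVE = 1
+ 
+ # Assumed to exist because the project is based on the sample project.
+ include $(LEVEL)/Makefile.common
+ </pre>
+ </div>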
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="varsBuildProg">Variables for Building Programs</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <dl>
+   <dt>TOOLNAME
+   <dd>
+   This variable contains the name of the program that will
+   be built.  For example, to build an executable named
+   <tt>sample</tt>, TOOLNAME should be set to <tt>sample</tt>.
+   <p>
+ 
+   <dt>USEDLIBS
+   <dd>
+   This variable holds a space separated list of libraries that
+   should be linked into the program.  These libraries must either
+   be LLVM libraries or libraries that come from your <b>lib</b>
+   directory.  The libraries must be specified by their base name.
+   For example, to link libsample.a, you would set USEDLIBS to
+   <tt>sample</tt>.
+   <p>
+   Note that this works only for statically linked libraries.
+   <p>
+ 
+   <dt>LIBS
+   <dd>
+   To link dynamic libraries, add <tt>-l&lt;library base name&gt;</tt> to
+   the LIBS variable.  The LLVM build system will look in the same places
+   for dynamic libraries as it does for static libraries.
+   <p>
+   For example, to link <tt>libsample.so</tt>, you would have the
+   following line in your <tt>Makefile</tt>:
+   <p>
+   <tt>
+   LIBS += -lsample
+   </tt>
+ </dl>
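+ 
+ <p>Putting these variables together, a tool Makefile might look roughly like
+ the following.  The <tt>tools/sample</tt> location and the dynamic library
+ <tt>libjazz.so</tt> are hypothetical, and <tt>Makefile.common</tt> is assumed
+ to come from the sample project:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ # tools/sample/Makefile: two levels below the project root.
+ LEVEL = ../..
+ 
+ # Name of the executable to build.
+ TOOLNAME = sample
+ 
+ # Statically linked libraries, given by base name (libsample.a here).
+ USEDLIBS = sample
+ 
+ # Dynamic libraries are added to LIBS instead (hypothetical libjazz.so).
+ LIBS += -ljazz
+ 
+ # Assumed to exist because the project is based on the sample project.
+ include $(LEVEL)/Makefile.common
+ </pre>
+ </div>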
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="miscVars">Miscellaneous Variables</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <dl>
+   <dt>ExtraSource
+   <dd>
+   This variable contains a space separated list of extra source
+   files that need to be built.  It is useful for including the
+   output of Lex and Yacc programs.
+   <p>
+ 
+   <dt>CFLAGS
+   <dt>CPPFLAGS
+   <dd>
+   These variables can be used to add options to the C and C++
+   compilers, respectively.  They are typically used to add options
+   that tell the compiler the location of additional directories
+   to search for header files.
+   <p>
+   It is highly suggested that you append to CFLAGS and CPPFLAGS as
+   opposed to overwriting them.  The master Makefiles may already
+   have useful options in them that you may not want to overwrite.
+   <p>
+ </dl>
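+ 
+ <p>As a small sketch of appending rather than overwriting, the lines below
+ add one header search directory to each variable.  The directory name is
+ hypothetical:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ # Append with += so options set by the master Makefiles are preserved.
+ CFLAGS   += -I$(PROJ_SRC_ROOT)/extra/include
+ CPPFLAGS += -I$(PROJ_SRC_ROOT)/extra/include
+ 
+ # Avoid plain assignment, which would discard the existing options:
+ # CPPFLAGS = -I$(PROJ_SRC_ROOT)/extra/include
+ </pre>
+ </div>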
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="objcode">Placement of Object Code</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The final location of built libraries and executables will depend upon
+ whether you do a Debug, Release, or Profile build.</p>
+ 
+ <dl>
+   <dt>Libraries
+   <dd>
+   All libraries (static and dynamic) will be stored in
+   <tt>PROJ_OBJ_ROOT/&lt;type&gt;/lib</tt>, where type is <tt>Debug</tt>,
+   <tt>Release</tt>, or <tt>Profile</tt> for a debug, optimized, or
+   profiled build, respectively.<p>
+ 
+   <dt>Executables
+   <dd>All executables will be stored in
+   <tt>PROJ_OBJ_ROOT/&lt;type&gt;/bin</tt>, where type is <tt>Debug</tt>,
+   <tt>Release</tt>, or <tt>Profile</tt> for a debug, optimized, or profiled
+   build, respectively.
+ </dl>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="help">Further Help</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>If you have any questions or need any help creating an LLVM project,
+ the LLVM team would be more than happy to help.  You can always post your
+ questions to the <a
+ href="http://mail.cs.uiuc.edu/mailman/listinfo/llvmdev">LLVM Developers
+ Mailing List</a>.</p>
+ 
+ </div>
+   
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:criswell at uiuc.edu">John Criswell</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a>
+   <br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/ReleaseNotes.html
diff -c /dev/null llvm-www/releases/1.8/docs/ReleaseNotes.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/ReleaseNotes.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,691 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+   <title>LLVM 1.8 Release Notes</title>
+ </head>
+ <body>
+ 
+ <div class="doc_title">LLVM 1.8 Release Notes</div>
+  
+ <ol>
+   <li><a href="#intro">Introduction</a></li>
+   <li><a href="#whatsnew">What's New?</a></li>
+   <li><a href="GettingStarted.html">Installation Instructions</a></li>
+   <li><a href="#portability">Portability and Supported Platforms</a></li>
+   <li><a href="#knownproblems">Known Problems</a></li>
+   <li><a href="#additionalinfo">Additional Information</a></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by the <a href="http://llvm.org">LLVM Team</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="intro">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document contains the release notes for the LLVM compiler
+ infrastructure, release 1.8.  Here we describe the status of LLVM, including any
+ known problems and major improvements from the previous release.  The most
+ up-to-date version of this document (corresponding to LLVM CVS) can be found
+ on the <a
+ href="http://llvm.org/releases/">LLVM releases web site</a>.  If you are
+ not reading this on the LLVM web pages, you should probably go there because
+ this document may be updated after the release.</p>
+ 
+ <p>For more information about LLVM, including information about the latest
+ release, please check out the <a href="http://llvm.org/">main LLVM
+ web site</a>.  If you have questions or comments, the <a
+ href="http://mail.cs.uiuc.edu/mailman/listinfo/llvmdev">LLVM developer's mailing
+ list</a> is a good place to send them.</p>
+ 
+ <p>Note that if you are reading this file from CVS or the main LLVM web page,
+ this document applies to the <i>next</i> release, not the current one.  To see
+ the release notes for the current or previous releases, see the <a
+ href="http://llvm.org/releases/">releases page</a>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="whatsnew">What's New?</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This is the ninth public release of the LLVM Compiler Infrastructure. This
+ release incorporates a large number of enhancements and new features,
+ including DWARF debugging support (C and C++ on Darwin/PPC), improved inline
+ assembly support, a new <a href="http://llvm.org/nightlytest/">nightly 
+ tester</a>, llvm-config enhancements, many bugs
+ fixed, and performance and compile time improvements.
+ </p>
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_subsection">
+ <a name="newfeatures">New Features in LLVM 1.8</a>
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection"><a name="dwarf">DWARF debugging 
+ support </a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>The llvm-gcc4 C front-end now generates debugging info for C and C++.  This
+ information is propagated through the compiler and the code generator can
+ currently produce DWARF debugging information from it.  DWARF is a standard
+ debugging format used on many platforms, but currently LLVM only includes 
+ target support for Mac OS X targets for the 1.8 release.
+ </p>
+ 
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection"><a name="inlineasm">Inline Assembly
+ Support</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>Inline assembly support is substantially improved in LLVM 1.8 over LLVM 1.7.
+ Many unsupported features are now supported, and inline asm support in the X86
+ backend is far better.  llvm-gcc4 now supports global register variables as 
+ well.</p>
+ 
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection"><a name="loopopt">Loop Optimizer Improvements</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>The loop optimizer passes now use "Loop-Closed SSA Form", which makes it
+ easier to update SSA form as loop transformations change the code.  An 
+ immediate benefit of this is that the loop unswitching pass can now unswitch
+ loops in more cases.
+ </p>
+ 
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection"><a name="jumptab">Jump Table Support for Switches
+ </a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>The code generator now lowers switch statements to jump tables, providing
+ significant performance boosts for applications (e.g. interpreters) whose
+ performance is highly correlated to switch statement performance.</p>
+ 
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection"><a name="jitrelease">Deallocation of JIT'd 
+ Machine Code
+ </a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM JIT now allows clients to deallocate machine code JIT'd to its code
+ buffer.  This is important for long-lived applications that depend on the JIT.
+ </p>
+ 
+ </div>
+ 
+ <!--_________________________________________________________________________-->
+ <div class="doc_subsubsection"><a name="other">Other Improvements</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>This release includes many other improvements, including improvements to
+    the optimizers and code generators (improving the generated code), changes to
+    speed up the compiler in many ways (improving algorithms and fine tuning 
+    code), and changes to reduce the code size of the compiler itself.</p>
+ 
+ <p>More specific changes include:</p>
+ 
+ <ul>
+ <li>LLVM 1.8 includes an initial ARM backend.  This backend is in early 
+     development stages.</li>
+ <li>LLVM 1.8 now includes significantly better support for mingw and 
+     cygwin.</li>
+ <li>The <a href="CommandGuide/html/llvm-config.html">llvm-config</a> tool is 
+     now built by default and has several new features.</li>
+ <li>The X86 and PPC backends now use the correct platform ABI for passing 
+     vectors as arguments to functions.</li>
+ <li>The X86 backend now includes support for the Microsoft ML assembler 
+     ("MASM").</li>
+ <li>The PowerPC backend now pattern matches the 'rlwimi' instruction more 
+     aggressively.</li>
+ <li>Most of LLVM is now built with "-pedantic", ensuring better portability 
+     to more C++ Compilers.</li>
+ <li>The PowerPC backend now includes initial 64-bit support.  The JIT is not
+     complete, and the static compiler has a couple of known bugs, but support
+     is mostly in place. LLVM 1.9 will include completed PPC-64 support. </li>
+ 
+ </ul>
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_subsection">
+ <a name="changes">Significant Changes in LLVM 1.8</a>
+ </div>
+ 
+ <div class="doc_text">
+ <ul>
+ <li>The LLVM "SparcV9" backend (deprecated in LLVM 1.7) has been removed in 
+ LLVM 1.8.  The LLVM "Sparc" backend replaces it.</li>
+ <li>The --version option now prints more useful information, including the
+     build configuration for the tool.</li>
+ </ul>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="portability">Portability and Supported Platforms</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM is known to work on the following platforms:</p>
+ 
+ <ul>
+   <li>Intel and AMD machines running Red Hat Linux, Fedora Core and FreeBSD 
+       (and probably other unix-like systems).</li>
+ <li>Intel and AMD machines running on Win32 using MinGW libraries (native)</li>
+ <li>Sun UltraSPARC workstations running Solaris 8.</li>
+ <li>Intel and AMD machines running on Win32 with the Cygwin libraries (limited
+     support is available for native builds with Visual C++).</li>
+ <li>PowerPC and X86-based Mac OS X systems, running 10.2 and above.</li>
+ <li>Alpha-based machines running Debian GNU/Linux.</li>
+ <li>Itanium-based machines running Linux and HP-UX.</li>
+ </ul>
+ 
+ <p>The core LLVM infrastructure uses
+ <a href="http://www.gnu.org/software/autoconf/">GNU autoconf</a> to adapt itself
+ to the machine and operating system on which it is built.  However, minor
+ porting may be required to get LLVM to work on new platforms.  We welcome your
+ portability patches and reports of successful builds or error messages.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="knownproblems">Known Problems</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This section contains all known problems with the LLVM system, listed by
+ component.  As new problems are discovered, they will be added to these
+ sections.  If you run into a problem, please check the <a
+ href="http://llvm.org/bugs/">LLVM bug database</a> and submit a bug if
+ there isn't already one.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="experimental">Experimental features included with this release</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The following components of this LLVM release are either untested, known to
+ be broken or unreliable, or are in early development.  These components should
+ not be relied on, and bugs should not be filed against them, but they may be
+ useful to some people.  In particular, if you would like to work on one of these
+ components, please contact us on the <a href="http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev">LLVMdev list</a>.</p>
+ 
+ <ul>
+ <li>The <tt>-cee</tt> pass is known to be buggy, and may be removed in a 
+     future release.</li>
+ <li>The IA64 code generator is experimental.</li>
+ <li>The Alpha JIT is experimental.</li>
+ <li>"<tt>-filetype=asm</tt>" (the default) is the only supported value for the 
+     <tt>-filetype</tt> llc option.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="build">Known problems with the Build System</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li>none yet</li>
+ </ul>
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="core">Known problems with the LLVM Core</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+   <li>In the JIT, <tt>dlsym()</tt> on a symbol compiled by the JIT will not
+   work.</li>
+ </ul>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="c-fe">Known problems with the C front-end</a>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Bugs</div>
+ 
+ <div class="doc_text">
+ 
+ <p>
+ llvm-gcc3 has many significant problems that are fixed by llvm-gcc4.
+ Two major ones include:</p>
+ 
+ <ul>
+ <li>With llvm-gcc3, 
+     C99 variable sized arrays do not release stack memory when they go out of 
+     scope.  Thus, the following program may run out of stack space:
+ <pre>
+     for (i = 0; i != 1000000; ++i) {
+       int X[n];
+       foo(X);
+     }
+ </pre></li>
+ 
+ <li>With llvm-gcc3, initialization of global union variables can only be done <a
+ href="http://llvm.org/PR162">with the largest union member</a>.</li>
+ 
+ </ul>
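+ 
+ <p>A minimal sketch of the kind of declaration the union limitation appears to
+ describe: a global union initialized through a member that is not its largest
+ one.  The type and variable names are hypothetical, and the exact failure mode
+ under llvm-gcc3 is described in the bug report linked above, not here.</p>

```c
/* Hypothetical union whose first member is smaller than its largest member. */
union IntOrDouble {
  int i;    /* first member: the one a plain brace initializer sets */
  double d; /* largest member */
};

/* Global union initialized through the smaller member 'i'. */
union IntOrDouble g_value = { 42 };

/* Reads back the member that was initialized. */
int read_global_int(void) {
  return g_value.i;
}
```

+ <p>A standard-conforming compiler stores 42 in <tt>g_value.i</tt>; this is the
+ pattern to check when porting code between llvm-gcc3 and llvm-gcc4.</p>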
+ 
+ <p>llvm-gcc4 is far more stable and produces better code than llvm-gcc3, but
+ does not currently support Link-Time-Optimization or C++ Exception Handling,
+ which llvm-gcc3 does.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   Notes
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li>"long double" is transformed by the front-end into "double".  There is no
+ support for floating point data types of any size other than 32 and 64
+ bits.</li>
+     
+ <li>The following Unix system functionality has not been tested and may not
+ work:
+   <ol>
+   <li><tt>sigsetjmp</tt>, <tt>siglongjmp</tt> - These are not turned into the
+       appropriate <tt>invoke</tt>/<tt>unwind</tt> instructions.  Note that
+       <tt>setjmp</tt> and <tt>longjmp</tt> <em>are</em> compiled correctly.</li>
+   <li><tt>getcontext</tt>, <tt>setcontext</tt>, <tt>makecontext</tt>
+       - These functions have not been tested.</li>
+   </ol></li>
+ 
+ <li>Although many GCC extensions are supported, some are not.  In particular,
+     the following extensions are known to <b>not be</b> supported:
+   <ol>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Local-Labels.html#Local%20Labels">Local Labels</a>: Labels local to a block.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Nested-Functions.html#Nested%20Functions">Nested Functions</a>: As in Algol and Pascal, lexical scoping of functions.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Constructing-Calls.html#Constructing%20Calls">Constructing Calls</a>: Dispatching a call to another function.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html#Extended%20Asm">Extended Asm</a>: Assembler instructions with C expressions as operands.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Constraints.html#Constraints">Constraints</a>: Constraints for asm operands.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Asm-Labels.html#Asm%20Labels">Asm Labels</a>: Specifying the assembler name to use for a C symbol.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Explicit-Reg-Vars.html#Explicit%20Reg%20Vars">Explicit Reg Vars</a>: Defining variables residing in specified registers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Vector-Extensions.html#Vector%20Extensions">Vector Extensions</a>: Using vector instructions through built-in functions.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Target-Builtins.html#Target%20Builtins">Target Builtins</a>:   Built-in functions specific to particular targets.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Thread_002dLocal.html">Thread-Local</a>: Per-thread variables.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Pragmas.html#Pragmas">Pragmas</a>: Pragmas accepted by GCC.</li>
+   </ol>
+ 
+   <p>The following GCC extensions are <b>partially</b> supported.  An ignored
+   attribute means that the LLVM compiler ignores the presence of the attribute,
+   but the code should still work.  An unsupported attribute is one which is
+   ignored by the LLVM compiler and will cause a different interpretation of
+   the program.</p>
+ 
+   <ol>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Variable-Length.html#Variable%20Length">Variable Length</a>:
+       Arrays whose length is computed at run time.<br>
+       Supported, but allocated stack space is not freed until the function returns (noted above).</li>
+ 
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Function-Attributes.html#Function%20Attributes">Function Attributes</a>:
+ 
+       Declaring that functions have no side effects or that they can never
+       return.<br>
+ 
+       <b>Supported:</b> <tt>format</tt>, <tt>format_arg</tt>, <tt>non_null</tt>,
+       <tt>noreturn</tt>, <tt>constructor</tt>, <tt>destructor</tt>,
+       <tt>unused</tt>, <tt>used</tt>,
+       <tt>deprecated</tt>, <tt>warn_unused_result</tt>, <tt>weak</tt><br>
+ 
+       <b>Ignored:</b> <tt>noinline</tt>,
+       <tt>always_inline</tt>, <tt>pure</tt>, <tt>const</tt>, <tt>nothrow</tt>,
+       <tt>malloc</tt>, <tt>no_instrument_function</tt>, <tt>cdecl</tt><br>
+ 
+       <b>Unsupported:</b> <tt>section</tt>, <tt>alias</tt>,
+       <tt>visibility</tt>, <tt>regparm</tt>, <tt>stdcall</tt>,
+       <tt>fastcall</tt>, all other target specific attributes</li>
+    
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Variable-Attributes.html#Variable%20Attributes">Variable Attributes</a>:
+       Specifying attributes of variables.<br>
+       <b>Supported:</b> <tt>cleanup</tt>, <tt>common</tt>, <tt>nocommon</tt>,
+                         <tt>deprecated</tt>, <tt>transparent_union</tt>,
+                         <tt>unused</tt>, <tt>used</tt>, <tt>weak</tt><br>
+ 
+       <b>Unsupported:</b> <tt>aligned</tt>, <tt>mode</tt>, <tt>packed</tt>,
+                         <tt>section</tt>, <tt>shared</tt>, <tt>tls_model</tt>,
+                         <tt>vector_size</tt>, <tt>dllimport</tt>, 
+                         <tt>dllexport</tt>, all target specific attributes.</li>
+ 
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Type-Attributes.html#Type%20Attributes">Type Attributes</a>:	Specifying attributes of types.<br>
+       <b>Supported:</b> <tt>transparent_union</tt>, <tt>unused</tt>,
+                         <tt>deprecated</tt>, <tt>may_alias</tt><br>
+ 
+       <b>Unsupported:</b> <tt>aligned</tt>, <tt>packed</tt>, 
+                         all target specific attributes.</li>
+ 
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#Other%20Builtins">Other Builtins</a>:
+       Other built-in functions.<br>
+       We support all builtins which have a C language equivalent (e.g., 
+          <tt>__builtin_cos</tt>),  <tt>__builtin_alloca</tt>, 
+          <tt>__builtin_types_compatible_p</tt>, <tt>__builtin_choose_expr</tt>,
+          <tt>__builtin_constant_p</tt>, and <tt>__builtin_expect</tt>
+          (currently ignored).  We also support builtins for ISO C99 floating
+          point comparison macros (e.g., <tt>__builtin_islessequal</tt>), 
+          <tt>__builtin_prefetch</tt>, <tt>__builtin_popcount[ll]</tt>,
+          <tt>__builtin_clz[ll]</tt>, and <tt>__builtin_ctz[ll]</tt>.</li>
+   </ol>
+ 
+   <p>The following extensions <b>are</b> known to be supported:</p>
+ 
+   <ol>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Labels-as-Values.html#Labels%20as%20Values">Labels as Values</a>: Getting pointers to labels and computed gotos.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Statement-Exprs.html#Statement%20Exprs">Statement Exprs</a>:   Putting statements and declarations inside expressions.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Typeof.html#Typeof">Typeof</a>: <code>typeof</code>: referring to the type of an expression.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc-3.4.0/gcc/Lvalues.html#Lvalues">Lvalues</a>: Using <code>?:</code>, "<code>,</code>" and casts in lvalues.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Conditionals.html#Conditionals">Conditionals</a>: Omitting the middle operand of a <code>?:</code> expression.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Long-Long.html#Long%20Long">Long Long</a>: Double-word integers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Complex.html#Complex">Complex</a>:   Data types for complex numbers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Hex-Floats.html#Hex%20Floats">Hex Floats</a>: Hexadecimal floating-point constants.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Zero-Length.html#Zero%20Length">Zero Length</a>: Zero-length arrays.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Empty-Structures.html#Empty%20Structures">Empty Structures</a>: Structures with no members.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Variadic-Macros.html#Variadic%20Macros">Variadic Macros</a>: Macros with a variable number of arguments.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Escaped-Newlines.html#Escaped%20Newlines">Escaped Newlines</a>:  Slightly looser rules for escaped newlines.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Subscripting.html#Subscripting">Subscripting</a>: Any array can be subscripted, even if not an lvalue.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Pointer-Arith.html#Pointer%20Arith">Pointer Arith</a>: Arithmetic on <code>void</code>-pointers and function pointers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Initializers.html#Initializers">Initializers</a>: Non-constant initializers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Compound-Literals.html#Compound%20Literals">Compound Literals</a>: Compound literals give structures, unions,
+ or arrays as values.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Designated-Inits.html#Designated%20Inits">Designated Inits</a>: Labeling elements of initializers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Cast-to-Union.html#Cast%20to%20Union">Cast to Union</a>: Casting to union type from any member of the union.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Case-Ranges.html#Case%20Ranges">Case Ranges</a>: `case 1 ... 9' and such.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Mixed-Declarations.html#Mixed%20Declarations">Mixed Declarations</a>: Mixing declarations and code.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Function-Prototypes.html#Function%20Prototypes">Function Prototypes</a>: Prototype declarations and old-style definitions.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/C_002b_002b-Comments.html#C_002b_002b-Comments">C++ Comments</a>: C++ comments are recognized.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Dollar-Signs.html#Dollar%20Signs">Dollar Signs</a>: Dollar sign is allowed in identifiers.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Character-Escapes.html#Character%20Escapes">Character Escapes</a>: <code>\e</code> stands for the character &lt;ESC&gt;.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Alignment.html#Alignment">Alignment</a>: Inquiring about the alignment of a type or variable.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Inline.html#Inline">Inline</a>: Defining inline functions (as fast as macros).</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Alternate-Keywords.html#Alternate%20Keywords">Alternate Keywords</a>: <code>__const__</code>, <code>__asm__</code>, etc., for header files.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Incomplete-Enums.html#Incomplete%20Enums">Incomplete Enums</a>:  <code>enum foo;</code>, with details to follow.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Function-Names.html#Function%20Names">Function Names</a>: Printable strings which are the name of the current function.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Return-Address.html#Return%20Address">Return Address</a>: Getting the return or frame address of a function.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Unnamed-Fields.html#Unnamed%20Fields">Unnamed Fields</a>: Unnamed struct/union fields within structs/unions.</li>
+   <li><a href="http://gcc.gnu.org/onlinedocs/gcc/Attribute-Syntax.html#Attribute%20Syntax">Attribute Syntax</a>: Formal syntax for attributes.</li>
+   </ol></li>
+ 
+ </ul>
+ 
+ <p>If you run into GCC extensions which have not been included in any of these
+ lists, please let us know (also including whether or not they work).</p>
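+ 
+ <p>For a quick check of a few of the extensions and builtins listed above, the
+ following sketch exercises case ranges, statement expressions with
+ <tt>__typeof__</tt>, the omitted middle operand of <tt>?:</tt>, labels as
+ values, and the bit-counting builtins.  All function and macro names are
+ hypothetical, and the code assumes a compiler that accepts the GNU C
+ dialect.</p>

```c
/* Case ranges: classifies a character using `case low ... high'. */
int classify(int c) {
  switch (c) {
  case '0' ... '9': return 1; /* digit */
  case 'a' ... 'z': return 2; /* lower-case letter */
  default:          return 0;
  }
}

/* Statement expression plus typeof: a max that evaluates each argument once. */
#define MAX_ONCE(a, b) \
  ({ __typeof__(a) a_ = (a); __typeof__(b) b_ = (b); a_ > b_ ? a_ : b_; })

int max_int(int x, int y) {
  return MAX_ONCE(x, y);
}

/* Omitted middle operand: yields v if it is non-zero, otherwise fallback. */
int value_or(int v, int fallback) {
  return v ?: fallback;
}

/* Labels as values: a computed goto that dispatches on a small index. */
int dispatch(int which) {
  static void *targets[] = { &&ret_ten, &&ret_twenty };
  if (which < 0 || which > 1)
    return -1;
  goto *targets[which];
ret_ten:
  return 10;
ret_twenty:
  return 20;
}

/* Bit-counting builtins; __builtin_ctz is undefined for 0, so guard it. */
int bit_summary(unsigned x) {
  if (x == 0)
    return 0;
  return __builtin_popcount(x) * 100 + __builtin_ctz(x);
}
```

+ <p>Each function isolates one extension, so a compile failure points at the
+ extension that the front-end in use does not accept.</p>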
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="c++-fe">Known problems with the C++ front-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>For this release, the C++ front-end is considered to be fully
+ tested and works for a number of non-trivial programs, including LLVM
+ itself.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">Bugs</div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li>The C++ front-end inherits all problems afflicting the <a href="#c-fe">C
+     front-end</a>.</li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   Notes
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li>Destructors for local objects are not always run when a <tt>longjmp</tt> is
+     performed. In particular, destructors for objects in the <tt>longjmp</tt>ing
+     function and in the <tt>setjmp</tt> receiver function may not be run.
+     Objects in intervening stack frames will be destroyed, however (which is
+     better than most compilers).</li>
+ 
+ <li>The LLVM C++ front-end follows the <a
+     href="http://www.codesourcery.com/cxx-abi">Itanium C++ ABI</a>.
+     This document, which is not Itanium specific, specifies a standard for name
+     mangling, class layout, v-table layout, RTTI formats, and other C++
+     representation issues.  Because we use this ABI, code generated by the LLVM
+     compilers should be binary compatible with machine code generated by other
+     Itanium ABI C++ compilers (such as G++, the Intel and HP compilers, etc).
+     <i>However</i>, the exception handling mechanism used by LLVM is very
+     different from the model used in the Itanium ABI, so <b>exceptions will not
+     interact correctly</b>. </li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="c-be">Known problems with the C back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li>The C back-end produces code that violates the ANSI C Type-Based Alias
+ Analysis rules.  As such, special options may be necessary to compile the code
+ (for example, GCC requires the <tt>-fno-strict-aliasing</tt> option).  This
+ problem probably cannot be fixed.</li>
+ 
+ <li><a href="http://llvm.org/PR56">Zero arg vararg functions are not 
+ supported</a>.  This should not affect LLVM produced by the C or C++ 
+ frontends.</li>
+ 
+ <li>The C backend does not correctly implement the <a 
+ href="LangRef.html#i_stacksave"><tt>llvm.stacksave</tt></a> or
+ <a href="LangRef.html#i_stackrestore"><tt>llvm.stackrestore</tt></a> 
+ intrinsics.  This means that some code compiled by it can run out of stack
+ space if it depends on these (e.g. C99 variable-length arrays).</li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="x86-be">Known problems with the X86 back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li>none yet.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ppc-be">Known problems with the PowerPC back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li><a href="http://llvm.org/PR642">PowerPC backend does not correctly
+ implement ordered FP comparisons</a>.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="alpha-be">Known problems with the Alpha back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li>On 21164s, some rare FP arithmetic sequences which may trap do not have the
+ appropriate nops inserted to ensure restartability.</li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ia64-be">Known problems with the IA64 back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li>C++ programs are likely to fail on IA64, as calls to <tt>setjmp</tt> are
+ made where the argument is not 16-byte aligned, as required on IA64. (Strictly
+ speaking this is not a bug in the IA64 back-end; it will also be encountered
+ when building C++ programs using the C back-end.)</li>
+ 
+ <li>The C++ front-end does not use <a href="http://llvm.org/PR406">IA64
+ ABI compliant layout of v-tables</a>.  In particular, it just stores function
+ pointers instead of function descriptors in the vtable.  This bug prevents
+ mixing C++ code compiled with LLVM with C++ objects compiled by other C++
+ compilers.</li>
+ 
+ <li>There are a few ABI violations which will lead to problems when mixing LLVM
+ output with code built with other compilers, particularly for floating-point
+ programs.</li>
+ 
+ <li>Defining vararg functions is not supported (but calling them is ok).</li>
+ 
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="sparc-be">Known problems with the SPARC back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li>The SPARC backend only supports the 32-bit SPARC ABI (-m32); it does not
+     support the 64-bit SPARC ABI (-m64).</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="arm-be">Known problems with the ARM back-end</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li>The ARM backend is currently in early development stages; it is not 
+ ready for production use.</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="additionalinfo">Additional Information</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>A wide variety of additional information is available on the <a
+ href="http://llvm.org">LLVM web page</a>, including <a
+ href="http://llvm.org/docs/">documentation</a> and <a
+ href="http://llvm.org/pubs/">publications describing algorithms and
+ components implemented in LLVM</a>.  The web page also contains a version of the
+ API documentation that is up to date with the CVS version of the source code.
+ You can access versions of these documents specific to this release by going
+ into the "<tt>llvm/doc/</tt>" directory in the LLVM tree.</p>
+ 
+ <p>If you have any questions or comments about LLVM, please feel free to contact
+ us via the <a href="http://llvm.org/docs/#maillist"> mailing
+ lists</a>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="http://llvm.org/">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/SourceLevelDebugging.html
diff -c /dev/null llvm-www/releases/1.8/docs/SourceLevelDebugging.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/SourceLevelDebugging.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1762 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Source Level Debugging with LLVM</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">Source Level Debugging with LLVM</div>
+ 
+ <table class="layout" style="width:100%">
+   <tr class="layout">
+     <td class="left">
+ <ul>
+   <li><a href="#introduction">Introduction</a>
+   <ol>
+     <li><a href="#phil">Philosophy behind LLVM debugging information</a></li>
+     <li><a href="#consumers">Debug information consumers</a></li>
+     <li><a href="#debugopt">Debugging optimized code</a></li>
+   </ol></li>
+   <li><a href="#format">Debugging information format</a>
+   <ol>
+     <li><a href="#debug_info_descriptors">Debug information descriptors</a>
+     <ul>
+       <li><a href="#format_anchors">Anchor descriptors</a></li>
+       <li><a href="#format_compile_units">Compile unit descriptors</a></li>
+       <li><a href="#format_global_variables">Global variable descriptors</a></li>
+       <li><a href="#format_subprograms">Subprogram descriptors</a></li>
+       <li><a href="#format_blocks">Block descriptors</a></li>
+       <li><a href="#format_basic_type">Basic type descriptors</a></li>
+       <li><a href="#format_derived_type">Derived type descriptors</a></li>
+       <li><a href="#format_composite_type">Composite type descriptors</a></li>
+       <li><a href="#format_subrange">Subrange descriptors</a></li>
+       <li><a href="#format_enumeration">Enumerator descriptors</a></li>
+       <li><a href="#format_variables">Local variables</a></li>
+     </ul></li>
+     <li><a href="#format_common_intrinsics">Debugger intrinsic functions</a>
+       <ul>
+       <li><a href="#format_common_stoppoint">llvm.dbg.stoppoint</a></li>
+       <li><a href="#format_common_func_start">llvm.dbg.func.start</a></li>
+       <li><a href="#format_common_region_start">llvm.dbg.region.start</a></li>
+       <li><a href="#format_common_region_end">llvm.dbg.region.end</a></li>
+       <li><a href="#format_common_declare">llvm.dbg.declare</a></li>
+     </ul></li>
+     <li><a href="#format_common_stoppoints">Representing stopping points in the
+                                            source program</a></li>
+   </ol></li>
+   <li><a href="#ccxx_frontend">C/C++ front-end specific debug information</a>
+   <ol>
+     <li><a href="#ccxx_compile_units">C/C++ source file information</a></li>
+     <li><a href="#ccxx_global_variable">C/C++ global variable information</a></li>
+     <li><a href="#ccxx_subprogram">C/C++ function information</a></li>
+     <li><a href="#ccxx_basic_types">C/C++ basic types</a></li>
+     <li><a href="#ccxx_derived_types">C/C++ derived types</a></li>
+     <li><a href="#ccxx_composite_types">C/C++ struct/union types</a></li>
+     <li><a href="#ccxx_enumeration_types">C/C++ enumeration types</a></li>
+   </ol></li>
+ </ul>
+ </td>
+ <td class="right">
+ <img src="img/venusflytrap.jpg" alt="A leafy and green bug eater" width="247"
+ height="369">
+ </td>
+ </tr></table>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a>
+             and <a href="mailto:jlaskey at apple.com">Jim Laskey</a></p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="introduction">Introduction</a></div> 
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document is the central repository for all information pertaining to
+ debug information in LLVM.  It describes the <a href="#format">actual format
+ that the LLVM debug information</a> takes, which is useful for those interested
+ in creating front-ends or dealing directly with the information.  Further, this
+ document provides specific examples of what debug information for C/C++ looks
+ like.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="phil">Philosophy behind LLVM debugging information</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The idea of the LLVM debugging information is to capture how the important
+ pieces of the source-language's Abstract Syntax Tree map onto LLVM code.
+ Several design aspects have shaped the solution that appears here.  The
+ important ones are:</p>
+ 
+ <ul>
+ <li>Debugging information should have very little impact on the rest of the
+ compiler.  No transformations, analyses, or code generators should need to be
+ modified because of debugging information.</li>
+ 
+ <li>LLVM optimizations should interact in <a href="#debugopt">well-defined and
+ easily described ways</a> with the debugging information.</li>
+ 
+ <li>Because LLVM is designed to support arbitrary programming languages,
+ LLVM-to-LLVM tools should not need to know anything about the semantics of the
+ source-level-language.</li>
+ 
+ <li>Source-level languages are often <b>widely</b> different from one another.
+ LLVM should not put any restrictions on the flavor of the source-language, and
+ the debugging information should work with any language.</li>
+ 
+ <li>With code generator support, it should be possible to use an LLVM compiler
+ to compile a program to native machine code and standard debugging formats.
+ This allows compatibility with traditional machine-code level debuggers, like
+ GDB or DBX.</li>
+ 
+ </ul>
+ 
+ <p>The approach used by the LLVM implementation is to use a small set of <a
+ href="#format_common_intrinsics">intrinsic functions</a> to define a mapping
+ between LLVM program objects and the source-level objects.  The description of
+ the source-level program is maintained in LLVM global variables in an <a
+ href="#ccxx_frontend">implementation-defined format</a> (the C/C++ front-end
+ currently uses working draft 7 of the <a
+ href="http://www.eagercon.com/dwarf/dwarf3std.htm">Dwarf 3 standard</a>).</p>
+ 
+ <p>When a program is being debugged, a debugger interacts with the user and
+ turns the stored debug information into source-language specific information. 
+ As such, a debugger must be aware of the source-language, and is thus tied to
+ a specific language or family of languages.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="consumers">Debug information consumers</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>The role of debug information is to provide meta information normally
+ stripped away during the compilation process.  This meta information lets an
+ LLVM user relate generated code back to the original program source
+ code.</p>
+ 
+ <p>Currently, debug information is consumed by the DwarfWriter to produce dwarf
+ information used by the gdb debugger.  Other targets could use the same
+ information to produce stabs or other debug forms.</p>
+ 
+ <p>It would also be reasonable to use debug information to feed profiling tools
+ for analysis of generated code, or tools for reconstructing the original source
+ from generated code.</p>
+ 
+ <p>TODO - expound a bit more.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="debugopt">Debugging optimized code</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>An extremely high priority of LLVM debugging information is to make it
+ interact well with optimizations and analysis.  In particular, the LLVM debug
+ information provides the following guarantees:</p>
+ 
+ <ul>
+ 
+ <li>LLVM debug information <b>always provides information to accurately read the
+ source-level state of the program</b>, regardless of which LLVM optimizations
+ have been run, and without any modification to the optimizations themselves.
+ However, some optimizations may impact the ability to modify the current state
+ of the program with a debugger, such as setting program variables, or calling
+ functions that have been deleted.</li>
+ 
+ <li>LLVM optimizations gracefully interact with debugging information.  If they
+ are not aware of debug information, they are automatically disabled as necessary
+ in the cases that would invalidate the debug info.  This retains the LLVM
+ features making it easy to write new transformations.</li>
+ 
+ <li>As desired, LLVM optimizations can be upgraded to be aware of the LLVM
+ debugging information, allowing them to update the debugging information as they
+ perform aggressive optimizations.  This means that, with effort, the LLVM
+ optimizers could optimize debug code just as well as non-debug code.</li>
+ 
+ <li>LLVM debug information does not prevent many important optimizations from
+ happening (for example inlining, basic block reordering/merging/cleanup, tail
+ duplication, etc), further reducing the amount of the compiler that eventually
+ is "aware" of debugging information.</li>
+ 
+ <li>LLVM debug information is automatically optimized along with the rest of the
+ program, using existing facilities.  For example, duplicate information is
+ automatically merged by the linker, and unused information is automatically
+ removed.</li>
+ 
+ </ul>
+ 
+ <p>Basically, the debug information allows you to compile a program with
+ "<tt>-O0 -g</tt>" and get full debug information, allowing you to arbitrarily
+ modify the program as it executes from a debugger.  Compiling a program with
+ "<tt>-O3 -g</tt>" gives you full debug information that is always available and
+ accurate for reading (e.g., you get accurate stack traces despite tail call
+ elimination and inlining), but you might lose the ability to modify the program
+ and call functions which were optimized out of the program, or inlined away
+ completely.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="format">Debugging information format</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM debugging information has been carefully designed to make it possible
+ for the optimizer to optimize the program and debugging information without
+ necessarily having to know anything about debugging information.  In particular,
+ the global constant merging pass automatically eliminates duplicated debugging
+ information (often caused by header files), the global dead code elimination
+ pass automatically deletes debugging information for a function if it decides to
+ delete the function, and the linker eliminates debug information when it merges
+ <tt>linkonce</tt> functions.</p>
+ 
+ <p>To do this, most of the debugging information (descriptors for types,
+ variables, functions, source files, etc) is inserted by the language front-end
+ in the form of LLVM global variables.  These LLVM global variables are no
+ different from any other global variables, except that they have a web of LLVM
+ intrinsic functions that point to them.  If the last references to a particular
+ piece of debugging information are deleted (for example, by the
+ <tt>-globaldce</tt> pass), the extraneous debug information will automatically
+ become dead and be removed by the optimizer.</p>
+ 
+ <p>Debug information is designed to be agnostic about the target debugger and
+ debugging information representation (e.g. DWARF/Stabs/etc).  It uses a generic
+ machine debug information pass to decode the information that represents
+ variables, types, functions, namespaces, etc: this allows for arbitrary
+ source-language semantics and type-systems to be used, as long as there is a
+ module written for the target debugger to interpret the information. In
+ addition, debug global variables are declared in the <tt>"llvm.metadata"</tt>
+ section.  All values declared in this section are stripped away after target
+ debug information is constructed and before the program object is emitted.</p>
+ 
+ <p>To provide basic functionality, the LLVM debugger does have to make some
+ assumptions about the source-level language being debugged, though it keeps
+ these to a minimum.  The only common features that the LLVM debugger assumes
+ exist are <a href="#format_compile_units">source files</a>, and <a
+ href="#format_global_variables">program objects</a>.  These abstract objects are
+ used by a debugger to form stack traces, show information about local
+ variables, etc.</p>
+ 
+ <p>This section of the documentation first describes the representation aspects
+ common to any source-language.  The <a href="#ccxx_frontend">next section</a>
+ describes the data layout conventions used by the C and C++ front-ends.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="debug_info_descriptors">Debug information descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>In consideration of the complexity and volume of debug information, LLVM
+ provides a specification for well formed debug global variables.  The constant
+ value of each of these globals is one of a limited set of structures, known as
+ debug descriptors.</p>
+ 
+ <p>Consumers of LLVM debug information expect the descriptors for program
+ objects to start in a canonical format, but the descriptors can include
+ additional information appended at the end that is source-language specific. All
+ LLVM debugging information is versioned, allowing backwards compatibility in the
+ case that the core structures need to change in some way.  Also, all debugging
+ information objects start with a tag to indicate what type of object it is.  The
+ source-language is allowed to define its own objects, by using unreserved tag
+ numbers.  We recommend using tags in the range 0x1000 through 0x2000 (there is
+ a defined enum DW_TAG_user_base = 0x1000.)</p>
+ 
+ <p>The fields of debug descriptors used internally by LLVM (MachineDebugInfo)
+ are restricted to only the simple data types <tt>int</tt>, <tt>uint</tt>,
+ <tt>bool</tt>, <tt>float</tt>, <tt>double</tt>, <tt>sbyte*</tt> and <tt> { }*
+ </tt>.  References to arbitrary values are handled using a <tt> { }* </tt> and a
+ cast to <tt> { }* </tt> expression; typically references to other field
+ descriptors, arrays of descriptors or global variables.</p>
+ 
+ <pre>
+   %llvm.dbg.object.type = type {
+     uint,   ;; A tag
+     ...
+   }
+ </pre>
+ 
+ <p><a name="LLVMDebugVersion">The first field of a descriptor is always a
+ <tt>uint</tt> containing a tag value identifying the content of the descriptor.
+ The remaining fields are specific to the descriptor.  The values of tags are
+ loosely bound to the tag values of Dwarf information entries.  However, that
+ does not restrict the use of the information supplied to Dwarf targets.  To
+ facilitate versioning of debug information, the tag is augmented with the
+ current debug version (LLVMDebugVersion = 4 &lt;&lt; 16 or 0x40000 or 262144).</a></p>
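+ 
+ <p>As a worked illustration of this arithmetic, assuming the debug version of 4
+ stated above, the tag fields of a few of the descriptors described below would
+ hold the following values:</p>
+ 
+ <pre>
+   ;; LLVMDebugVersion = 4 &lt;&lt; 16 = 262144
+   uint 262161   ;; 17 + 262144 (DW_TAG_compile_unit)
+   uint 262196   ;; 52 + 262144 (DW_TAG_variable)
+   uint 262190   ;; 46 + 262144 (DW_TAG_subprogram)
+   uint 262180   ;; 36 + 262144 (DW_TAG_base_type)
+ </pre>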
+ 
+ <p>The details of the various descriptors follow.</p>  
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_anchors">Anchor descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_anchors">llvm.dbg.anchor.type</a> = type {
+     uint,   ;; Tag = 0 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a>
+     uint    ;; Tag of descriptors grouped by the anchor
+   }
+ </pre>
+ 
+ <p>One important aspect of the LLVM debug representation is that it allows the
+ LLVM debugger to efficiently index all of the global objects without having to
+ scan the program.  To do this, all of the global objects use "anchor"
+ descriptors with designated names.  All of the global objects of a particular
+ type (e.g., compile units) contain a pointer to the anchor.  This pointer allows
+ a debugger to use def-use chains to find all global objects of that type.</p>
+ 
+ <p>The following names are recognized as anchors by LLVM:</p>
+ 
+ <pre>
+   %<a href="#format_compile_units">llvm.dbg.compile_units</a>       = linkonce constant %<a href="#format_anchors">llvm.dbg.anchor.type</a>  { uint 0, uint 17 } ;; DW_TAG_compile_unit
+   %<a href="#format_global_variables">llvm.dbg.global_variables</a>    = linkonce constant %<a href="#format_anchors">llvm.dbg.anchor.type</a>  { uint 0, uint 52 } ;; DW_TAG_variable
+   %<a href="#format_subprograms">llvm.dbg.subprograms</a>         = linkonce constant %<a href="#format_anchors">llvm.dbg.anchor.type</a>  { uint 0, uint 46 } ;; DW_TAG_subprogram
+ </pre>
+ 
+ <p>Using anchors in this way (where the compile unit descriptor points to the
+ anchors, as opposed to having a list of compile unit descriptors) allows for the
+ standard dead global elimination and merging passes to automatically remove
+ unused debugging information.  If the globals were kept track of through lists,
+ there would always be an object pointing to the descriptors, so they would never
+ be deleted.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_compile_units">Compile unit descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_compile_units">llvm.dbg.compile_unit.type</a> = type {
+     uint,   ;; Tag = 17 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_compile_unit)
+     {  }*,  ;; Compile unit anchor = cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_units</a> to {  }*)
+     uint,   ;; Dwarf language identifier (ex. DW_LANG_C89) 
+     sbyte*, ;; Source file name
+     sbyte*, ;; Source file directory (includes trailing slash)
+     sbyte*  ;; Producer (ex. "4.0.1 LLVM (LLVM research group)")
+   }
+ </pre>
+ 
+ <p>These descriptors contain a source language ID for the file (we use the Dwarf
+ 3.0 ID numbers, such as <tt>DW_LANG_C89</tt>, <tt>DW_LANG_C_plus_plus</tt>,
+ <tt>DW_LANG_Cobol74</tt>, etc), three strings describing the filename, working
+ directory of the compiler, and an identifier string for the compiler that
+ produced it.</p>
+ 
+ <p> Compile unit descriptors provide the root context for objects declared in a
+ specific source file.  Global variables and top level functions would be defined
+ using this context.  Compile unit descriptors also provide context for source
+ line correspondence.</p>  
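+ 
+ <p>A minimal sketch of one such descriptor follows, for a hypothetical C source
+ file <tt>/tmp/hello.c</tt>.  The global names (<tt>%llvm.dbg.compile_unit</tt>,
+ <tt>%str.file</tt>, <tt>%str.dir</tt>, <tt>%str.producer</tt>), the string
+ contents and the <tt>internal</tt> linkage are assumptions made for
+ illustration; the language identifier 1 is <tt>DW_LANG_C89</tt>:</p>
+ 
+ <pre>
+   %str.file     = internal constant [8 x sbyte] c"hello.c\00", section "llvm.metadata"
+   %str.dir      = internal constant [6 x sbyte] c"/tmp/\00", section "llvm.metadata"
+   %str.producer = internal constant [17 x sbyte] c"example producer\00", section "llvm.metadata"
+ 
+   %llvm.dbg.compile_unit = internal constant %llvm.dbg.compile_unit.type {
+     uint 262161,   ;; 17 + LLVMDebugVersion (DW_TAG_compile_unit)
+     {  }* cast (%llvm.dbg.anchor.type* %llvm.dbg.compile_units to {  }*),
+     uint 1,        ;; DW_LANG_C89
+     sbyte* getelementptr ([8 x sbyte]* %str.file, int 0, int 0),
+     sbyte* getelementptr ([6 x sbyte]* %str.dir, int 0, int 0),
+     sbyte* getelementptr ([17 x sbyte]* %str.producer, int 0, int 0)
+   }, section "llvm.metadata"
+ </pre>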
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_global_variables">Global variable descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_global_variables">llvm.dbg.global_variable.type</a> = type {
+     uint,   ;; Tag = 52 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_variable)
+     {  }*,  ;; Global variable anchor = cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_global_variables">llvm.dbg.global_variables</a> to {  }*),  
+     {  }*,  ;; Reference to context descriptor
+     sbyte*, ;; Name
+     {  }*,  ;; Reference to compile unit where defined
+     uint,   ;; Line number where defined
+     {  }*,  ;; Reference to type descriptor
+     bool,   ;; True if the global is local to compile unit (static)
+     bool,   ;; True if the global is defined in the compile unit (not extern)
+     {  }*   ;; Reference to the global variable
+   }
+ </pre>
+ 
+ <p>These descriptors provide debug information about global variables.  They
+ provide details such as name, type and where the variable is defined.</p>
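+ 
+ <p>As an illustrative sketch, a global <tt>int MyGlobal</tt> defined on line 1
+ of a source file might be described as follows.  The example assumes a compile
+ unit descriptor named <tt>%llvm.dbg.compile_unit</tt> and a basic type
+ descriptor named <tt>%llvm.dbg.basictype.int</tt>; these names, the string
+ global and the initial value are hypothetical:</p>
+ 
+ <pre>
+   %MyGlobal = global int 100
+   %str.name = internal constant [9 x sbyte] c"MyGlobal\00", section "llvm.metadata"
+ 
+   %llvm.dbg.global_variable = internal constant %llvm.dbg.global_variable.type {
+     uint 262196,   ;; 52 + LLVMDebugVersion (DW_TAG_variable)
+     {  }* cast (%llvm.dbg.anchor.type* %llvm.dbg.global_variables to {  }*),
+     {  }* cast (%llvm.dbg.compile_unit.type* %llvm.dbg.compile_unit to {  }*),   ;; context
+     sbyte* getelementptr ([9 x sbyte]* %str.name, int 0, int 0),                 ;; name
+     {  }* cast (%llvm.dbg.compile_unit.type* %llvm.dbg.compile_unit to {  }*),   ;; compile unit
+     uint 1,        ;; line number
+     {  }* cast (%llvm.dbg.basictype.type* %llvm.dbg.basictype.int to {  }*),     ;; type
+     bool false,    ;; not static
+     bool true,     ;; defined in this compile unit
+     {  }* cast (int* %MyGlobal to {  }*)
+   }, section "llvm.metadata"
+ </pre>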
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_subprograms">Subprogram descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_subprograms">llvm.dbg.subprogram.type</a> = type {
+     uint,   ;; Tag = 46 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_subprogram)
+     {  }*,  ;; Subprogram anchor = cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_subprograms">llvm.dbg.subprograms</a> to {  }*),  
+     {  }*,  ;; Reference to context descriptor
+     sbyte*, ;; Name
+     {  }*,  ;; Reference to compile unit where defined
+     uint,   ;; Line number where defined
+     {  }*,  ;; Reference to type descriptor
+     bool,   ;; True if the global is local to compile unit (static)
+     bool    ;; True if the global is defined in the compile unit (not extern)
+   }
+ </pre>
+ 
+ <p>These descriptors provide debug information about functions, methods and
+ subprograms.  They provide details such as name, return types and the source
+ location where the subprogram is defined.</p>
+ 
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_blocks">Block descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_blocks">llvm.dbg.block</a> = type {
+     uint,   ;; Tag = 11 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_lexical_block)
+     {  }*   ;; Reference to context descriptor
+   }
+ </pre>
+ 
+ <p>These descriptors provide debug information about nested blocks within a
+ subprogram.  The array of member descriptors is used to define local variables
+ and deeper nested blocks.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_basic_type">Basic type descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_basic_type">llvm.dbg.basictype.type</a> = type {
+     uint,   ;; Tag = 36 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_base_type)
+     {  }*,  ;; Reference to context (typically a compile unit)
+     sbyte*, ;; Name (may be "" for anonymous types)
+     {  }*,  ;; Reference to compile unit where defined (may be NULL)
+     uint,   ;; Line number where defined (may be 0)
+     uint,   ;; Size in bits
+     uint,   ;; Alignment in bits
+     uint,   ;; Offset in bits
+     uint    ;; Dwarf type encoding
+   }
+ </pre>
+ 
+ <p>These descriptors define primitive types used in the code, for example int,
+ bool and float.  The context provides the scope of the type, which is usually the
+ top level.  Since basic types are not usually user defined, the compile unit and
+ line number can be left as NULL and 0.  The size, alignment and offset are
+ expressed in bits and can be 64 bit values.  The alignment is used to round the
+ offset when embedded in a <a href="#format_composite_type">composite type</a>
+ (for example, to keep doubles on 64 bit boundaries).  The offset is the bit
+ offset if embedded in a <a href="#format_composite_type">composite
+ type</a>.</p>
+ 
+ <p>The type encoding provides the details of the type.  The values are typically
+ one of the following:</p>
+ 
+ <pre>
+   DW_ATE_address = 1
+   DW_ATE_boolean = 2
+   DW_ATE_float = 4
+   DW_ATE_signed = 5
+   DW_ATE_signed_char = 6
+   DW_ATE_unsigned = 7
+   DW_ATE_unsigned_char = 8
+ </pre>
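+ 
+ <p>For illustration, a 32 bit signed <tt>int</tt> might be described as shown
+ here.  The descriptor and string names are hypothetical, the compile unit
+ descriptor <tt>%llvm.dbg.compile_unit</tt> is assumed to exist, and the size
+ and alignment assume a target with 32 bit integers:</p>
+ 
+ <pre>
+   %str.int = internal constant [4 x sbyte] c"int\00", section "llvm.metadata"
+ 
+   %llvm.dbg.basictype.int = internal constant %llvm.dbg.basictype.type {
+     uint 262180,   ;; 36 + LLVMDebugVersion (DW_TAG_base_type)
+     {  }* cast (%llvm.dbg.compile_unit.type* %llvm.dbg.compile_unit to {  }*),
+     sbyte* getelementptr ([4 x sbyte]* %str.int, int 0, int 0),
+     {  }* null,    ;; no defining compile unit
+     uint 0,        ;; no line number
+     uint 32,       ;; size in bits
+     uint 32,       ;; alignment in bits
+     uint 0,        ;; offset in bits
+     uint 5         ;; DW_ATE_signed
+   }, section "llvm.metadata"
+ </pre>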
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_derived_type">Derived type descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> = type {
+     uint,   ;; Tag (see below)
+     {  }*,  ;; Reference to context
+     sbyte*, ;; Name (may be "" for anonymous types)
+     {  }*,  ;; Reference to compile unit where defined (may be NULL)
+     uint,   ;; Line number where defined (may be 0)
+     uint,   ;; Size in bits
+     uint,   ;; Alignment in bits
+     uint,   ;; Offset in bits
+     {  }*   ;; Reference to type derived from
+   }
+ </pre>
+ 
+ <p>These descriptors are used to define types derived from other types.  The
+ value of the tag varies depending on the meaning.  The following are possible
+ tag values:</p>
+ 
+ <pre>
+   DW_TAG_formal_parameter = 5
+   DW_TAG_member = 13
+   DW_TAG_pointer_type = 15
+   DW_TAG_reference_type = 16
+   DW_TAG_typedef = 22
+   DW_TAG_const_type = 38
+   DW_TAG_volatile_type = 53
+   DW_TAG_restrict_type = 55
+ </pre>
+ 
+ <p> <tt>DW_TAG_member</tt> is used to define a member of a <a
+ href="#format_composite_type">composite type</a> or <a
+ href="#format_subprograms">subprogram</a>.  The type of the member is the <a
+ href="#format_derived_type">derived type</a>. <tt>DW_TAG_formal_parameter</tt>
+ is used to define a member which is a formal argument of a subprogram.</p>
+ 
+ <p><tt>DW_TAG_typedef</tt> is used to
+ provide a name for the derived type.</p>
+ 
+ <p><tt>DW_TAG_pointer_type</tt>,
+ <tt>DW_TAG_reference_type</tt>, <tt>DW_TAG_const_type</tt>,
+ <tt>DW_TAG_volatile_type</tt> and <tt>DW_TAG_restrict_type</tt> are used to
+ qualify the <a href="#format_derived_type">derived type</a>. </p>
+ 
+ <p><a href="#format_derived_type">Derived type</a> location can be determined
+ from the compile unit and line number.  The size, alignment and offset are
+ expressed in bits and can be 64 bit values.  The alignment is used to round the
+ offset when embedded in a <a href="#format_composite_type">composite type</a>
+ (for example, to keep doubles on 64 bit boundaries).  The offset is the bit
+ offset if embedded in a <a href="#format_composite_type">composite
+ type</a>.</p>
+ 
+ <p>Note that the <tt>void *</tt> type is expressed as a
+ <tt>llvm.dbg.derivedtype.type</tt> with tag of <tt>DW_TAG_pointer_type</tt> and
+ NULL derived type.</p>
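+ 
+ <p>A sketch of an anonymous <tt>int *</tt> type follows, under the assumptions
+ that pointers are 32 bits wide, that the tag is augmented with the debug version
+ like any other, and that descriptors named <tt>%llvm.dbg.compile_unit</tt> and
+ <tt>%llvm.dbg.basictype.int</tt> exist (all names here are hypothetical).
+ Replacing the last field with <tt>{  }* null</tt> would give the <tt>void *</tt>
+ form described above:</p>
+ 
+ <pre>
+   %str.empty = internal constant [1 x sbyte] c"\00", section "llvm.metadata"
+ 
+   %llvm.dbg.derivedtype.intptr = internal constant %llvm.dbg.derivedtype.type {
+     uint 262159,   ;; 15 + LLVMDebugVersion (DW_TAG_pointer_type)
+     {  }* cast (%llvm.dbg.compile_unit.type* %llvm.dbg.compile_unit to {  }*),
+     sbyte* getelementptr ([1 x sbyte]* %str.empty, int 0, int 0),   ;; anonymous
+     {  }* null,    ;; no defining compile unit
+     uint 0,        ;; no line number
+     uint 32,       ;; size in bits
+     uint 32,       ;; alignment in bits
+     uint 0,        ;; offset in bits
+     {  }* cast (%llvm.dbg.basictype.type* %llvm.dbg.basictype.int to {  }*)
+   }, section "llvm.metadata"
+ </pre>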
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_composite_type">Composite type descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_composite_type">llvm.dbg.compositetype.type</a> = type {
+     uint,   ;; Tag (see below)
+     {  }*,  ;; Reference to context
+     sbyte*, ;; Name (may be "" for anonymous types)
+     {  }*,  ;; Reference to compile unit where defined (may be NULL)
+     uint,   ;; Line number where defined (may be 0)
+     uint,   ;; Size in bits
+     uint,   ;; Alignment in bits
+     uint,   ;; Offset in bits
+     {  }*   ;; Reference to array of member descriptors
+   }
+ </pre>
+ 
+ <p>These descriptors are used to define types that are composed of 0 or more
+ elements.  The value of the tag varies depending on the meaning.  The following
+ are possible tag values:</p>
+ 
+ <pre>
+   DW_TAG_array_type = 1
+   DW_TAG_enumeration_type = 4
+   DW_TAG_structure_type = 19
+   DW_TAG_union_type = 23
+   DW_TAG_vector_type = 259
+   DW_TAG_subroutine_type = 21
+ </pre>
+ 
+ <p>The vector flag indicates that an array type is a native packed vector.</p>
+ 
+ <p>The members of array types (tag = <tt>DW_TAG_array_type</tt>) or vector types
+ (tag = <tt>DW_TAG_vector_type</tt>) are <a href="#format_subrange">subrange
+ descriptors</a>, each representing the range of subscripts at that level of
+ indexing.</p>
+ 
+ <p>The members of enumeration types (tag = <tt>DW_TAG_enumeration_type</tt>) are
+ <a href="#format_enumeration">enumerator descriptors</a>, each representing the
+ definition of one enumeration value in the set.</p>
+ 
+ <p>The members of structure (tag = <tt>DW_TAG_structure_type</tt>) or union (tag
+ = <tt>DW_TAG_union_type</tt>) types are any one of the <a
+ href="#format_basic_type">basic</a>, <a href="#format_derived_type">derived</a>
+ or <a href="#format_composite_type">composite</a> type descriptors, each
+ representing a field member of the structure or union.</p>
+ 
+ <p>The first member of subroutine (tag = <tt>DW_TAG_subroutine_type</tt>)
+ type elements is the return type for the subroutine.  The remaining
+ elements are the formal arguments to the subroutine.</p>
+ 
+ <p><a href="#format_composite_type">Composite type</a> location can be
+ determined from the compile unit and line number.  The size, alignment and
+ offset are expressed in bits and can be 64 bit values.  The alignment is used to
+ round the offset when embedded in a <a href="#format_composite_type">composite
+ type</a> (as an example, to keep doubles on 64 bit boundaries).  The offset
+ is the bit offset if embedded in a <a href="#format_composite_type">composite
+ type</a>.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_subrange">Subrange descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_subrange">llvm.dbg.subrange.type</a> = type {
+     uint,   ;; Tag = 33 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_subrange_type)
+     uint,   ;; Low value
+     uint    ;; High value
+   }
+ </pre>
+ 
+ <p>These descriptors are used to define ranges of array subscripts for an array
+ <a href="#format_composite_type">composite type</a>.  The low value defines the
+ lower bound, typically zero for C/C++.  The high value is the upper bound.
+ Values are 64 bit.  High - low + 1 is the size of the array.  If
+ low == high the array will be unbounded.</p>
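+ 
+ <p>As a small worked example, the single dimension of a C array declared as
+ <tt>int a[10]</tt> could be described as follows (the descriptor name is
+ hypothetical, and the field types follow the structure shown above):</p>
+ 
+ <pre>
+   %llvm.dbg.subrange = internal constant %llvm.dbg.subrange.type {
+     uint 262177,   ;; 33 + LLVMDebugVersion (DW_TAG_subrange_type)
+     uint 0,        ;; low value
+     uint 9         ;; high value; 9 - 0 + 1 = 10 elements
+   }, section "llvm.metadata"
+ </pre>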
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_enumeration">Enumerator descriptors</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   %<a href="#format_enumeration">llvm.dbg.enumerator.type</a> = type {
+     uint,   ;; Tag = 40 + <a href="#LLVMDebugVersion">LLVMDebugVersion</a> (DW_TAG_enumerator)
+     sbyte*, ;; Name
+     uint    ;; Value
+   }
+ </pre>
+ 
+ <p>These descriptors are used to define members of an enumeration <a
+ href="#format_composite_type">composite type</a>.  Each one associates a name
+ with a value.</p>
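+ 
+ <p>For instance, given a hypothetical C declaration <tt>enum Color { Red, Green
+ }</tt>, the enumerator <tt>Green</tt> might be sketched like this (the global
+ names are assumptions):</p>
+ 
+ <pre>
+   %str.green = internal constant [6 x sbyte] c"Green\00", section "llvm.metadata"
+ 
+   %llvm.dbg.enumerator.green = internal constant %llvm.dbg.enumerator.type {
+     uint 262184,   ;; 40 + LLVMDebugVersion (DW_TAG_enumerator)
+     sbyte* getelementptr ([6 x sbyte]* %str.green, int 0, int 0),
+     uint 1         ;; value of Green
+   }, section "llvm.metadata"
+ </pre>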
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_variables">Local variables</a>
+ </div>
+ 
+ <div class="doc_text">
+ <pre>
+   %<a href="#format_variables">llvm.dbg.variable.type</a> = type {
+     uint,    ;; Tag (see below)
+     {  }*,   ;; Context
+     sbyte*,  ;; Name
+     {  }*,   ;; Reference to compile unit where defined
+     uint,    ;; Line number where defined
+     {  }*    ;; Type descriptor
+   }
+ </pre>
+ 
+ <p>These descriptors are used to define variables local to a subprogram.  The
+ value of the tag depends on the usage of the variable:</p>
+ 
+ <pre>
+   DW_TAG_auto_variable = 256
+   DW_TAG_arg_variable = 257
+   DW_TAG_return_variable = 258
+ </pre>
+ 
+ <p>An auto variable is any variable declared in the body of the function.  An
+ argument variable is any variable that appears as a formal argument to the
+ function.  A return variable is used to track the result of a function and has
+ no source correspondent.</p>
+ 
+ <p>The context is either the subprogram or block where the variable is defined.
+ Name is the source variable name.  Compile unit and line indicate where the
+ variable was defined. Type descriptor defines the declared type of the
+ variable.</p>
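+ 
+ <p>A sketch of the descriptor for an auto variable <tt>int X</tt> declared on
+ line 2 of a function follows.  It assumes that the tag is augmented with the
+ debug version as for other descriptors, and that descriptors named
+ <tt>%llvm.dbg.subprogram</tt>, <tt>%llvm.dbg.compile_unit</tt> and
+ <tt>%llvm.dbg.basictype.int</tt> exist; all of these names are hypothetical:</p>
+ 
+ <pre>
+   %str.X = internal constant [2 x sbyte] c"X\00", section "llvm.metadata"
+ 
+   %llvm.dbg.variable.X = internal constant %llvm.dbg.variable.type {
+     uint 262400,   ;; 256 + LLVMDebugVersion (DW_TAG_auto_variable)
+     {  }* cast (%llvm.dbg.subprogram.type* %llvm.dbg.subprogram to {  }*),       ;; context
+     sbyte* getelementptr ([2 x sbyte]* %str.X, int 0, int 0),                    ;; name
+     {  }* cast (%llvm.dbg.compile_unit.type* %llvm.dbg.compile_unit to {  }*),   ;; compile unit
+     uint 2,        ;; line number
+     {  }* cast (%llvm.dbg.basictype.type* %llvm.dbg.basictype.int to {  }*)      ;; type
+   }, section "llvm.metadata"
+ </pre>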
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="format_common_intrinsics">Debugger intrinsic functions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM uses several intrinsic functions (name prefixed with "llvm.dbg") to
+ provide debug information at various points in generated code.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_common_stoppoint">llvm.dbg.stoppoint</a>
+ </div>
+ 
+ <div class="doc_text">
+ <pre>
+   void %<a href="#format_common_stoppoint">llvm.dbg.stoppoint</a>( uint, uint, { }* )
+ </pre>
+ 
+ <p>This intrinsic is used to provide correspondence between the source file and
+ the generated code.  The first argument is the line number (base 1), the second
+ argument is the column number (0 if unknown) and the third argument the source
+ <tt>%<a href="#format_compile_units">llvm.dbg.compile_unit</a>*</tt> cast to a
+ <tt>{ }*</tt>.  Code following a call to this intrinsic will have been defined
+ in close proximity of the line, column and file.  This information holds until
+ the next call to <tt>%<a
+ href="#format_common_stoppoint">llvm.dbg.stoppoint</a></tt>.</p>
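+ 
+ <p>As a sketch, a stop point for line 2 with an unknown column could be emitted
+ as follows, assuming a compile unit descriptor global named
+ <tt>%llvm.dbg.compile_unit</tt> (a hypothetical name):</p>
+ 
+ <pre>
+   declare void %llvm.dbg.stoppoint( uint, uint, { }* )
+ 
+   ;; Line 2, column unknown, in the file described by %llvm.dbg.compile_unit.
+   call void %llvm.dbg.stoppoint( uint 2, uint 0, { }* cast (%llvm.dbg.compile_unit.type* %llvm.dbg.compile_unit to { }*) )
+ </pre>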
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_common_func_start">llvm.dbg.func.start</a>
+ </div>
+ 
+ <div class="doc_text">
+ <pre>
+   void %<a href="#format_common_func_start">llvm.dbg.func.start</a>( { }* )
+ </pre>
+ 
+ <p>This intrinsic is used to link the debug information in <tt>%<a
+ href="#format_subprograms">llvm.dbg.subprogram</a></tt> to the function. It also
+ defines the beginning of the function's declarative region (scope.)  The
+ intrinsic should be called early in the function after all the alloca
+ instructions.  It should be paired off with a closing <tt>%<a
+ href="#format_common_region_end">llvm.dbg.region.end</a></tt>.  The function's
+ single argument is the <tt>%<a
+ href="#format_subprograms">llvm.dbg.subprogram.type</a></tt>.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_common_region_start">llvm.dbg.region.start</a>
+ </div>
+ 
+ <div class="doc_text">
+ <pre>
+   void %<a href="#format_common_region_start">llvm.dbg.region.start</a>( { }* )
+ </pre>
+ 
+ <p>This intrinsic is used to define the beginning of a declarative scope (ex.
+ block) for local language elements.  It should be paired off with a closing
+ <tt>%<a href="#format_common_region_end">llvm.dbg.region.end</a></tt>.  The
+ function's single argument is the <tt>%<a
+ href="#format_blocks">llvm.dbg.block</a></tt> which is starting.</p>
+ 
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_common_region_end">llvm.dbg.region.end</a>
+ </div>
+ 
+ <div class="doc_text">
+ <pre>
+   void %<a href="#format_common_region_end">llvm.dbg.region.end</a>( { }* )
+ </pre>
+ 
+ <p>This intrinsic is used to define the end of a declarative scope (ex. block)
+ for local language elements.  It should be paired off with an opening <tt>%<a
+ href="#format_common_region_start">llvm.dbg.region.start</a></tt> or <tt>%<a
+ href="#format_common_func_start">llvm.dbg.func.start</a></tt>.  The function's
+ single argument is either the <tt>%<a
+ href="#format_blocks">llvm.dbg.block</a></tt> or the <tt>%<a
+ href="#format_subprograms">llvm.dbg.subprogram.type</a></tt> which is
+ ending.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="format_common_declare">llvm.dbg.declare</a>
+ </div>
+ 
+ <div class="doc_text">
+ <pre>
+   void %<a href="#format_common_declare">llvm.dbg.declare</a>( { }*, { }* )
+ </pre>
+ 
+ <p>This intrinsic provides information about a local element (ex. variable.) The
+ first argument is the alloca for the variable, cast to a <tt>{ }*</tt>. The
+ second argument is the <tt>%<a
+ href="#format_variables">llvm.dbg.variable</a></tt> containing the description
+ of the variable, also cast to a <tt>{ }*</tt>.</p>
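+ 
+ <p>A minimal sketch of such a call for a local <tt>int X</tt> follows.  The
+ descriptor name <tt>%llvm.dbg.variable.X</tt> and the temporary
+ <tt>%X.addr</tt> are hypothetical.  Because the alloca is an instruction result
+ rather than a constant, it is cast with a separate instruction, while the
+ descriptor, being a global, can be cast with a constant expression:</p>
+ 
+ <pre>
+   %X = alloca int
+   %X.addr = cast int* %X to { }*
+   call void %llvm.dbg.declare( { }* %X.addr, { }* cast (%llvm.dbg.variable.type* %llvm.dbg.variable.X to { }*) )
+ </pre>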
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="format_common_stoppoints">
+      Representing stopping points in the source program
+   </a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>LLVM debugger "stop points" are a key part of the debugging representation
+ that allows LLVM to maintain simple semantics for <a
+ href="#debugopt">debugging optimized code</a>.  The basic idea is that the
+ front-end inserts calls to the <a
+ href="#format_common_stoppoint">%<tt>llvm.dbg.stoppoint</tt></a> intrinsic
+ function at every point in the program where a debugger should be able to
+ inspect the program (these correspond to places a debugger stops when you
+ "<tt>step</tt>" through it).  The front-end can choose to place these as
+ fine-grained as it would like (for example, before every subexpression
+ evaluated), but it is recommended to only put them after every source statement
+ that includes executable code.</p>
+ 
+ <p>Using calls to this intrinsic function to demarcate legal points for the
+ debugger to inspect the program automatically disables any optimizations that
+ could potentially confuse debugging information.  To non-debug-information-aware
+ transformations, these calls simply look like calls to an external function,
+ which they must assume can do anything (including reading or writing to any part
+ of reachable memory).  On the other hand, it does not impact many optimizations,
+ such as code motion of non-trapping instructions, nor does it impact
+ optimization of subexpressions, code duplication transformations, or basic-block
+ reordering transformations.</p>
+ 
+ </div>
+ 
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="format_common_lifetime">Object lifetimes and scoping</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>In many languages, the local variables in functions can have their lifetime
+ or scope limited to a subset of a function.  In the C family of languages, for
+ example, variables are only live (readable and writable) within the source block
+ that they are defined in.  In functional languages, values are only readable
+ after they have been defined.  Though this is a very obvious concept, it is also
+ non-trivial to model in LLVM, because it has no notion of scoping in this sense,
+ and does not want to be tied to a language's scoping rules.</p>
+ 
+ <p>In order to handle this, the LLVM debug format uses the notion of "regions"
+ of a function, delineated by calls to intrinsic functions.  These intrinsic
+ functions define new regions of the program and indicate when the region
+ lifetime expires.  Consider the following C fragment, for example:</p>
+ 
+ <pre>
+ 1.  void foo() {
+ 2.    int X = ...;
+ 3.    int Y = ...;
+ 4.    {
+ 5.      int Z = ...;
+ 6.      ...
+ 7.    }
+ 8.    ...
+ 9.  }
+ </pre>
+ 
+ <p>Compiled to LLVM, this function would be represented like this:</p>
+ 
+ <pre>
+ void %foo() {
+ entry:
+     %X = alloca int
+     %Y = alloca int
+     %Z = alloca int
+     
+     ...
+     
+     call void %<a href="#format_common_func_start">llvm.dbg.func.start</a>( %<a href="#format_subprograms">llvm.dbg.subprogram.type</a>* %llvm.dbg.subprogram )
+     
+     call void %<a href="#format_common_stoppoint">llvm.dbg.stoppoint</a>( uint 2, uint 2, %<a href="#format_compile_units">llvm.dbg.compile_unit</a>* %llvm.dbg.compile_unit )
+     
+     call void %<a href="#format_common_declare">llvm.dbg.declare</a>({}* %X, ...)
+     call void %<a href="#format_common_declare">llvm.dbg.declare</a>({}* %Y, ...)
+     
+     <i>;; Evaluate expression on line 2, assigning to X.</i>
+     
+     call void %<a href="#format_common_stoppoint">llvm.dbg.stoppoint</a>( uint 3, uint 2, %<a href="#format_compile_units">llvm.dbg.compile_unit</a>* %llvm.dbg.compile_unit )
+     
+     <i>;; Evaluate expression on line 3, assigning to Y.</i>
+     
+     call void %<a href="#format_common_region_start">llvm.region.start</a>()
+     call void %<a href="#format_common_stoppoint">llvm.dbg.stoppoint</a>( uint 5, uint 4, %<a href="#format_compile_units">llvm.dbg.compile_unit</a>* %llvm.dbg.compile_unit )
+     call void %<a href="#format_common_declare">llvm.dbg.declare</a>({}* %Z, ...)
+     
+     <i>;; Evaluate expression on line 5, assigning to Z.</i>
+     
+     call void %<a href="#format_common_stoppoint">llvm.dbg.stoppoint</a>( uint 7, uint 2, %<a href="#format_compile_units">llvm.dbg.compile_unit</a>* %llvm.dbg.compile_unit )
+     call void %<a href="#format_common_region_end">llvm.region.end</a>()
+     
+     call void %<a href="#format_common_stoppoint">llvm.dbg.stoppoint</a>( uint 9, uint 2, %<a href="#format_compile_units">llvm.dbg.compile_unit</a>* %llvm.dbg.compile_unit )
+     
+     call void %<a href="#format_common_region_end">llvm.region.end</a>()
+     
+     ret void
+ }
+ </pre>
+ 
+ <p>This example illustrates a few important details about the LLVM debugging
+ information.  In particular, it shows how the various intrinsics are applied
+ together to allow a debugger to analyze the relationship between statements,
+ variable definitions, and the code used to implement the function.</p>
+ 
+ <p>The first intrinsic <tt>%<a
+ href="#format_common_func_start">llvm.dbg.func.start</a></tt> provides
+ a link with the <a href="#format_subprograms">subprogram descriptor</a>
+ containing the details of this function.  This call also defines the beginning
+ of the function region, bounded by the <tt>%<a
+ href="#format_common_region_end">llvm.region.end</a></tt> at the end of
+ the function.  This region is used to bracket the lifetime of variables declared
+ within.  For a function, this outer region defines a new stack frame whose
+ lifetime ends when the region is ended.</p>
+ 
+ <p>It is possible to define inner regions for short-lived variables by using the
+ <a href="#format_common_region_start"><tt>%llvm.region.start</tt></a> and <a
+ href="#format_common_region_end"><tt>%llvm.region.end</tt></a> intrinsics to
+ bound a region.  The inner region in this example is the block containing the
+ declaration of Z.</p>
+ 
+ <p>Using regions to represent the boundaries of source-level functions allows
+ LLVM interprocedural optimizations to arbitrarily modify LLVM functions without
+ having to worry about breaking mapping information between the LLVM code and
+ the source-level program.  In particular, the inliner requires no modification
+ to support inlining with debugging information: there is no explicit correlation
+ drawn between LLVM functions and their source-level counterparts.  Note,
+ however, that if the inliner inlines all instances of a non-strong-linkage
+ function into its callers, it will not be possible for the user to manually
+ invoke the inlined function from a debugger.</p>
+ 
+ <p>Once the function has been defined, the <a
+ href="#format_common_stoppoint"><tt>stopping point</tt></a> corresponding to
+ line #2 (column #2) of the function is encountered.  At this point in the
+ function, <b>no</b> local variables are live.  As lines 2 and 3 of the example
+ are executed, their variable definitions are introduced into the program using
+ %<a href="#format_common_declare"><tt>llvm.dbg.declare</tt></a>, without the
+ need to specify a new region.  These variables do not require new regions to be
+ introduced because they go out of scope at the same point in the program: line
+ 9.</p>
+ 
+ <p>In contrast, the <tt>Z</tt> variable goes out of scope at a different time,
+ on line 7.  For this reason, it is defined within the inner region, which kills
+ the availability of <tt>Z</tt> before the code for line 8 is executed.  In this
+ way, regions can support arbitrary source-language scoping rules, as long as
+ scopes are strictly nested (i.e., one scope cannot partially overlap
+ another scope).</p>
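+ 
+ <p>As a hedged sketch of the nesting rule (it follows the intrinsic names and
+ elided arguments used in the example above, and is not taken from any
+ front-end's actual output), two sibling source blocks would each get their own
+ inner region, opened and closed one after the other inside the function
+ region.  The function <tt>bar</tt> and the variables <tt>A</tt> and <tt>B</tt>
+ are hypothetical:</p>
+ 
+ <pre>
+ <i>;; Hypothetical C source:  void bar() { { int A = ...; }  { int B = ...; } }</i>
+ 
+     call void %llvm.dbg.func.start( ... )        <i>;; opens the function region</i>
+ 
+     call void %llvm.region.start()               <i>;; first block</i>
+     call void %llvm.dbg.declare({}* %A, ...)
+     ...
+     call void %llvm.region.end()                 <i>;; A goes out of scope here</i>
+ 
+     call void %llvm.region.start()               <i>;; second block</i>
+     call void %llvm.dbg.declare({}* %B, ...)
+     ...
+     call void %llvm.region.end()                 <i>;; B goes out of scope here</i>
+ 
+     call void %llvm.region.end()                 <i>;; closes the function region</i>
+ </pre>
+ 
+ <p>Each start is matched by exactly one end before the enclosing region is
+ closed, so the two inner regions never overlap.</p>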
+ 
+ <p>It is worth noting that this scoping mechanism is used to control scoping of
+ all declarations, not just variable declarations.  For example, the scope of a
+ C++ using declaration is controlled with this mechanism, and could change how
+ name lookup is performed.</p>
+ 
+ </div>
+ 
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="ccxx_frontend">C/C++ front-end specific debug information</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The C and C++ front-ends represent information about the program in a format
+ that is effectively identical to <a
+ href="http://www.eagercon.com/dwarf/dwarf3std.htm">Dwarf 3.0</a> in terms of
+ information content.  This allows code generators to trivially support native
+ debuggers by generating standard Dwarf information, and contains enough
+ information for non-Dwarf targets to translate it as needed.</p>
+ 
+ <p>This section describes the forms used to represent C and C++ programs. Other
+ languages could pattern themselves after this (which itself is tuned to
+ representing programs in the same way that Dwarf 3 does), or they could choose
+ to provide completely different forms if they don't fit into the Dwarf model. 
+ As support for debugging information gets added to the various LLVM
+ source-language front-ends, the information used should be documented here.</p>
+ 
+ <p>The following sections provide examples of various C/C++ constructs and the
+ debug information that would best describe those constructs.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_compile_units">C/C++ source file information</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Given the source files "MySource.cpp" and "MyHeader.h" located in the
+ directory "/Users/mine/sources", and the following code:</p>
+ 
+ <pre>
+ #include "MyHeader.h"
+ 
+ int main(int argc, char *argv[]) {
+   return 0;
+ }
+ </pre>
+ 
+ <p>a C/C++ front-end would generate the following descriptors:</p>
+ 
+ <pre>
+ ...
+ ;;
+ ;; Define types used.  In this case we need one for compile unit anchors and one
+ ;; for compile units.
+ ;;
+ %<a href="#format_anchors">llvm.dbg.anchor.type</a> = type { uint, uint }
+ %<a href="#format_compile_units">llvm.dbg.compile_unit.type</a> = type { uint, {  }*, uint, uint, sbyte*, sbyte*, sbyte* }
+ ...
+ ;;
+ ;; Define the anchor for compile units.  Note that the second field of the
+ ;; anchor is 17, which is the same as the tag for compile units
+ ;; (17 = DW_TAG_compile_unit.)
+ ;;
+ %<a href="#format_compile_units">llvm.dbg.compile_units</a> = linkonce constant %<a href="#format_anchors">llvm.dbg.anchor.type</a> { uint 0, uint 17 }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define the compile unit for the source file "/Users/mine/sources/MySource.cpp".
+ ;;
+ %<a href="#format_compile_units">llvm.dbg.compile_unit1</a> = internal constant %<a href="#format_compile_units">llvm.dbg.compile_unit.type</a> {
+     uint add(uint 17, uint 262144), 
+     {  }* cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_units</a> to {  }*), 
+     uint 1, 
+     uint 1, 
+     sbyte* getelementptr ([13 x sbyte]* %str1, int 0, int 0), 
+     sbyte* getelementptr ([21 x sbyte]* %str2, int 0, int 0), 
+     sbyte* getelementptr ([33 x sbyte]* %str3, int 0, int 0) }, section "llvm.metadata"
+     
+ ;;
+ ;; Define the compile unit for the header file "/Users/mine/sources/MyHeader.h".
+ ;;
+ %<a href="#format_compile_units">llvm.dbg.compile_unit2</a> = internal constant %<a href="#format_compile_units">llvm.dbg.compile_unit.type</a> {
+     uint add(uint 17, uint 262144), 
+     {  }* cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_units</a> to {  }*), 
+     uint 1, 
+     uint 1, 
+     sbyte* getelementptr ([11 x sbyte]* %str4, int 0, int 0), 
+     sbyte* getelementptr ([21 x sbyte]* %str2, int 0, int 0), 
+     sbyte* getelementptr ([33 x sbyte]* %str3, int 0, int 0) }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define each of the strings used in the compile units.
+ ;;
+ %str1 = internal constant [13 x sbyte] c"MySource.cpp\00", section "llvm.metadata"
+ %str2 = internal constant [21 x sbyte] c"/Users/mine/sources/\00", section "llvm.metadata"
+ %str3 = internal constant [33 x sbyte] c"4.0.1 LLVM (LLVM research group)\00", section "llvm.metadata"
+ %str4 = internal constant [11 x sbyte] c"MyHeader.h\00", section "llvm.metadata"
+ ...
+ </pre>
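+ 
+ <p>The first field of each descriptor above is written as a constant
+ expression rather than as a single number.  As a sketch of how to read it,
+ assuming that 262144 is the debug version number 4 shifted left by 16 bits
+ (0x40000), the sum keeps the Dwarf tag in the low bits and the version in the
+ high bits:</p>
+ 
+ <pre>
+ ;;     17 = 0x00011   (DW_TAG_compile_unit)
+ ;; 262144 = 0x40000   (assumed: version 4 &lt;&lt; 16)
+ uint add(uint 17, uint 262144)    <i>;; same value as uint 262161 (0x40011)</i>
+ </pre>
+ 
+ <p>Under that assumption, a consumer could recover the tag by masking off the
+ upper 16 bits of the field.</p>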
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_global_variable">C/C++ global variable information</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Given an integer global variable declared as follows:</p>
+ 
+ <pre>
+ int MyGlobal = 100;
+ </pre>
+ 
+ <p>a C/C++ front-end would generate the following descriptors:</p>
+ 
+ <pre>
+ ;;
+ ;; Define types used. One for global variable anchors, one for the global
+ ;; variable descriptor, one for the global's basic type and one for the global's
+ ;; compile unit.
+ ;;
+ %<a href="#format_anchors">llvm.dbg.anchor.type</a> = type { uint, uint }
+ %<a href="#format_global_variables">llvm.dbg.global_variable.type</a> = type { uint, {  }*, {  }*, sbyte*, {  }*, uint, {  }*, bool, bool, {  }* }
+ %<a href="#format_basic_type">llvm.dbg.basictype.type</a> = type { uint, {  }*, sbyte*, {  }*, int, uint, uint, uint, uint }
+ %<a href="#format_compile_units">llvm.dbg.compile_unit.type</a> = ...
+ ...
+ ;;
+ ;; Define the global itself.
+ ;;
+ %MyGlobal = global int 100
+ ...
+ ;;
+ ;; Define the anchor for global variables.  Note that the second field of the
+ ;; anchor is 52, which is the same as the tag for global variables
+ ;; (52 = DW_TAG_variable.)
+ ;;
+ %<a href="#format_global_variables">llvm.dbg.global_variables</a> = linkonce constant %<a href="#format_anchors">llvm.dbg.anchor.type</a> { uint 0, uint 52 }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define the global variable descriptor.  Note the reference to the global
+ ;; variable anchor and the global variable itself.
+ ;;
+ %<a href="#format_global_variables">llvm.dbg.global_variable</a> = internal constant %<a href="#format_global_variables">llvm.dbg.global_variable.type</a> {
+     uint add(uint 52, uint 262144), 
+     {  }* cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_global_variables">llvm.dbg.global_variables</a> to {  }*), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([9 x sbyte]* %str1, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     uint 1,
+     {  }* cast (%<a href="#format_basic_type">llvm.dbg.basictype.type</a>* %<a href="#format_basic_type">llvm.dbg.basictype</a> to {  }*), 
+     bool false, 
+     bool true, 
+     {  }* cast (int* %MyGlobal to {  }*) }, section "llvm.metadata"
+     
+ ;;
+ ;; Define the basic type of 32 bit signed integer.  Note that since int is an
+ ;; intrinsic type, the source file is NULL and the line number is 0.
+ ;;    
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([4 x sbyte]* %str2, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 5 }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define the names of the global variable and basic type.
+ ;;
+ %str1 = internal constant [9 x sbyte] c"MyGlobal\00", section "llvm.metadata"
+ %str2 = internal constant [4 x sbyte] c"int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_subprogram">C/C++ function information</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Given a function declared as follows:</p>
+ 
+ <pre>
+ int main(int argc, char *argv[]) {
+   return 0;
+ }
+ </pre>
+ 
+ <p>a C/C++ front-end would generate the following descriptors:</p>
+ 
+ <pre>
+ ;;
+ ;; Define types used. One for subprogram anchors, one for the subprogram
+ ;; descriptor, and one for the subprogram's compile unit.
+ ;;
+ %<a href="#format_subprograms">llvm.dbg.subprogram.type</a> = type { uint, {  }*, {  }*, sbyte*, {  }*, uint, {  }*, bool, bool }
+ %<a href="#format_anchors">llvm.dbg.anchor.type</a> = type { uint, uint }
+ %<a href="#format_compile_units">llvm.dbg.compile_unit.type</a> = ...
+ 	
+ ;;
+ ;; Define the anchor for subprograms.  Note that the second field of the
+ ;; anchor is 46, which is the same as the tag for subprograms
+ ;; (46 = DW_TAG_subprogram.)
+ ;;
+ %<a href="#format_subprograms">llvm.dbg.subprograms</a> = linkonce constant %<a href="#format_anchors">llvm.dbg.anchor.type</a> { uint 0, uint 46 }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define the descriptor for the subprogram.  TODO - more details.
+ ;;
+ %<a href="#format_subprograms">llvm.dbg.subprogram</a> = internal constant %<a href="#format_subprograms">llvm.dbg.subprogram.type</a> {
+     uint add(uint 46, uint 262144), 
+     {  }* cast (%<a href="#format_anchors">llvm.dbg.anchor.type</a>* %<a href="#format_subprograms">llvm.dbg.subprograms</a> to {  }*), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([5 x sbyte]* %str1, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*),
+     uint 1,
+     {  }* null, 
+     bool false, 
+     bool true }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define the name of the subprogram.
+ ;;
+ %str1 = internal constant [5 x sbyte] c"main\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the subprogram itself.
+ ;;
+ int %main(int %argc, sbyte** %argv) {
+ ...
+ }
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_basic_types">C/C++ basic types</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The following are the basic type descriptors for C/C++ core types:</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_type_bool">bool</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([5 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 2 }, section "llvm.metadata"
+ %str1 = internal constant [5 x sbyte] c"bool\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_char">char</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([5 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 8, 
+     uint 8, 
+     uint 0, 
+     uint 6 }, section "llvm.metadata"
+ %str1 = internal constant [5 x sbyte] c"char\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_unsigned_char">unsigned char</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([14 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 8, 
+     uint 8, 
+     uint 0, 
+     uint 8 }, section "llvm.metadata"
+ %str1 = internal constant [14 x sbyte] c"unsigned char\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_short">short</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([10 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 16, 
+     uint 16, 
+     uint 0, 
+     uint 5 }, section "llvm.metadata"
+ %str1 = internal constant [10 x sbyte] c"short int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_unsigned_short">unsigned short</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([19 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 16, 
+     uint 16, 
+     uint 0, 
+     uint 7 }, section "llvm.metadata"
+ %str1 = internal constant [19 x sbyte] c"short unsigned int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_int">int</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([4 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 5 }, section "llvm.metadata"
+ %str1 = internal constant [4 x sbyte] c"int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_unsigned_int">unsigned int</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([13 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 7 }, section "llvm.metadata"
+ %str1 = internal constant [13 x sbyte] c"unsigned int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_long_long">long long</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([14 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 64, 
+     uint 64, 
+     uint 0, 
+     uint 5 }, section "llvm.metadata"
+ %str1 = internal constant [14 x sbyte] c"long long int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_unsigned_long_long">unsigned long long</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([23 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 64, 
+     uint 64, 
+     uint 0, 
+     uint 7 }, section "llvm.metadata"
+ %str1 = internal constant [23 x sbyte] c"long long unsigned int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_float">float</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([6 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 4 }, section "llvm.metadata"
+ %str1 = internal constant [6 x sbyte] c"float\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsubsection">
+   <a name="ccxx_basic_double">double</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([7 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 64, 
+     uint 64, 
+     uint 0, 
+     uint 4 }, section "llvm.metadata"
+ %str1 = internal constant [7 x sbyte] c"double\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_derived_types">C/C++ derived types</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Given the following example of a C/C++ derived type:</p>
+ 
+ <pre>
+ typedef const int *IntPtr;
+ </pre>
+ 
+ <p>a C/C++ front-end would generate the following descriptors:</p>
+ 
+ <pre>
+ ;;
+ ;; Define the typedef "IntPtr".
+ ;;
+ %<a href="#format_derived_type">llvm.dbg.derivedtype1</a> = internal constant %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> {
+     uint add(uint 22, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([7 x sbyte]* %str1, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     int 1, 
+     uint 0, 
+     uint 0, 
+     uint 0, 
+     {  }* cast (%<a href="#format_derived_type">llvm.dbg.derivedtype.type</a>* %<a href="#format_derived_type">llvm.dbg.derivedtype2</a> to {  }*) }, section "llvm.metadata"
+ %str1 = internal constant [7 x sbyte] c"IntPtr\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the pointer type.
+ ;;
+ %<a href="#format_derived_type">llvm.dbg.derivedtype2</a> = internal constant %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> {
+     uint add(uint 15, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* null, 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     {  }* cast (%<a href="#format_derived_type">llvm.dbg.derivedtype.type</a>* %<a href="#format_derived_type">llvm.dbg.derivedtype3</a> to {  }*) }, section "llvm.metadata"
+ 
+ ;;
+ ;; Define the const type.
+ ;;
+ %<a href="#format_derived_type">llvm.dbg.derivedtype3</a> = internal constant %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> {
+     uint add(uint 38, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* null, 
+     {  }* null, 
+     int 0, 
+     uint 0, 
+     uint 0, 
+     uint 0, 
+     {  }* cast (%<a href="#format_basic_type">llvm.dbg.basictype.type</a>* %<a href="#format_basic_type">llvm.dbg.basictype1</a> to {  }*) }, section "llvm.metadata"	
+ 
+ ;;
+ ;; Define the int type.
+ ;;
+ %<a href="#format_basic_type">llvm.dbg.basictype1</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([4 x sbyte]* %str2, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 5 }, section "llvm.metadata"
+ %str2 = internal constant [4 x sbyte] c"int\00", section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_composite_types">C/C++ struct/union types</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Given the following example of a C/C++ struct type:</p>
+ 
+ <pre>
+ struct Color {
+   unsigned Red;
+   unsigned Green;
+   unsigned Blue;
+ };
+ </pre>
+ 
+ <p>a C/C++ front-end would generate the following descriptors:</p>
+ 
+ <pre>
+ ;;
+ ;; Define basic type for unsigned int.
+ ;;
+ %<a href="#format_basic_type">llvm.dbg.basictype</a> = internal constant %<a href="#format_basic_type">llvm.dbg.basictype.type</a> {
+     uint add(uint 36, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([13 x sbyte]* %str1, int 0, int 0), 
+     {  }* null, 
+     int 0, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     uint 7 }, section "llvm.metadata"
+ %str1 = internal constant [13 x sbyte] c"unsigned int\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define composite type for struct Color.
+ ;;
+ %<a href="#format_composite_type">llvm.dbg.compositetype</a> = internal constant %<a href="#format_composite_type">llvm.dbg.compositetype.type</a> {
+     uint add(uint 19, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([6 x sbyte]* %str2, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     int 1, 
+     uint 96, 
+     uint 32, 
+     uint 0, 
+     {  }* null,
+     {  }* cast ([3 x {  }*]* %llvm.dbg.array to {  }*) }, section "llvm.metadata"
+ %str2 = internal constant [6 x sbyte] c"Color\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the Red field.
+ ;;
+ %<a href="#format_derived_type">llvm.dbg.derivedtype1</a> = internal constant %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> {
+     uint add(uint 13, uint 262144), 
+     {  }* null, 
+     sbyte* getelementptr ([4 x sbyte]* %str3, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     int 2, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     {  }* cast (%<a href="#format_basic_type">llvm.dbg.basictype.type</a>* %<a href="#format_basic_type">llvm.dbg.basictype</a> to {  }*) }, section "llvm.metadata"
+ %str3 = internal constant [4 x sbyte] c"Red\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the Green field.
+ ;;
+ %<a href="#format_derived_type">llvm.dbg.derivedtype2</a> = internal constant %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> {
+     uint add(uint 13, uint 262144), 
+     {  }* null, 
+     sbyte* getelementptr ([6 x sbyte]* %str4, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     int 3, 
+     uint 32, 
+     uint 32, 
+     uint 32, 
+     {  }* cast (%<a href="#format_basic_type">llvm.dbg.basictype.type</a>* %<a href="#format_basic_type">llvm.dbg.basictype</a> to {  }*) }, section "llvm.metadata"
+ %str4 = internal constant [6 x sbyte] c"Green\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the Blue field.
+ ;;
+ %<a href="#format_derived_type">llvm.dbg.derivedtype3</a> = internal constant %<a href="#format_derived_type">llvm.dbg.derivedtype.type</a> {
+     uint add(uint 13, uint 262144), 
+     {  }* null, 
+     sbyte* getelementptr ([5 x sbyte]* %str5, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     int 4, 
+     uint 32, 
+     uint 32, 
+     uint 64, 
+     {  }* cast (%<a href="#format_basic_type">llvm.dbg.basictype.type</a>* %<a href="#format_basic_type">llvm.dbg.basictype</a> to {  }*) }, section "llvm.metadata"
+ %str5 = internal constant [5 x sbyte] c"Blue\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the array of fields used by the composite type Color.
+ ;;
+ %llvm.dbg.array = internal constant [3 x {  }*] [
+       {  }* cast (%<a href="#format_derived_type">llvm.dbg.derivedtype.type</a>* %<a href="#format_derived_type">llvm.dbg.derivedtype1</a> to {  }*),
+       {  }* cast (%<a href="#format_derived_type">llvm.dbg.derivedtype.type</a>* %<a href="#format_derived_type">llvm.dbg.derivedtype2</a> to {  }*),
+       {  }* cast (%<a href="#format_derived_type">llvm.dbg.derivedtype.type</a>* %<a href="#format_derived_type">llvm.dbg.derivedtype3</a> to {  }*) ], section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ccxx_enumeration_types">C/C++ enumeration types</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Given the following as an example of a C/C++ enumeration type:</p>
+ 
+ <pre>
+ enum Trees {
+   Spruce = 100,
+   Oak = 200,
+   Maple = 300
+ };
+ </pre>
+ 
+ <p>a C/C++ front-end would generate the following descriptors:</p>
+ 
+ <pre>
+ ;;
+ ;; Define composite type for enum Trees
+ ;;
+ %<a href="#format_composite_type">llvm.dbg.compositetype</a> = internal constant %<a href="#format_composite_type">llvm.dbg.compositetype.type</a> {
+     uint add(uint 4, uint 262144), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     sbyte* getelementptr ([6 x sbyte]* %str1, int 0, int 0), 
+     {  }* cast (%<a href="#format_compile_units">llvm.dbg.compile_unit.type</a>* %<a href="#format_compile_units">llvm.dbg.compile_unit</a> to {  }*), 
+     int 1, 
+     uint 32, 
+     uint 32, 
+     uint 0, 
+     {  }* null, 
+     {  }* cast ([3 x {  }*]* %llvm.dbg.array to {  }*) }, section "llvm.metadata"
+ %str1 = internal constant [6 x sbyte] c"Trees\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define Spruce enumerator.
+ ;;
+ %<a href="#format_enumeration">llvm.dbg.enumerator1</a> = internal constant %<a href="#format_enumeration">llvm.dbg.enumerator.type</a> {
+     uint add(uint 40, uint 262144), 
+     sbyte* getelementptr ([7 x sbyte]* %str2, int 0, int 0), 
+     int 100 }, section "llvm.metadata"
+ %str2 = internal constant [7 x sbyte] c"Spruce\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define Oak enumerator.
+ ;;
+ %<a href="#format_enumeration">llvm.dbg.enumerator2</a> = internal constant %<a href="#format_enumeration">llvm.dbg.enumerator.type</a> {
+     uint add(uint 40, uint 262144), 
+     sbyte* getelementptr ([4 x sbyte]* %str3, int 0, int 0), 
+     int 200 }, section "llvm.metadata"
+ %str3 = internal constant [4 x sbyte] c"Oak\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define Maple enumerator.
+ ;;
+ %<a href="#format_enumeration">llvm.dbg.enumerator3</a> = internal constant %<a href="#format_enumeration">llvm.dbg.enumerator.type</a> {
+     uint add(uint 40, uint 262144), 
+     sbyte* getelementptr ([6 x sbyte]* %str4, int 0, int 0), 
+     int 300 }, section "llvm.metadata"
+ %str4 = internal constant [6 x sbyte] c"Maple\00", section "llvm.metadata"
+ 
+ ;;
+ ;; Define the array of enumerators used by composite type Trees.
+ ;;
+ %llvm.dbg.array = internal constant [3 x {  }*] [
+   {  }* cast (%<a href="#format_enumeration">llvm.dbg.enumerator.type</a>* %<a href="#format_enumeration">llvm.dbg.enumerator1</a> to {  }*),
+   {  }* cast (%<a href="#format_enumeration">llvm.dbg.enumerator.type</a>* %<a href="#format_enumeration">llvm.dbg.enumerator2</a> to {  }*),
+   {  }* cast (%<a href="#format_enumeration">llvm.dbg.enumerator.type</a>* %<a href="#format_enumeration">llvm.dbg.enumerator3</a> to {  }*) ], section "llvm.metadata"
+ </pre>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/Stacker.html
diff -c /dev/null llvm-www/releases/1.8/docs/Stacker.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/Stacker.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1412 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Stacker: An Example Of Using LLVM</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">Stacker: An Example Of Using LLVM</div>
+ 
+ <ol>
+   <li><a href="#abstract">Abstract</a></li>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#lessons">Lessons I Learned About LLVM</a>
+     <ol>
+       <li><a href="#value">Everything's a Value!</a></li>
+       <li><a href="#terminate">Terminate Those Blocks!</a></li>
+       <li><a href="#blocks">Concrete Blocks</a></li>
+       <li><a href="#push_back">push_back Is Your Friend</a></li>
+       <li><a href="#gep">The Wily GetElementPtrInst</a></li>
+       <li><a href="#linkage">Getting Linkage Types Right</a></li>
+       <li><a href="#constants">Constants Are Easier Than That!</a></li>
+     </ol></li>
+   <li><a href="#lexicon">The Stacker Lexicon</a>
+     <ol>
+       <li><a href="#stack">The Stack</a></li>
+       <li><a href="#punctuation">Punctuation</a></li>
+       <li><a href="#comments">Comments</a></li>
+       <li><a href="#literals">Literals</a></li>
+       <li><a href="#words">Words</a></li>
+       <li><a href="#style">Standard Style</a></li>
+       <li><a href="#builtins">Built-Ins</a></li>
+     </ol></li>
+   <li><a href="#example">Prime: A Complete Example</a></li>
+   <li><a href="#internal">Internal Code Details</a>
+     <ol>
+       <li><a href="#directory">The Directory Structure </a></li>
+       <li><a href="#lexer">The Lexer</a></li>
+       <li><a href="#parser">The Parser</a></li>
+       <li><a href="#compiler">The Compiler</a></li>
+       <li><a href="#runtime">The Runtime</a></li>
+       <li><a href="#driver">Compiler Driver</a></li>
+       <li><a href="#tests">Test Programs</a></li>
+       <li><a href="#exercise">Exercise</a></li>
+       <li><a href="#todo">Things Remaining To Be Done</a></li>
+     </ol></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:rspencer at x10sys.com">Reid Spencer</a></p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_section"><a name="abstract">Abstract</a></div>
+ <div class="doc_text">
+ <p>This document is another way to learn about LLVM. Unlike the 
+ <a href="LangRef.html">LLVM Reference Manual</a> or 
+ <a href="ProgrammersManual.html">LLVM Programmer's Manual</a>, here we learn
+ about LLVM through the experience of creating a simple programming language
+ named Stacker.  Stacker was invented specifically as a demonstration of
+ LLVM. The emphasis in this document is not on describing the
+ intricacies of LLVM itself but on how to use it to build your own
+ compiler system.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_section"> <a name="introduction">Introduction</a> </div>
+ <div class="doc_text">
+ <p>Amongst other things, LLVM is a platform for compiler writers.
+ Because of its exceptionally clean and small IR (intermediate
+ representation), compiler writing with LLVM is much easier than with
+ other systems. As proof, I wrote the entire compiler (language definition, 
+ lexer, parser, code generator, etc.) in about <em>four days</em>! 
+ That's important to know because it shows how quickly you can get a new
+ language running when using LLVM. Furthermore, this was the <em>first</em> 
+ language the author ever created using LLVM. The learning curve is 
+ included in that four days.</p>
+ <p>The language described here, Stacker, is Forth-like. Programs
+ are simple collections of word definitions, and the only thing definitions
+ can do is manipulate a stack or generate I/O.  Stacker is not a "real" 
+ programming language; it's very simple.  Although it is computationally 
+ complete, you wouldn't use it for your next big project. However, 
+ the fact that it is complete, it's simple, and it <em>doesn't</em> have 
+ a C-like syntax make it useful for demonstration purposes. It shows
+ that LLVM could be applied to a wide variety of languages.</p>
+ <p>The basic notion behind Stacker is very simple. There's a stack of 
+ integers (or character pointers) that the program manipulates. Pretty 
+ much the only thing the program can do is manipulate the stack and do 
+ some limited I/O operations. The language provides you with several 
+ built-in words that manipulate the stack in interesting ways. To get 
+ your feet wet, here's how you write the traditional "Hello, World" 
+ program in Stacker:</p>
+ <p><code>: hello_world "Hello, World!" &gt;s DROP CR ;<br>
+ : MAIN hello_world ;<br></code></p>
+ <p>This has two "definitions" (Stacker manipulates words, not
+ functions and words have definitions): <code>MAIN</code> and <code>
+ hello_world</code>. The <code>MAIN</code> definition is standard; it
+ tells Stacker where to start. Here, <code>MAIN</code> is defined to 
+ simply invoke the word <code>hello_world</code>. The
+ <code>hello_world</code> definition tells Stacker to push the 
+ <code>"Hello, World!"</code> string on to the stack, print it out 
+ (<code>&gt;s</code>), pop it off the stack (<code>DROP</code>), and
+ finally print a carriage return (<code>CR</code>). Although 
+ <code>hello_world</code> uses the stack, its net effect on the stack is null.
+ Well-written Stacker definitions have that characteristic. </p>
+ <p>Exercise for the reader: how could you make this a one line program?</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_section"><a name="lessons"></a>Lessons I Learned About LLVM</div>
+ <div class="doc_text">
+ <p>Stacker was written for two purposes: </p>
+ <ol>
+     <li>to get the author over the learning curve, and</li>
+     <li>to provide a simple example of how to write a compiler using LLVM.</li>
+ </ol>
+ <p>During the development of Stacker, many lessons about LLVM were
+ learned. Those lessons are described in the following subsections.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="value"></a>Everything's a Value!</div>
+ <div class="doc_text">
+ <p>Although I knew that LLVM uses a Static Single Assignment (SSA) format, 
+ it wasn't obvious to me how prevalent this idea was in LLVM until I really
+ started using it.  Reading the <a href="ProgrammersManual.html">
+ Programmer's Manual</a> and <a href="LangRef.html">Language Reference</a>,
+ I noted that most of the important LLVM IR (Intermediate Representation) C++ 
+ classes were derived from the Value class. I only came to appreciate the full
+ power of that simple design once I started constructing executable
+ expressions for Stacker.</p>
+ 
+ <p>This really makes your programming go faster. Think about compiling code
+ for the following C/C++ expression: <code>(a|b)*((x+1)/(y+1))</code>. Assuming
+ the values are on the stack in the order a, b, x, y, this could be
+ expressed in Stacker as: <code>1 + SWAP 1 + / ROT2 OR *</code>.
+ You could write a function using LLVM that computes this expression like 
+ this: </p>
+ 
+ <div class="doc_code"><pre>
+ Value* 
+ expression(BasicBlock* bb, Value* a, Value* b, Value* x, Value* y )
+ {
+     ConstantSInt* one = ConstantSInt::get(Type::IntTy, 1);
+     BinaryOperator* or1 = BinaryOperator::createOr(a, b, "", bb);
+     BinaryOperator* add1 = BinaryOperator::createAdd(x, one, "", bb);
+     BinaryOperator* add2 = BinaryOperator::createAdd(y, one, "", bb);
+     BinaryOperator* div1 = BinaryOperator::createDiv(add1, add2, "", bb);
+     BinaryOperator* mult1 = BinaryOperator::createMul(or1, div1, "", bb);
+     return mult1;
+ }
+ </pre></div>
+ 
+ <p>"Okay, big deal," you say?  It is a big deal. Here's why. Note that I didn't
+ have to tell this function which kinds of Values are being passed in. They could be
+ <code>Instruction</code>s, <code>Constant</code>s, <code>GlobalVariable</code>s, or
+ any of the other subclasses of <code>Value</code> that LLVM supports.
+ Furthermore, if you specify Values that are incorrect for this sequence of 
+ operations, LLVM will either notice right away (at compilation time) or the LLVM 
+ Verifier will pick up the inconsistency when the compiler runs. In either case 
+ LLVM prevents you from making a type error that gets passed through to the 
+ generated program.  This <em>really</em> helps you write a compiler that 
+ always generates correct code!</p>
+ <p>The second point is that we don't have to worry about branching, registers,
+ stack variables, saving partial results, etc. The instructions we create 
+ <em>are</em> the values we use. Note that all that was created in the above
+ code is a Constant value and five operators. Each of the instructions <em>is</em> 
+ the resulting value of that instruction. This saves a lot of time.</p>
+ <p>The lesson is this: <em>SSA form is very powerful: there is no difference
+ between a value and the instruction that created it.</em> This is fully
+ enforced by the LLVM IR. Use it to your best advantage.</p>
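+ <p>The "an instruction <em>is</em> its value" idea can be sketched without LLVM
+ at all. The toy classes below (<code>Value</code>, <code>Constant</code>,
+ <code>BinaryOp</code>, <code>Arena</code>) are hypothetical stand-ins invented
+ for this illustration and are not LLVM's classes. They only show why a builder
+ function never needs to know whether an operand is a constant or the result of
+ an earlier operation.</p>

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Toy stand-ins for illustration only; these are NOT LLVM's classes.
struct Value {
    virtual ~Value() = default;
    virtual int eval() const = 0;
};

struct Constant : Value {
    int v;
    explicit Constant(int v) : v(v) {}
    int eval() const override { return v; }
};

// An operation is itself a Value, so it can be an operand of another one.
struct BinaryOp : Value {
    char op;
    const Value* lhs;
    const Value* rhs;
    BinaryOp(char op, const Value* l, const Value* r) : op(op), lhs(l), rhs(r) {}
    int eval() const override {
        int a = lhs->eval(), b = rhs->eval();
        switch (op) {
            case '|': return a | b;
            case '+': return a + b;
            case '/': return a / b;  // caller must avoid a zero divisor
            default:  return a * b;  // '*'
        }
    }
};

// Owns every node so the raw pointers handed out stay valid.
struct Arena {
    std::vector<std::unique_ptr<Value>> nodes;
    template <class T, class... Args>
    const Value* make(Args... args) {
        nodes.emplace_back(new T(args...));
        return nodes.back().get();
    }
};

// Builds (a|b)*((x+1)/(y+1)); operands may be any kind of Value.
const Value* expression(Arena& arena, const Value* a, const Value* b,
                        const Value* x, const Value* y) {
    const Value* one  = arena.make<Constant>(1);
    const Value* or1  = arena.make<BinaryOp>('|', a, b);
    const Value* add1 = arena.make<BinaryOp>('+', x, one);
    const Value* add2 = arena.make<BinaryOp>('+', y, one);
    const Value* div1 = arena.make<BinaryOp>('/', add1, add2);
    return arena.make<BinaryOp>('*', or1, div1);
}

// Convenience wrapper: build the expression over constants and evaluate it.
int evaluate(int a, int b, int x, int y) {
    Arena arena;
    const Value* result = expression(arena, arena.make<Constant>(a),
                                     arena.make<Constant>(b),
                                     arena.make<Constant>(x),
                                     arena.make<Constant>(y));
    return result->eval();
}
```

+ <p>Unlike LLVM, this toy evaluates the expression directly instead of emitting
+ code, but the shape of <code>expression</code> mirrors the function above: five
+ operations and one constant, each usable wherever a value is expected.</p>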
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="terminate"></a>Terminate Those Blocks!</div>
+ <div class="doc_text">
+ <p>I had to learn about terminating blocks the hard way: using the debugger 
+ to figure out what the LLVM verifier was trying to tell me and begging for
+ help on the LLVMdev mailing list. I hope you avoid this experience.</p>
+ <p>Emblazon this rule in your mind:</p>
+ <ul>
+     <li><em>All</em> <code>BasicBlock</code>s in your compiler <b>must</b> be
+ 	terminated with a terminating instruction (branch, return, etc.).
+     </li>
+ </ul>
+ <p>Terminating instructions are a semantic requirement of the LLVM IR. There
+ is no facility for implicitly chaining together blocks placed into a function
+ in the order they occur. Indeed, in the general case, blocks will not be
+ added to the function in the order of execution because of the recursive
+ way compilers are written.</p>
+ <p>Furthermore, if you don't terminate your blocks, your compiler code will 
+ compile just fine. You won't find out about the problem until you're running 
+ the compiler and the module you just created fails on the LLVM Verifier.</p>
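+ <p>As a rough illustration of the rule, the check below walks a toy function
+ and reports whether every block ends in a terminator. The <code>Block</code>
+ alias, the opcode strings, and both function names are assumptions made up for
+ this sketch. The real LLVM Verifier checks far more than this.</p>

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical toy model: a block is just an ordered list of opcode names.
using Block = std::vector<std::string>;

// Only two terminators are modelled here; LLVM has others.
bool is_terminator(const std::string& op) {
    return op == "br" || op == "ret";
}

// True only if every block is non-empty and its last instruction terminates it.
bool all_blocks_terminated(const std::vector<Block>& function) {
    for (const Block& bb : function) {
        if (bb.empty() || !is_terminator(bb.back())) return false;
    }
    return true;
}
```

+ <p>Running a check like this right after building each function, rather than
+ at the end of compilation, tends to point much closer to the code that forgot
+ the terminator.</p>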
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="blocks"></a>Concrete Blocks</div>
+ <div class="doc_text">
+ <p>After a little initial fumbling around, I quickly caught on to how blocks
+ should be constructed. In general, here's what I learned:
+ <ol>
+     <li><em>Create your blocks early.</em> While writing your compiler, you 
+     will encounter several situations where you know a priori that you will
+     need several blocks. For example, if-then-else, switch, while, and for
+     statements in C/C++ all need multiple blocks for expression in LLVM. 
+     The rule is, create them early.</li>
+     <li><em>Terminate your blocks early.</em> This just reduces the chances 
+     that you forget to terminate your blocks, which is required (go 
+     <a href="#terminate">here</a> for more).</li>
+     <li><em>Use getTerminator() for instruction insertion.</em> I noticed early on
+     that many of the constructors for the Instruction classes take an optional
+     <code>insert_before</code> argument. At first, I thought this was a mistake
+     because clearly the normal mode of inserting instructions would be one at
+     a time <em>after</em> some other instruction, not <em>before</em>. However,
+     if you hold on to your terminating instruction (or use the handy dandy
+     <code>getTerminator()</code> method on a <code>BasicBlock</code>), it can
+     always be used as the <code>insert_before</code> argument to your instruction
+     constructors. This causes the instruction to automatically be inserted in 
+     the RightPlace™, just before the terminating instruction. The 
+     nice thing about this design is that you can pass blocks around and insert 
+     new instructions into them without ever knowing what instructions came 
+     before. This makes for some very clean compiler design.</li>
+ </ol>
+ <p>The foregoing is such an important principle, it's worth making an idiom:</p>
+ <pre>
+ BasicBlock* bb = new BasicBlock();
+ bb->getInstList().push_back( new BranchInst( ... ) );
+ new Instruction(..., bb->getTerminator() );
+ </pre>
+ <p>To make this clear, consider the typical if-then-else statement
+ (see StackerCompiler::handle_if() method).  We can set this up
+ in a single function using LLVM in the following way: </p>
+ <pre>
+ using namespace llvm;
+ BasicBlock*
+ MyCompiler::handle_if( BasicBlock* bb, SetCondInst* condition )
+ {
+     // Create the blocks to contain code in the structure of if/then/else
+     BasicBlock* then_bb = new BasicBlock(); 
+     BasicBlock* else_bb = new BasicBlock();
+     BasicBlock* exit_bb = new BasicBlock();
+ 
+     // Insert the branch instruction for the "if"
+     bb->getInstList().push_back( new BranchInst( then_bb, else_bb, condition ) );
+ 
+     // Set up the terminating instructions
+     then_bb->getInstList().push_back( new BranchInst( exit_bb ) );
+     else_bb->getInstList().push_back( new BranchInst( exit_bb ) );
+ 
+     // Fill in the then part .. details excised for brevity
+     this->fill_in( then_bb );
+ 
+     // Fill in the else part .. details excised for brevity
+     this->fill_in( else_bb );
+ 
+     // Return a block to the caller that can be filled in with the code
+     // that follows the if/then/else construct.
+     return exit_bb;
+ }
+ </pre>
+ <p>Presumably in the foregoing, the calls to the "fill_in" method would add 
+ the instructions for the "then" and "else" parts. They would use the third part
+ of the idiom almost exclusively (inserting new instructions before the 
+ terminator). Furthermore, they could even recurse back to <code>handle_if</code> 
+ should they encounter another if/then/else statement, and it will just work.</p>
+ <p>Note how cleanly this all works out, in particular the push_back calls on
+ each <code>BasicBlock</code>'s instruction list. These are lists of type 
+ <code>Instruction</code> (which is also of type <code>Value</code>). To create 
+ the "if" branch we merely instantiate a <code>BranchInst</code> that takes as 
+ arguments the blocks to branch to and the condition to branch on. The 
+ <code>BasicBlock</code> objects act like branch labels! This new 
+ <code>BranchInst</code> terminates the <code>BasicBlock</code> provided 
+ as an argument. To give the caller a way to keep inserting after calling 
+ <code>handle_if</code>, we create an <code>exit_bb</code> block which is
+ returned to the caller.  Note that the <code>exit_bb</code> block is the
+ branch target of the terminators of both the <code>then_bb</code> and the
+ <code>else_bb</code> blocks. This guarantees that no matter what else
+ <code>handle_if</code> or <code>fill_in</code> does, they end up at the
+ <code>exit_bb</code> block.
+ </p>
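+ <p>The "terminate early, then insert before the terminator" idiom can be
+ modelled with a plain <code>std::list</code>. The <code>ToyBlock</code> type and
+ its methods are hypothetical names for this sketch and are not part of LLVM.
+ The point is only that, once the terminator is in place, every later insertion
+ lands in program order without the caller knowing what is already in the
+ block.</p>

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <string>

// Hypothetical toy block: an ordered list of opcode names.
struct ToyBlock {
    std::list<std::string> insts;

    // Step 1 of the idiom: append the terminator right away.
    void terminate(const std::string& term) { insts.push_back(term); }

    // Step 2: every later instruction goes just before the terminator,
    // which is always the last element of the list.
    void insert_before_terminator(const std::string& inst) {
        assert(!insts.empty() && "terminate the block first");
        insts.insert(std::prev(insts.end()), inst);
    }
};

// Renders the block as a space-separated string for inspection.
std::string dump(const ToyBlock& bb) {
    std::string out;
    for (const std::string& inst : bb.insts) {
        if (!out.empty()) out += ' ';
        out += inst;
    }
    return out;
}
```

+ <p>Inserting "add" and then "mul" into a block terminated with "br" leaves the
+ instructions in the order add, mul, br, which is the behavior the
+ <code>insert_before</code> argument gives you in LLVM.</p>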
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="push_back"></a>push_back Is Your Friend</div>
+ <div class="doc_text">
+ <p>
+ One of the first things I noticed is the frequent use of the "push_back"
+ method on the various lists. This is so common that it is worth mentioning.
+ The "push_back" method appends a value to the end of an STL sequence container
+ (list, vector, deque, etc.). The method might have also been named
+ "insert_tail" or "append". Although I've used STL quite frequently, I rarely
+ needed push_back in other programs. In LLVM, you'll use it all the time.
+ </p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="gep"></a>The Wily GetElementPtrInst</div>
+ <div class="doc_text">
+ <p>
+ It took a little getting used to and several rounds of postings to the LLVM
+ mailing list to wrap my head around this instruction correctly. Even though I had
+ read the Language Reference and Programmer's Manual a couple times each, I still
+ missed a few <em>very</em> key points:
+ </p>
+ <ul>
+ <li>GetElementPtrInst gives you back a Value for the last thing indexed.</li>
+ <li>All global variables in LLVM  are <em>pointers</em>.</li>
+ <li>Pointers must also be dereferenced with the GetElementPtrInst
+ instruction.</li>
+ </ul>
+ <p>This means that when you look up an element in the global variable (assuming
+ it's a struct or array), you <em>must</em> dereference the pointer first! For many
+ things, this leads to the idiom:
+ </p>
+ <pre>
+ std::vector&lt;Value*&gt; index_vector;
+ index_vector.push_back( ConstantSInt::get( Type::LongTy, 0 ) );
+ // ... push other indices ...
+ GetElementPtrInst* gep = new GetElementPtrInst( ptr, index_vector );
+ </pre>
+ <p>For example, suppose we have a global variable whose type is [24 x int]. The
+ variable itself represents a <em>pointer</em> to that array. To subscript the
+ array, we need two indices, not just one. The first index (0) dereferences the
+ pointer. The second index subscripts the array. If you're a "C" programmer, this
+ will run against your grain because you'll naturally think of the global array
+ variable and the address of its first element as the same. That tripped me up
+ for a while until I realized that they really do differ: by <em>type</em>.
+ Remember that LLVM is strongly typed. Everything has a type.  
+ The "type" of the global variable is [24 x int]*. That is, it's
+ a pointer to an array of 24 ints.  When you dereference that global variable with
+ a single (0) index, you now have a "[24 x int]" type.  Although
+ the pointer value of the dereferenced global and the address of the zero'th element
+ in the array will be the same, they differ in their type. The zero'th element has
+ type "int" while the pointer value has type "[24 x int]".</p>
+ <p>Get this one aspect of LLVM right in your head, and you'll save yourself
+ a lot of compiler writing headaches down the road.</p>
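+ <p>C++ can express the same distinction if you take the address of the whole
+ array instead of letting it decay. The sketch below is an analogy in plain C++,
+ not LLVM code; <code>table</code>, <code>Array24</code>,
+ <code>whole_array</code> and <code>element</code> are names invented for it.
+ The two subscripts in <code>p[0][i]</code> play the roles of the two
+ GetElementPtrInst indices described above.</p>

```cpp
#include <cassert>
#include <type_traits>

using Array24 = int[24];

Array24 table;  // plays the role of a global whose type is [24 x int]

// Like a global's name in LLVM, this yields a pointer to the whole array.
Array24* whole_array() { return &table; }

// Two indices are needed to reach an element:
//   index 0 steps through the pointer to the array itself,
//   index i subscripts that array.
int* element(Array24* p, int i) { return &p[0][i]; }

// Same address for element 0, but the two pointer types are distinct.
static_assert(!std::is_same<Array24*, int*>::value,
              "pointer-to-array and pointer-to-element differ by type");
```

+ <p>The pointer returned by <code>whole_array()</code> and the pointer returned
+ by <code>element(whole_array(), 0)</code> compare equal as addresses, yet the
+ compiler treats them as different types, which is the situation the paragraph
+ above describes.</p>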
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="linkage"></a>Getting Linkage Types Right</div>
+ <div class="doc_text">
+ <p>Linkage types in LLVM can be a little confusing, especially if your compiler
+ writing mind has affixed firm concepts to particular words like "weak",
+ "external", "global", "linkonce", etc. LLVM does <em>not</em> use the precise
+ definitions of, say, ELF or GCC, even though they share common terms. To be fair,
+ the concepts are related and similar but not precisely the same. This can lead
+ you to think you know what a linkage type represents but in fact it is slightly
+ different. I recommend you read the 
+ <a href="LangRef.html#linkage"> Language Reference on this topic</a> very 
+ carefully. Then, read it again.</p>
+ <p>Here are some handy tips that I discovered along the way:</p>
+ <ul>
+     <li><em>Uninitialized means external.</em> That is, the symbol is declared in the current
+     module and can be used by that module, but it is not defined by that module.</li>
+     <li><em>Setting an initializer changes a global's linkage type.</em> Setting an 
+     initializer changes a global's linkage type from whatever it was to a normal, 
+     defined global (not external). You'll need to call the setLinkage() method to 
+     reset it if you specify the initializer after the GlobalValue has been constructed. 
+     This is important for LinkOnce and Weak linkage types.</li> 
+     <li><em>Appending linkage can keep track of things.</em> Appending linkage can 
+     be used to keep track of compilation information at runtime. It could be used, 
+     for example, to build a full table of all the C++ virtual tables or hold the 
+     C++ RTTI data, or whatever. Appending linkage can only be applied to arrays. 
+     All arrays with the same name in each module are concatenated together at link 
+     time.</li>
+ </ul>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="constants"></a>Constants Are Easier Than That!</div>
+ <div class="doc_text">
+ <p>
+ Constants in LLVM took a little getting used to until I discovered a few utility
+ functions in the LLVM IR that make things easier. Here's what I learned: </p>
+ <ul>
+  <li>Constants are Values like anything else and can be operands of instructions.</li>
+  <li>Integer constants, frequently needed, can be created using the static "get"
+  methods of the ConstantInt, ConstantSInt, and ConstantUInt classes. The nice thing
+  about these is that you can "get" any kind of integer quickly.</li>
+  <li>There's a special method on the Constant class which allows you to get the null 
+  constant for <em>any</em> type. This is really handy for initializing large 
+  arrays or structures, etc.</li>
+ </ul>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_section"> <a name="lexicon">The Stacker Lexicon</a></div>
+ <div class="doc_text"><p>This section describes the Stacker language.</p></div>
+ <div class="doc_subsection"><a name="stack"></a>The Stack</div>
+ <div class="doc_text">
+ <p>Stacker definitions define what they do to the global stack. Before
+ proceeding, a few words about the stack are in order. The stack is simply
+ a global array of 32-bit integers or pointers. A global index keeps track
+ of the location of the top of the stack. All of this is hidden from the 
+ programmer, but it needs to be noted because it is the foundation of the 
+ conceptual programming model for Stacker. When you write a definition,
+ you are, essentially, saying how you want that definition to manipulate
+ the global stack.</p>
+ <p>Manipulating the stack can be quite hazardous. There is no distinction
+ given and no checking for the various types of values that can be placed
+ on the stack. Automatic coercion between types is performed. In many 
+ cases, this is useful. For example, a boolean value placed on the stack
+ can be interpreted as an integer with good results. However, using a
+ word that interprets that boolean value as a pointer to a string to
+ print out will almost always yield a crash. Stacker simply leaves it
+ to the programmer to get it right without any interference or hindering
+ on interpretation of the stack values. You've been warned. :) </p>
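+ <p>The programming model described above (a global array plus a global index)
+ can be sketched in a few lines. Everything here is an assumption for
+ illustration: the names, the stack size, and the bounds-checking asserts are
+ invented, and the behavior of <code>DROP</code>, <code>DUP</code> and
+ <code>SWAP</code> follows the usual Forth conventions rather than the Stacker
+ runtime source. Only integers are modelled, not pointers.</p>

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of the global stack; names and size are assumptions.
const int kStackSize = 256;
std::int32_t the_stack[kStackSize];
int stack_count = 0;  // number of values currently on the stack

// The asserts exist only in this sketch; Stacker itself does no checking.
void push(std::int32_t v) {
    assert(stack_count < kStackSize);
    the_stack[stack_count++] = v;
}

std::int32_t pop() {
    assert(stack_count > 0);
    return the_stack[--stack_count];
}

// Forth-style words, assuming the conventional meanings:
void DROP() { pop(); }                                   // w --
void DUP()  { std::int32_t w = pop(); push(w); push(w); } // w -- w w
void SWAP() {                                            // w1 w2 -- w2 w1
    std::int32_t w2 = pop();
    std::int32_t w1 = pop();
    push(w2);
    push(w1);
}
```

+ <p>A word definition is then just a sequence of calls like these, which is why
+ a definition that leaves <code>stack_count</code> where it found it is easy to
+ compose with others.</p>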
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="punctuation"></a>Punctuation</div>
+ <div class="doc_text">
+ <p>Punctuation in Stacker is very simple. The colon and semi-colon 
+ characters are used to introduce and terminate a definition
+ (respectively). Except for <em>FORWARD</em> declarations, definitions 
+ are all you can specify in Stacker.  Definitions are read left to right. 
+ Immediately after the colon comes the name of the word being defined. 
+ The remaining words in the definition specify what the word does. The definition
+ is terminated by a semi-colon.</p>
+ <p>So, your typical definition will have the form:</p>
+ <pre><code>: name ... ;</code></pre>
+ <p>The <code>name</code> is up to you but it must start with a letter and contain
+ only letters, numbers, and underscore. Names are case sensitive and must not be
+ the same as the name of a built-in word. The <code>...</code> is replaced by
+ the stack manipulating words that you wish to define <code>name</code> as. </p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="comments"></a>Comments</div>
+ <div class="doc_text">
+     <p>Stacker supports two types of comments. A hash mark (#) starts a comment
+     that extends to the end of the line. It is identical to the kind of comments
+     commonly used in shell scripts. A pair of parentheses also surrounds a comment.
+     In both cases, the content of the comment is ignored by the Stacker compiler. The
+     following does nothing in Stacker.
+     </p>
+ <pre><code>
+ # This is a comment to end of line
+ ( This is an enclosed comment )
+ </code></pre>
+ <p>See the <a href="#example">example</a> program to see comments in use in 
+ a real program.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="literals"></a>Literals</div>
+ <div class="doc_text">
+     <p>There are three kinds of literal values in Stacker: Integers, Strings,
+     and Booleans. In each case, the stack operation is to simply push the
+     value on to the stack. So, for example:<br/>
+     <code> 42 " is the answer." TRUE </code><br/>
+     will push three values on to the stack: the integer 42, the
+     string " is the answer.", and the boolean TRUE.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="words"></a>Words</div>
+ <div class="doc_text">
+ <p>Each definition in Stacker is composed of a set of words. Words are
+ read and executed in order from left to right. There is very little
+ checking in Stacker to make sure you're doing the right thing with 
+ the stack. It is assumed that the programmer knows how the stack 
+ transformation he applies will affect the program.</p>
+ <p>Words in a definition come in two flavors: built-in and programmer
+ defined. Simply mentioning the name of a previously defined or declared
+ programmer-defined word causes that word's stack actions to be invoked. It
+ is somewhat like a function call in other languages. The built-in
+ words have various effects, described <a href="#builtins">below</a>.</p>
+ <p>Sometimes you need to call a word before it is defined. For this, you can
+ use the <code>FORWARD</code> declaration. It looks like this:</p>
+ <p><code>FORWARD name ;</code></p>
+ <p>This simply states to Stacker that "name" is the name of a definition
+ that is defined elsewhere. Generally it means the definition can be found
+ "forward" in the file. But, it doesn't have to be in the current compilation
+ unit. Anything declared with <code>FORWARD</code> is an external symbol for
+ linking.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="style"></a>Standard Style</div>
+ <div class="doc_text">
+ <p>TODO</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="builtins"></a>Built In Words</div>
+ <div class="doc_text">
+ <p>The built-in words of the Stacker language are put in several groups 
+ depending on what they do. The groups are as follows:</p>
+ <ol> 
+     <li><em>Logical</em>: These words provide the logical operations for
+     comparing stack operands.<br/>The words are: &lt; &gt; &lt;= &gt;= 
+     = &lt;&gt; TRUE FALSE.</li>
+     <li><em>Bitwise</em>: These words perform bitwise computations on 
+     their operands. <br/> The words are: &lt;&lt; &gt;&gt; OR XOR AND NOT</li>
+     <li><em>Arithmetic</em>: These words perform arithmetic computations on
+     their operands. <br/> The words are: ABS NEG + - * / MOD */ ++ -- MIN MAX</li>
+     <li><em>Stack</em>: These words manipulate the stack directly by moving
+     its elements around.<br/> The words are: DROP DROP2 NIP NIP2 DUP DUP2 
+     SWAP SWAP2 OVER OVER2 ROT ROT2 RROT RROT2 TUCK TUCK2 PICK SELECT ROLL</li>
+     <li><em>Memory</em>: These words allocate, free, and manipulate memory
+     areas outside the stack.<br/>The words are: MALLOC FREE GET PUT</li>
+     <li><em>Control</em>: These words alter the normal left to right flow
+     of execution.<br/>The words are: IF ELSE ENDIF WHILE END RETURN EXIT RECURSE</li>
+     <li><em>I/O</em>: These words perform output on the standard output
+     and input on the standard input. No other I/O is possible in Stacker.
+     <br/>The words are: SPACE TAB CR &gt;s &gt;d &gt;c &lt;s &lt;d &lt;c DUMP.</li>
+ </ol>
+ <p>While you may be familiar with many of these operations from other
+ programming languages, a careful review of their semantics is important
+ for correct programming in Stacker. Of most importance is the effect 
+ that each of these built-in words has on the global stack. The effect is
+ not always intuitive. To better describe the effects, we'll borrow from Forth the idiom of
+ describing the effect on the stack with:</p>
+ <p><code> BEFORE -- AFTER </code></p> 
+ <p>That is, to the left of the -- is a representation of the stack before
+ the operation. To the right of the -- is a representation of the stack
+ after the operation. In the table below that describes the operation of
+ each of the built in words, we will denote the elements of the stack 
+ using the following construction:</p>
+ <ol>
+     <li><em>b</em> - a boolean truth value</li>
+     <li><em>w</em> - a normal integer valued word.</li>
+     <li><em>s</em> - a pointer to a string value</li>
+     <li><em>p</em> - a pointer to a malloc'd memory block</li>
+ </ol>
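+ <p>To make the notation concrete, here is a short trace that applies a few of
+ the stack effects from the table below to literal values. The definition name
+ <code>notation_demo</code> is hypothetical. The comments show the expected
+ stack after each line, with the top of the stack rightmost; they are derived
+ from the Operation column rather than from running the compiler:</p>
+ <pre><code>
+ : notation_demo
+     1 2         ( stack: 1 2 )
+     SWAP        ( w1 w2 -- w2 w1      stack: 2 1 )
+     OVER        ( w1 w2 -- w1 w2 w1   stack: 2 1 2 )
+     DROP        ( w --                stack: 2 1 )
+     DUP         ( w1 -- w1 w1         stack: 2 1 1 )
+     +           ( w1 w2 -- w2+w1      stack: 2 2 )
+ ;
+ </code></pre>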
+ </div>
+ <div class="doc_text" >
+     <table>
+ <tr><th colspan="4">Definition Of Operation Of Built In Words</th></tr>
+ <tr><th colspan="4"><b>LOGICAL OPERATIONS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr>
+     <td>&lt;</td>
+     <td>LT</td>
+     <td>w1 w2 -- b</td>
+     <td>Two values (w1 and w2) are popped off the stack and
+     compared. If w1 is less than w2, TRUE is pushed back on
+     the stack, otherwise FALSE is pushed back on the stack.</td>
+ </tr>
+ <tr><td>></td>
+     <td>GT</td>
+     <td>w1 w2 -- b</td>
+     <td>Two values (w1 and w2) are popped off the stack and
+     compared. If w1 is greater than w2, TRUE is pushed back on
+     the stack, otherwise FALSE is pushed back on the stack.</td>
+ </tr>
+ <tr><td>>=</td>
+     <td>GE</td>
+     <td>w1 w2 -- b</td>
+     <td>Two values (w1 and w2) are popped off the stack and
+     compared. If w1 is greater than or equal to w2, TRUE is 
+     pushed back on the stack, otherwise FALSE is pushed back 
+     on the stack.</td>
+ </tr>
+ <tr><td>&lt;=</td>
+     <td>LE</td>
+     <td>w1 w2 -- b</td>
+     <td>Two values (w1 and w2) are popped off the stack and
+     compared. If w1 is less than or equal to w2, TRUE is 
+     pushed back on the stack, otherwise FALSE is pushed back 
+     on the stack.</td>
+ </tr>
+ <tr><td>=</td>
+     <td>EQ</td>
+     <td>w1 w2 -- b</td>
+     <td>Two values (w1 and w2) are popped off the stack and
+     compared. If w1 is equal to w2, TRUE is 
+     pushed back on the stack, otherwise FALSE is pushed back 
+     on the stack.</td>
+ </tr>
+ <tr><td>&lt;&gt;</td>
+     <td>NE</td>
+     <td>w1 w2 -- b</td>
+     <td>Two values (w1 and w2) are popped off the stack and
+     compared. If w1 is not equal to w2, TRUE is 
+     pushed back on the stack, otherwise FALSE is pushed back 
+     on the stack.</td>
+ </tr>
+ <tr><td>FALSE</td>
+     <td>FALSE</td>
+     <td> -- b</td>
+     <td>The boolean value FALSE (0) is pushed on to the stack.</td>
+ </tr>
+ <tr><td>TRUE</td>
+     <td>TRUE</td>
+     <td> -- b</td>
+     <td>The boolean value TRUE (-1) is pushed on to the stack.</td>
+ </tr>
+ <tr><th colspan="4"><b>BITWISE OPERATORS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr><td>&lt;&lt;</td>
+     <td>SHL</td>
+     <td>w1 w2 -- w1&lt;&lt;w2</td>
+     <td>Two values (w1 and w2) are popped off the stack. The w2
+     operand is shifted left by the number of bits given by the
+     w1 operand. The result is pushed back to the stack.</td>
+ </tr>
+ <tr><td>>></td>
+     <td>SHR</td>
+     <td>w1 w2 -- w1>>w2</td>
+     <td>Two values (w1 and w2) are popped off the stack. The w2
+     operand is shifted right by the number of bits given by the
+     w1 operand. The result is pushed back to the stack.</td>
+ </tr>
+ <tr><td>OR</td>
+     <td>OR</td>
+     <td>w1 w2 -- w2|w1</td>
+     <td>Two values (w1 and w2) are popped off the stack. The values
+     are bitwise OR'd together and pushed back on the stack. This is 
+     not a logical OR. The sequence 1 2 OR yields 3 not 1.</td>
+ </tr>
+ <tr><td>AND</td>
+     <td>AND</td>
+     <td>w1 w2 -- w2&amp;w1</td>
+     <td>Two values (w1 and w2) are popped off the stack. The values
+     are bitwise AND'd together and pushed back on the stack. This is 
+     not a logical AND. The sequence 1 2 AND yields 0 not 1.</td>
+ </tr>
+ <tr><td>XOR</td>
+     <td>XOR</td>
+     <td>w1 w2 -- w2^w1</td>
+     <td>Two values (w1 and w2) are popped off the stack. The values
+     are bitwise exclusive OR'd together and pushed back on the stack. 
+     For example, the sequence 1 3 XOR yields 2.</td>
+ </tr>
+ <tr><th colspan="4"><b>ARITHMETIC OPERATORS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr><td>ABS</td>
+     <td>ABS</td>
+     <td>w -- |w|</td>
+     <td>One value is popped off the stack; its absolute value is computed
+     and then pushed on to the stack. For example, if w is -1 the result is 1,
+     and if w is 1 the result is also 1.</td>
+ </tr>
+ <tr><td>NEG</td>
+     <td>NEG</td>
+     <td>w -- -w</td>
+     <td>One value is popped off the stack, negated, and then
+     pushed back on to the stack. For example, if w is -1 the result is 1,
+     and if w is 1 the result is -1.</td>
+ </tr>
+ <tr><td> + </td>
+     <td>ADD</td>
+     <td>w1 w2 -- w2+w1</td>
+     <td>Two values are popped off the stack. Their sum is pushed back
+     on to the stack</td>
+ </tr>
+ <tr><td> - </td>
+     <td>SUB</td>
+     <td>w1 w2 -- w2-w1</td>
+     <td>Two values are popped off the stack. Their difference is pushed back
+     on to the stack</td>
+ </tr>
+ <tr><td> * </td>
+     <td>MUL</td>
+     <td>w1 w2 -- w2*w1</td>
+     <td>Two values are popped off the stack. Their product is pushed back
+     on to the stack</td>
+ </tr>
+ <tr><td> / </td>
+     <td>DIV</td>
+     <td>w1 w2 -- w2/w1</td>
+     <td>Two values are popped off the stack. Their quotient is pushed back
+     on to the stack</td>
+ </tr>
+ <tr><td>MOD</td>
+     <td>MOD</td>
+     <td>w1 w2 -- w2%w1</td>
+     <td>Two values are popped off the stack. The remainder after division
+     of w2 by w1 is pushed back on to the stack.</td>
+ </tr>
+ <tr><td> */ </td>
+     <td>STAR_SLASH</td>
+     <td>w1 w2 w3 -- (w3*w2)/w1</td>
+     <td>Three values are popped off the stack. The product of w3 and w2 is
+     divided by w1. The result is pushed back on to the stack.</td>
+ </tr>
+ <tr><td> ++ </td>
+     <td>INCR</td>
+     <td>w -- w+1</td>
+     <td>One value is popped off the stack. It is incremented by one and then
+     pushed back on to the stack.</td>
+ </tr>
+ <tr><td> -- </td>
+     <td>DECR</td>
+     <td>w -- w-1</td>
+     <td>One value is popped off the stack. It is decremented by one and then
+     pushed back on to the stack.</td>
+ </tr>
+ <tr><td>MIN</td>
+     <td>MIN</td>
+     <td>w1 w2 -- (w2&lt;w1?w2:w1)</td>
+     <td>Two values are popped off the stack. The smaller one is pushed back
+     on to the stack.</td>
+ </tr>
+ <tr><td>MAX</td>
+     <td>MAX</td>
+     <td>w1 w2 -- (w2>w1?w2:w1)</td>
+     <td>Two values are popped off the stack. The larger value is pushed back
+ 	on to the stack.</td>
+ </tr>
+ <tr><th colspan="4"><b>STACK MANIPULATION OPERATORS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr><td>DROP</td>
+     <td>DROP</td>
+     <td>w -- </td>
+     <td>One value is popped off the stack.</td>
+ </tr>
+ <tr><td>DROP2</td>
+     <td>DROP2</td>
+     <td>w1 w2 -- </td>
+     <td>Two values are popped off the stack.</td>
+ </tr>
+ <tr><td>NIP</td>
+     <td>NIP</td>
+     <td>w1 w2 -- w2</td>
+     <td>The second value on the stack is removed from the stack. That is,
+ 	a value is popped off the stack and retained. Then a second value is
+ 	popped and the retained value is pushed.</td>
+ </tr>
+ <tr><td>NIP2</td>
+     <td>NIP2</td>
+     <td>w1 w2 w3 w4 -- w3 w4</td>
+     <td>The third and fourth values on the stack are removed from it. That is,
+ 	two values are popped and retained. Then two more values are popped and
+ 	the two retained values are pushed back on.</td>
+ </tr>
+ <tr><td>DUP</td>
+     <td>DUP</td>
+     <td>w1 -- w1 w1</td>
+     <td>One value is popped off the stack. That value is then pushed on to
+ 	the stack twice to duplicate the top stack value.</td>
+ </tr>
+ <tr><td>DUP2</td>
+     <td>DUP2</td>
+     <td>w1 w2 -- w1 w2 w1 w2</td>
+     <td>The top two values on the stack are duplicated. That is, two values
+ 	are popped off the stack. They are alternately pushed back on the
+ 	stack twice each.</td>
+ </tr>
+ <tr><td>SWAP</td>
+     <td>SWAP</td>
+     <td>w1 w2 -- w2 w1</td>
+     <td>The top two stack items are reversed in their order. That is, two
+ 	values are popped off the stack and pushed back on to the stack in
+ 	the opposite order they were popped.</td>
+ </tr>
+ <tr><td>SWAP2</td>
+     <td>SWAP2</td>
+     <td>w1 w2 w3 w4 -- w3 w4 w1 w2</td>
+     <td>The top four stack items are swapped in pairs. That is, two values
+ 	are popped and retained. Then, two more values are popped and retained.
+ 	The values are pushed back on to the stack in the reverse order but
+ 	in pairs.</td>
+ </tr>
+ <tr><td>OVER</td>
+     <td>OVER</td>
+     <td>w1 w2 -- w1 w2 w1</td>
+     <td>Two values are popped from the stack. They are pushed back
+ 	on to the stack in the order w1 w2 w1. This causes the
+ 	second stack element to be duplicated "over" the top value.</td>
+ </tr>
+ <tr><td>OVER2</td>
+     <td>OVER2</td>
+     <td>w1 w2 w3 w4 -- w1 w2 w3 w4 w1 w2</td>
+     <td>The third and fourth values on the stack are replicated on to the
+ 	top of the stack</td>
+ </tr>
+ <tr><td>ROT</td>
+     <td>ROT</td>
+     <td>w1 w2 w3 -- w2 w3 w1</td>
+     <td>The top three values are rotated. That is, three values are popped
+ 	off the stack. They are pushed back on to the stack in the order
+ 	w2 w3 w1.</td>
+ </tr>
+ <tr><td>ROT2</td>
+     <td>ROT2</td>
+     <td>w1 w2 w3 w4 w5 w6 -- w3 w4 w5 w6 w1 w2</td>
+     <td>Like ROT but the rotation is done using three pairs instead of
+ 	three singles.</td>
+ </tr>
+ <tr><td>RROT</td>
+     <td>RROT</td>
+     <td>w1 w2 w3 -- w3 w1 w2</td>
+     <td>Reverse rotation. Like ROT, but it rotates the other way around.
+ 	Essentially, the top element of the stack is moved to the third
+ 	position.</td>
+ </tr>
+ <tr><td>RROT2</td>
+     <td>RROT2</td>
+     <td>w1 w2 w3 w4 w5 w6 -- w5 w6 w1 w2 w3 w4</td>
+     <td>Double reverse rotation. Like RROT but the rotation is done using 
+ 	three pairs instead of three singles. The top pair (w5 w6) is moved
+ 	beneath the other two pairs.</td>
+ </tr>
+ <tr><td>TUCK</td>
+     <td>TUCK</td>
+     <td>w1 w2 -- w2 w1 w2</td>
+     <td>Similar to OVER except that the second operand is being 
+ 	replicated. Essentially, the first operand is being "tucked"
+ 	in between two instances of the second operand. Logically, two
+ 	values are popped off the stack. They are placed back on the
+ 	stack in the order w2 w1 w2.</td>
+ </tr>
+ <tr><td>TUCK2</td>
+     <td>TUCK2</td>
+     <td>w1 w2 w3 w4 -- w3 w4 w1 w2 w3 w4</td>
+     <td>Like TUCK but a pair of elements is tucked over two pairs.
+ 	That is, the top two elements of the stack are duplicated and
+ 	inserted into the stack at the fifth and sixth positions.</td>
+ </tr>
+ <tr><td>PICK</td>
+     <td>PICK</td>
+     <td>x0 ... Xn n -- x0 ... Xn x0</td>
+     <td>The top of the stack is used as an index into the remainder of
+ 	the stack. The element at the nth position replaces the index 
+ 	(top of stack). This is useful for cycling through a set of 
+ 	values. Note that indexing is zero based. So, if n=0 then you
+ 	get the second item on the stack. If n=1 you get the third, etc.
+ 	Note also that the index is replaced by the n'th value. </td>
+ </tr>
+ <tr><td>SELECT</td>
+     <td>SELECT</td>
+     <td>m n X0..Xm Xm+1 .. Xn -- Xm</td>
+     <td>This is like PICK but the list is removed and you need to specify
+ 	both the index and the size of the list. Careful with this one,
+ 	the wrong value for n can blow away a huge amount of the stack.</td>
+ </tr>
+ <tr><td>ROLL</td>
+     <td>ROLL</td>
+     <td>x0 x1 .. xn n -- x1 .. xn x0</td>
+     <td><b>Not Implemented</b>. This one has been left as an exercise to
+ 	the student. See <a href="#exercise">Exercise</a>. ROLL requires 
+     a value, "n", to be on the top of the stack. This value specifies how 
+     far into the stack to "roll". The n'th value is <em>moved</em> (not
+     copied) from its location and replaces the "n" value on the top of the
+     stack. In this way, all the values between "n" and x0 roll up the stack.
+     The operation of ROLL is a generalized ROT.  The "n" value specifies 
+     how much to rotate. That is, ROLL with n=1 is the same as ROT and 
+     ROLL with n=2 is the same as ROT2.</td>
+ </tr>
+ <tr><th colspan="4"><b>MEMORY OPERATORS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr><td>MALLOC</td>
+     <td>MALLOC</td>
+     <td>w1 -- p</td>
+     <td>One value is popped off the stack. The value is used as the size
+ 	of a memory block to allocate. The size is in bytes, not words.
+         The memory allocation is completed and the address of the memory
+ 	block is pushed on to the stack.</td>
+ </tr>
+ <tr><td>FREE</td>
+     <td>FREE</td>
+     <td>p -- </td>
+     <td>One pointer value is popped off the stack. The value should be
+ 	the address of a memory block created by the MALLOC operation. The
+ 	associated memory block is freed. Nothing is pushed back on the
+ 	stack. Many bugs can be created by attempting to FREE something
+ 	that isn't a pointer to a MALLOC allocated memory block. Make
+ 	sure you know what's on the stack.  One way to do this is with
+ 	the following idiom:<br/>
+ 	<code>64 MALLOC DUP DUP (use ptr) DUP (use ptr) ...  FREE</code>
+ 	<br/>This ensures that an extra copy of the pointer is placed on
+ 	the stack (for the FREE at the end) and that every use of the
+ 	pointer is preceded by a DUP to retain the copy for FREE.</td>
+ </tr>
+ <tr><td>GET</td>
+     <td>GET</td>
+     <td>w1 p -- w2 p</td>
+     <td>An integer index and a pointer to a memory block are popped off
+ 	the stack. The index is used to index one byte from the memory
+ 	block. That byte value is retained, the pointer is pushed again
+ 	and the retained value is pushed. Note that the pointer value
+ 	is essentially retained in its position so this doesn't count
+ 	as a "use ptr" in the FREE idiom.</td>
+ </tr>
+ <tr><td>PUT</td>
+     <td>PUT</td>
+     <td>w1 w2 p -- p </td>
+     <td>An integer value is popped off the stack. This is the value to
+ 	be put into a memory block. Another integer value is popped off
+ 	the stack. This is the indexed byte in the memory block. A
+ 	pointer to the memory block is popped off the stack. The
+ 	first value (w1) is then converted to a byte and written
+ 	to the element of the memory block(p) at the index given
+ 	by the second value (w2). The pointer to the memory block is
+ 	pushed back on the stack so this doesn't count as a "use ptr"
+ 	in the FREE idiom.</td>
+ </tr>
+ <tr><th colspan="4"><b>CONTROL FLOW OPERATORS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr><td>RETURN</td>
+     <td>RETURN</td>
+     <td> --  </td>
+     <td>The currently executing definition returns immediately to its caller.
+ 	Note that there is an implicit <code>RETURN</code> at the end of each
+ 	definition, logically located at the semi-colon. The sequence 
+ 	<code>RETURN ;</code>  is valid but redundant.</td>
+ </tr>
+ <tr><td>EXIT</td>
+     <td>EXIT</td>
+     <td>w1 -- </td>
+     <td>A return value for the program is popped off the stack. The program is
+ 	then immediately terminated. This is normally an abnormal exit from the
+ 	program. For a normal exit (when <code>MAIN</code> finishes), the exit
+ 	code will always be zero in accordance with UNIX conventions.</td>
+ </tr>
+ <tr><td>RECURSE</td>
+     <td>RECURSE</td>
+     <td> -- </td>
+     <td>The currently executing definition is called again. This operation is 
+ 	needed since the definition of a word doesn't exist until the semicolon
+ 	is reached. Attempting something like:<br/>
+ 	<code> : recurser recurser ; </code><br/> will yield an error saying that 
+ 	"recurser" is not defined yet. To accomplish the same thing, change this
+ 	to:<br/>
+ 	<code> : recurser RECURSE ; </code></td>
+ </tr>
+ <tr><td>IF (words...) ENDIF</td>
+     <td>IF (words...) ENDIF</td>
+     <td>b -- </td>
+     <td>A boolean value is popped off the stack. If it is non-zero then the "words..." 
+ 	are executed. Otherwise, execution continues immediately following the ENDIF.</td>
+ </tr>
+ <tr><td>IF (words...) ELSE (words...) ENDIF</td>
+     <td>IF (words...) ELSE (words...) ENDIF</td>
+     <td>b -- </td>
+     <td>A boolean value is popped off the stack. If it is non-zero then the "words..."
+ 	between IF and ELSE are executed. Otherwise the words between ELSE and ENDIF are
+ 	executed. In either case, after the (words....) have executed, execution continues
+         immediately following the ENDIF. </td>
+ </tr>
+ <tr><td>WHILE word END</td>
+     <td>WHILE word END</td>
+     <td>b -- b </td>
+     <td>The boolean value on the top of the stack is examined (not popped). If 
+       it is non-zero then the "word" between WHILE and END is executed. 
+       Execution then begins again at the WHILE where the boolean on the top of 
+       the stack is examined again. The stack is not modified by the WHILE...END 
+       loop, only examined. It is imperative that the "word" in the body of the
+       loop ensure that the top of the stack contains the next boolean to examine
+       when it completes.  Note that since booleans and integers can be coerced 
+       you can use the following "for loop" idiom:<br/>
+ 	<code>(push count) WHILE word -- END</code><br/>
+ 	For example:<br/>
+ 	<code>10 WHILE >d -- END</code><br/>
+         This will print the numbers from 10 down to 1. 10 is pushed on the 
+         stack. Since that is non-zero, the while loop is entered. The top of 
+         the stack (10) is printed out with >d. The top of the stack is 
+         decremented, yielding 9 and control is transferred back to the WHILE 
+         keyword. The process starts all over again and repeats until
+         the top of stack is decremented to 0 at which point the WHILE test 
+         fails and control is transferred to the word after the END.
+       </td>
+ </tr>
+ <tr><th colspan="4"><b>INPUT &amp; OUTPUT OPERATORS</b></th></tr>
+ <tr>
+     <td>Word</td>
+     <td>Name</td>
+     <td>Operation</td>
+     <td>Description</td>
+ </tr>
+ <tr><td>SPACE</td>
+     <td>SPACE</td>
+     <td> --  </td>
+     <td>A space character is put out. There is no stack effect.</td>
+ </tr>
+ <tr><td>TAB</td>
+     <td>TAB</td>
+     <td> --  </td>
+     <td>A tab character is put out. There is no stack effect.</td>
+ </tr>
+ <tr><td>CR</td>
+     <td>CR</td>
+     <td> --  </td>
+     <td>A carriage return character is put out. There is no stack effect.</td>
+ </tr>
+ <tr><td>>s</td>
+     <td>OUT_STR</td>
+     <td> -- </td>
+     <td>A string pointer is popped from the stack. It is put out.</td>
+ </tr>
+ <tr><td>>d</td>
+     <td>OUT_STR</td>
+     <td> -- </td>
+     <td>A value is popped from the stack. It is put out as a decimal
+     integer.</td>
+ </tr>
+ <tr><td>>c</td>
+     <td>OUT_CHR</td>
+     <td> -- </td>
+     <td>A value is popped from the stack. It is put out as an ASCII
+     character.</td>
+ </tr>
+ <tr><td>&lt;s</td>
+     <td>IN_STR</td>
+     <td> -- s </td>
+     <td>A string is read from the input via the scanf(3) format string " %as".
+     The resulting string is pushed on to the stack.</td>
+ </tr>
+ <tr><td>&lt;d</td>
+     <td>IN_STR</td>
+     <td> -- w </td>
+     <td>An integer is read from the input via the scanf(3) format string " %d".
+     The resulting value is pushed on to the stack</td>
+ </tr>
+ <tr><td>&lt;c</td>
+     <td>IN_CHR</td>
+     <td> -- w </td>
+     <td>A single character is read from the input via the scanf(3) format string
+     " %c". The value is converted to an integer and pushed on to the stack.</td>
+ </tr>
+ <tr><td>DUMP</td>
+     <td>DUMP</td>
+     <td> -- </td>
+     <td>The stack contents are dumped to standard output. This is useful for
+ 	debugging your definitions. Put DUMP at the beginning and end of a definition
+ 	to see instantly the net effect of the definition.</td>
+ </tr>
+ </table>
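+ <p>As an illustration of how the logical and control flow words in the table
+ combine, the following sketch normalizes any integer to a boolean. The word
+ name <code>to_flag</code> is hypothetical, and the stack effects in the
+ comments are taken from the table rather than from a compiler run:</p>
+ <pre><code>
+ : to_flag           ( w -- b )
+     0 =             ( compare the value with zero, leaving a boolean )
+     IF              ( IF pops that boolean )
+         FALSE       ( the value was zero )
+     ELSE
+         TRUE        ( the value was non-zero )
+     ENDIF
+ ;
+ </code></pre>
+ <p>Because <code>=</code> is symmetric, this sketch does not depend on the
+ order in which the two operands are compared.</p>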
+ 
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_section"> <a name="example">Prime: A Complete Example</a></div>
+ <div class="doc_text">
+ <p>The following fully documented program highlights many features of both
+ the Stacker language and what is possible with LLVM. The program has two modes
+ of operation. If you provide numeric arguments to the program, it checks to see
+ if those arguments are prime numbers and prints out the results. Without any 
+ arguments, the program prints out any prime numbers it finds between 1 and one 
+ million (there's a lot of them!). The source code comments below tell the 
+ remainder of the story.
+ </p>
+ </div>
+ <div class="doc_text">
+ <pre><code>
+ ################################################################################
+ #
+ # Brute force prime number generator
+ #
+ # This program is written in classic Stacker style, that being the style of a 
+ # stack. Start at the bottom and read your way up !
+ #
+ # Reid Spencer - Nov 2003 
+ ################################################################################
+ # Utility definitions
+ ################################################################################
+ : print >d CR ;
+ : it_is_a_prime TRUE ;
+ : it_is_not_a_prime FALSE ;
+ : continue_loop TRUE ;
+ : exit_loop FALSE;
+     
+ ################################################################################
+ # This definition tries an actual division of a candidate prime number. It
+ # determines whether the division loop on this candidate should continue or
+ # not.
+ # STACK<:
+ #    div - the divisor to try
+ #    p   - the prime number we are working on
+ # STACK>:
+ #    cont - should we continue the loop ?
+ #    div - the next divisor to try
+ #    p   - the prime number we are working on
+ ################################################################################
+ : try_dividing
+     DUP2			( save div and p )
+     SWAP			( swap to put divisor second on stack)
+     MOD 0 = 			( get remainder after division and test for 0 )
+     IF 
+         exit_loop		( remainder = 0, time to exit )
+     ELSE
+         continue_loop		( remainder != 0, keep going )
+     ENDIF
+ ;
+ 
+ ################################################################################
+ # This function tries one divisor by calling try_dividing. But, before doing
+ # that it checks to see if the value is 1. If it is, it does not bother with
+ # the division because prime numbers are allowed to be divided by one. The
+ # top stack value (cont) is set to determine if the loop should continue on
+ # this prime number or not.
+ # STACK<:
+ #    cont - should we continue the loop (ignored)?
+ #    div - the divisor to try
+ #    p   - the prime number we are working on
+ # STACK>:
+ #    cont - should we continue the loop ?
+ #    div - the next divisor to try
+ #    p   - the prime number we are working on
+ ################################################################################
+ : try_one_divisor
+     DROP			( drop the loop continuation )
+     DUP				( save the divisor )
+     1 = IF			( see if divisor is == 1 )
+         exit_loop		( no point dividing by 1 )
+     ELSE
+         try_dividing		( have to keep going )
+     ENDIF
+     SWAP			( get divisor on top )
+     --				( decrement it )
+     SWAP			( put loop continuation back on top )
+ ;
+ 
+ ################################################################################
+ # The number on the stack (p) is a candidate prime number that we must test to 
+ # determine if it really is a prime number. To do this, we divide it by every 
+ # number from p-1 down to 1. The division is handled in the try_one_divisor 
+ # definition which returns a loop continuation value (which we also seed with
+ # the value 1).  After the loop, we check the divisor. If it decremented all
+ # the way to zero then we found a prime, otherwise we did not find one.
+ # STACK<:
+ #   p - the prime number to check
+ # STACK>:
+ #   yn - boolean indicating if its a prime or not
+ #   p - the prime number checked
+ ################################################################################
+ : try_harder
+     DUP 			( duplicate to get divisor value )
+     --				( first divisor is one less than p )
+     1				( continue the loop )
+     WHILE
+        try_one_divisor		( see if its prime )
+     END
+     DROP			( drop the continuation value )
+     0 = IF			( test for divisor == 0 )
+        it_is_a_prime		( we found one )
+     ELSE
+        it_is_not_a_prime	( nope, this one is not a prime )
+     ENDIF
+ ;
+ 
+ ################################################################################
+ # This definition determines if the number on the top of the stack is a prime 
+ # or not. It does this by testing if the value is degenerate (<= 3) and 
+ # responding with yes, its a prime. Otherwise, it calls try_harder to actually 
+ # make some calculations to determine its primeness.
+ # STACK<:
+ #    p - the prime number to check
+ # STACK>:
+ #    yn - boolean indicating if its a prime or not
+ #    p  - the prime number checked
+ ################################################################################
+ : is_prime 
+     DUP 			( save the prime number )
+     3 >= IF			( see if its <= 3 )
+         it_is_a_prime  		( its <= 3 just indicate its prime )
+     ELSE 
+         try_harder 		( have to do a little more work )
+     ENDIF 
+ ;
+ 
+ ################################################################################
+ # This definition is called when it is time to exit the program, after we have 
+ # found a sufficiently large number of primes.
+ # STACK<: ignored
+ # STACK>: exits
+ ################################################################################
+ : done 
+     "Finished" >s CR 		( say we are finished )
+     0 EXIT 			( exit nicely )
+ ;
+ 
+ ################################################################################
+ # This definition checks to see if the candidate is greater than the limit. If 
+ # it is, it terminates the program by calling done. Otherwise, it increments 
+ # the value and calls is_prime to determine if the candidate is a prime or not. 
+ # If it is a prime, it prints it. Note that the boolean result from is_prime is
+ # gobbled by the following IF which returns the stack to just containing the
+ # prime number just considered.
+ # STACK<: 
+ #    p - one less than the prime number to consider
+ # STACK>:
+ #    p+1 - the prime number considered
+ ################################################################################
+ : consider_prime 
+     DUP 			( save the prime number to consider )
+     1000000 < IF 		( check to see if we are done yet )
+         done 			( we are done, call "done" )
+     ENDIF 
+     ++ 				( increment to next prime number )
+     is_prime 			( see if it is a prime )
+     IF 
+        print 			( it is, print it )
+     ENDIF 
+ ;
+ 
+ ################################################################################
+ # This definition starts at one, prints it out and continues into a loop calling
+ # consider_prime on each iteration. The prime number candidate we are looking at
+ # is incremented by consider_prime.
+ # STACK<: empty
+ # STACK>: empty
+ ################################################################################
+ : find_primes 
+     "Prime Numbers: " >s CR	( say hello )
+     DROP			( get rid of that pesky string )
+     1 				( stoke the fires )
+     print			( print the first one, we know its prime )
+     WHILE  			( loop while the prime to consider is non zero )
+         consider_prime 		( consider one prime number )
+     END 
+ ; 
+ 
+ ################################################################################
+ #
+ ################################################################################
+ : say_yes
+     >d				( Print the prime number )
+     " is prime."		( push string to output )
+     >s				( output it )
+     CR				( print carriage return )
+     DROP			( pop string )
+ ;
+ 
+ : say_no
+     >d				( Print the prime number )
+     " is NOT prime."		( push string to put out )
+     >s				( put out the string )
+     CR				( print carriage return )
+     DROP			( pop string )
+ ;
+ 
+ ################################################################################
+ # This definition processes a single command line argument and determines if it
+ # is a prime number or not.
+ # STACK<:
+ #    n - number of arguments
+ #    arg1 - the prime numbers to examine
+ # STACK>:
+ #    n-1 - one less than number of arguments
+ #    arg2 - we processed one argument
+ ################################################################################
+ : do_one_argument
+     --				( decrement loop counter )
+     SWAP			( get the argument value  )
+     is_prime IF			( determine if its prime )
+         say_yes			( uhuh )
+     ELSE
+         say_no			( nope )
+     ENDIF
+     DROP			( done with that argument )
+ ;
+ 
+ ################################################################################
+ # This definition loops over the arguments, calling do_one_argument on each.
+ # STACK<:
+ #    n - number of arguments
+ #    ... - the arguments
+ ################################################################################
+ : process_arguments
+     WHILE			( while there are more arguments )
+        do_one_argument		( process one argument )
+     END
+ ;
+     
+ ################################################################################
+ # The MAIN program just prints a banner and processes its arguments.
+ # STACK<: arguments
+ ################################################################################
+ : MAIN 
+     NIP				( get rid of the program name )
+     --				( reduce number of arguments )
+     DUP				( save the arg counter )
+     1 <= IF			( See if we got an argument )
+         process_arguments	( tell user if they are prime )
+     ELSE
+         find_primes		( see how many we can find )
+     ENDIF
+     0				( push return code )
+ ;
+ </code>
+ </pre>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_section"> <a name="internal">Internals</a></div>
+ <div class="doc_text">
+  <p><b>This section is under construction.</b></p>
+  <p>In the meantime, you can always read the code! It has comments!</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="directory">Directory Structure</a></div>
+ <div class="doc_text">
+ <p>The source code, test programs, and sample programs can all be found
+ under the LLVM "projects" directory. You will need to obtain the LLVM sources
+ to find it (either via anonymous CVS or a tarball; see the 
+ <a href="GettingStarted.html">Getting Started</a> document).</p>
+ <p>Under the "projects" directory there is a directory named "Stacker". That
+ directory contains everything, as follows:</p>
+ <ul>
+     <li><em>lib</em> - contains most of the source code
+     <ul>
+ 	<li><em>lib/compiler</em> - contains the compiler library
+ 	<li><em>lib/runtime</em> - contains the runtime library
+     </ul></li>
+     <li><em>test</em> - contains the test programs</li>
+     <li><em>tools</em> - contains the Stacker compiler main program, stkrc
+     <ul>
+ 	<li><em>tools/stkrc</em> - contains the Stacker compiler main program</li>
+     </ul></li>
+     <li><em>sample</em> - contains the sample programs</li>
+ </ul>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="lexer"></a>The Lexer</div>
+ <div class="doc_text">
+ <p>See projects/Stacker/lib/compiler/Lexer.l</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="parser"></a>The Parser</div>
+ <div class="doc_text">
+ <p>See projects/Stacker/lib/compiler/StackerParser.y</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="compiler"></a>The Compiler</div>
+ <div class="doc_text">
+ <p>See projects/Stacker/lib/compiler/StackerCompiler.cpp</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="runtime"></a>The Runtime</div>
+ <div class="doc_text">
+ <p>See projects/Stacker/lib/runtime/stacker_rt.c</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="driver"></a>Compiler Driver</div>
+ <div class="doc_text">
+ <p>See projects/Stacker/tools/stkrc/stkrc.cpp</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="tests"></a>Test Programs</div>
+ <div class="doc_text">
+ <p>See projects/Stacker/test/*.st</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"> <a name="exercise">Exercise</a></div>
+ <div class="doc_text">
+ <p>As you may have noted from a careful inspection of the Built-In word
+ definitions, the ROLL word is not implemented. This word was left out of 
+ Stacker on purpose so that it can be an exercise for the student.  The exercise 
+ is to implement the ROLL functionality (in your own workspace) and build a test 
+ program for it.  If you can implement ROLL, you understand Stacker and probably 
+ a fair amount about LLVM since this is one of the more complicated Stacker 
+ operations. The work will be almost completely limited to the 
+ <a href="#compiler">compiler</a>.</p>
+ <p>The ROLL word is already recognized by both the lexer and parser but ignored 
+ by the compiler. That means you don't have to futz around with figuring out how
+ to get the keyword recognized. It already is.  The part of the compiler that
+ you need to implement is the <code>ROLL</code> case in the 
+ <code>StackerCompiler::handle_word(int)</code> method. See the
+ implementations of PICK and SELECT in the same method to get some hints about
+ how to complete this exercise.</p>
+ <p>Good luck!</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="todo">Things Remaining To Be Done</a></div>
+ <div class="doc_text">
+ <p>The initial implementation of Stacker has several deficiencies. If you're
+ interested, here are some things that could be implemented better:</p>
+ <ol>
+     <li>Write an LLVM pass to compute the correct stack depth needed by the
+     program. Currently the stack is set to a fixed number which means programs
+     with large numbers of definitions might fail.</li>
+     <li>Write an LLVM pass to optimize the use of the global stack. The code
+     emitted currently is somewhat wasteful. It gets cleaned up a lot by existing
+     passes but more could be done.</li>
+     <li>Make the compiler driver use the LLVM linking facilities (with IPO)
+     before depending on GCC to do the final link.</li>
+     <li>Clean up parsing. It doesn't handle errors very well.</li>
+     <li>Rearrange the StackerCompiler.cpp code to make better use of inserting
+     instructions before a block's terminating instruction. I didn't figure this
+     technique out until I was nearly done with LLVM. As it is, it's a bad example
+     of how to insert instructions!</li>
+     <li>Provide for I/O to arbitrary files instead of just stdin/stdout.</li>
+     <li>Write additional built-in words, taking inspiration from FORTH.</li>
+     <li>Write additional sample Stacker programs.</li>
+     <li>Add your own compiler writing experiences and tips in the 
+     <a href="#lessons">Lessons I Learned About LLVM</a> section.</li>
+ </ol>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:rspencer at x10sys.com">Reid Spencer</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/SystemLibrary.html
diff -c /dev/null llvm-www/releases/1.8/docs/SystemLibrary.html:1.1
*** /dev/null	Wed Aug  9 00:56:52 2006
--- llvm-www/releases/1.8/docs/SystemLibrary.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,344 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>System Library</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">System Library</div>
+ <ul>
+   <li><a href="#abstract">Abstract</a></li>
+   <li><a href="#requirements">Keeping LLVM Portable</a>
+   <ol>
+     <li><a href="#headers">Don't Include System Headers</a></li>
+     <li><a href="#expose">Don't Expose System Headers</a></li>
+     <li><a href="#c_headers">Allow Standard C Header Files</a></li>
+     <li><a href="#cpp_headers">Allow Standard C++ Header Files</a></li>
+     <li><a href="#highlev">High-Level Interface</a></li>
+     <li><a href="#nofunc">No Exposed Functions</a></li>
+     <li><a href="#nodata">No Exposed Data</a></li>
+     <li><a href="#nodupl">No Duplicate Implementations</a></li>
+     <li><a href="#nounused">No Unused Functionality</a></li>
+     <li><a href="#virtuals">No Virtual Methods</a></li>
+     <li><a href="#softerrors">Minimize Soft Errors</a></li>
+     <li><a href="#throw">Throw Only std::string</a></li>
+     <li><a href="#throw_spec">No throw() Specifications</a></li>
+     <li><a href="#organization">Code Organization</a></li>
+     <li><a href="#semantics">Consistent Semantics</a></li>
+     <li><a href="#bug">Tracking Bugzilla Bug: 351</a></li>
+   </ol></li>
+ </ul>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:rspencer at x10sys.com">Reid Spencer</a></p>
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="abstract">Abstract</a></div>
+ <div class="doc_text">
+   <p>This document provides some details on LLVM's System Library, located in
+   the source at <tt>lib/System</tt> and <tt>include/llvm/System</tt>. The
+   library's purpose is to shield LLVM from the differences between operating
+   systems for the few services LLVM needs from the operating system. Much of
+   LLVM is written using portability features of standard C++. However, in a few
+   areas, system dependent facilities are needed and the System Library is the
+   wrapper around those system calls.</p>
+   <p>By centralizing LLVM's use of operating system interfaces, we make it 
+   possible for the LLVM tool chain and runtime libraries to be more easily 
+   ported to new platforms since (theoretically) only <tt>lib/System</tt> needs 
+   to be ported.  This library also unclutters the rest of LLVM from #ifdef use 
+   and special cases for specific operating systems. Such uses are replaced 
+   with simple calls to the interfaces provided in <tt>include/llvm/System</tt>.
+   </p> 
+   <p>Note that the System Library is not intended to be a complete operating 
+   system wrapper (such as the Adaptive Communications Environment (ACE) or 
+   Apache Portable Runtime (APR)), but only provides the functionality necessary
+   to support LLVM.</p>
+   <p>The System Library was written by Reid Spencer who formulated the
+   design based on similar work originating from the eXtensible Programming 
+   System (XPS). Several people helped with the effort, especially
+   Jeff Cohen and Henrik Bach on the Win32 port.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="requirements">Keeping LLVM Portable</a>
+ </div>
+ <div class="doc_text">
+   <p>In order to keep LLVM portable, LLVM developers should adhere to a set of
+   portability rules associated with the System Library. Adherence to these rules
+   should help the System Library achieve its goal of shielding LLVM from the
+   variations in operating system interfaces and doing so efficiently.  The 
+   following sections define the rules needed to fulfill this objective.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="headers">Don't Include System Headers</a>
+ </div>
+ <div class="doc_text">
+   <p>Except in <tt>lib/System</tt>, no LLVM source code should directly
+   <tt>#include</tt> a system header. Care has been taken to remove all such
+   <tt>#includes</tt> from LLVM while <tt>lib/System</tt> was being
+   developed.  Specifically this means that header files like "unistd.h", 
+   "windows.h", "stdio.h", and "string.h" are forbidden to be included by LLVM 
+   source code outside the implementation of <tt>lib/System</tt>.</p>
+   <p>To obtain system-dependent functionality, existing interfaces to the system
+   found in <tt>include/llvm/System</tt> should be used. If an appropriate 
+   interface is not available, it should be added to <tt>include/llvm/System</tt>
+   and implemented in <tt>lib/System</tt> for all supported platforms.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="expose">Don't Expose System Headers</a>
+ </div>
+ <div class="doc_text">
+   <p>The System Library must shield LLVM from <em>all</em> system headers. To 
+   obtain system level functionality, LLVM source must 
+   <tt>#include "llvm/System/Thing.h"</tt> and nothing else. This means that 
+   <tt>Thing.h</tt> cannot expose any system header files. This protects LLVM 
+   from accidentally using system specific functionality and only allows it
+   via the <tt>lib/System</tt> interface.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="c_headers">Use Standard C Headers</a></div>
+ <div class="doc_text">
+   <p>The <em>standard</em> C headers (the ones beginning with "c") are allowed
+   to be exposed through the <tt>lib/System</tt> interface. These headers and 
+   the things they declare are considered to be platform agnostic. LLVM source 
+   files may include them directly or obtain their inclusion through 
+   <tt>lib/System</tt> interfaces.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="cpp_headers">Use Standard C++ Headers</a>
+ </div>
+ <div class="doc_text">
+   <p>The <em>standard</em> C++ headers from the standard C++ library and
+   standard template library may be exposed through the <tt>lib/System</tt>
+   interface. These headers and the things they declare are considered to be
+   platform agnostic. LLVM source files may include them or obtain their
+   inclusion through lib/System interfaces.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="highlev">High Level Interface</a></div>
+ <div class="doc_text">
+   <p>The entry points specified in the interface of lib/System must be aimed at 
+   completing some reasonably high level task needed by LLVM. We do not want to
+   simply wrap each operating system call. It would be preferable to wrap several
+   operating system calls that are always used in conjunction with one another by
+   LLVM.</p>
+   <p>For example, consider what is needed to execute a program, wait for it to
+   complete, and return its result code. On Unix, this involves the following
+   operating system calls: <tt>getenv, fork, execve,</tt> and <tt>wait</tt>. The
+   correct thing for lib/System to provide is a function, say
+   <tt>ExecuteProgramAndWait</tt>, that implements the functionality completely.
+   What we don't want is wrappers for the operating system calls involved.</p>
+   <p>There must <em>not</em> be a one-to-one relationship between operating
+   system calls and the System library's interface. Any such interface function
+   will be suspicious.</p>
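+   <p>A minimal sketch of what such a function might look like on Unix follows.
+   The name <tt>ExecuteProgramAndWait</tt> comes from the text above, but the
+   signature, the use of <tt>execvp</tt> and <tt>waitpid</tt> (rather than
+   exactly the calls listed), and the exit code 127 for a failed exec are
+   assumptions made for illustration; this is not the actual
+   <tt>lib/System</tt> code.</p>

```cpp
#include <cassert>
#include <cerrno>
#include <cstring>
#include <string>
#include <vector>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

// Hypothetical interface function: runs Program with Args, waits for it to
// finish, and returns its exit code. Throws std::string on hard errors.
int ExecuteProgramAndWait(const std::string &Program,
                          const std::vector<std::string> &Args) {
  assert(!Program.empty() && "a program name is required");

  // Build a null-terminated argv; argv[0] is the program name by convention.
  std::vector<char *> Argv;
  Argv.push_back(const_cast<char *>(Program.c_str()));
  for (size_t i = 0; i != Args.size(); ++i)
    Argv.push_back(const_cast<char *>(Args[i].c_str()));
  Argv.push_back(static_cast<char *>(0));

  pid_t Child = fork();
  if (Child < 0) {
    int Err = errno; // Save errno before anything else can change it.
    throw Program + ": Unable to fork: " + std::strerror(Err);
  }

  if (Child == 0) {
    // In the child: replace the process image. execvp searches PATH.
    execvp(Program.c_str(), &Argv[0]);
    _exit(127); // Only reached if execvp failed (assumed convention).
  }

  // In the parent: wait for the child, retrying if a signal interrupts us.
  int Status = 0;
  while (waitpid(Child, &Status, 0) < 0) {
    int Err = errno;
    if (Err != EINTR)
      throw Program + ": Unable to wait for child: " + std::strerror(Err);
  }
  if (WIFEXITED(Status))
    return WEXITSTATUS(Status);
  throw Program + ": Program terminated abnormally";
}
```

+   <p>A caller would see a single call, with no <tt>fork</tt> or
+   <tt>waitpid</tt> details leaking through the interface.</p>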
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="nounused">No Unused Functionality</a></div>
+ <div class="doc_text">
+   <p>There must be no functionality specified in the interface of lib/System 
+   that isn't actually used by LLVM. We're not writing a general purpose
+   operating system wrapper here, just enough to satisfy LLVM's needs. And, LLVM
+   doesn't need much. This design goal aims to keep the lib/System interface
+   small and understandable which should foster its actual use and adoption.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="nodupl">No Duplicate Implementations</a>
+ </div>
+ <div class="doc_text">
+   <p>The implementation of a function for a given platform must be written
+   exactly once. This implies that it must be possible to apply a function's 
+   implementation to multiple operating systems if those operating systems can
+   share the same implementation. This rule applies to the set of operating
+   systems supported for a given class of operating system (e.g. Unix, Win32).
+   </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="virtuals">No Virtual Methods</a></div>
+ <div class="doc_text">
+   <p>The System Library interfaces can be called quite frequently by LLVM. In
+   order to make those calls as efficient as possible, we discourage the use of
+   virtual methods. There is no need to use inheritance for implementation
+   differences; it just adds complexity. The <tt>#include</tt> mechanism works
+   just fine.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="nofunc">No Exposed Functions</a></div>
+ <div class="doc_text">
+   <p>Any functions defined by system libraries (i.e. not defined by lib/System) 
+   must not be exposed through the lib/System interface, even if the header file 
+   for that function is not exposed. This prevents inadvertent use of system
+   specific functionality.</p>
+   <p>For example, the <tt>stat</tt> system call is notorious for having
+   variations in the data it provides. <tt>lib/System</tt> must not declare 
+   <tt>stat</tt> nor allow it to be declared. Instead it should provide its own 
+   interface to discovering information about files and directories. Those 
+   interfaces may be implemented in terms of <tt>stat</tt> but that is strictly 
+   an implementation detail. The interface provided by the System Library must
+   be implemented on all platforms (even those without <tt>stat</tt>).</p>
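+   <p>As a rough illustration of this rule, the sketch below hides
+   <tt>stat</tt> behind a platform-neutral record. The names
+   <tt>FileInfo</tt> and <tt>GetFileInfo</tt> are hypothetical and are not
+   taken from <tt>lib/System</tt>. In a real layout the struct and the
+   function declaration would live in the interface header, and the
+   <tt>stat</tt> call would live only in the Unix implementation file.</p>

```cpp
#include <cerrno>
#include <cstring>
#include <string>
#include <sys/stat.h> // Implementation detail; never exposed to callers.

// Hypothetical platform-neutral record: no system types appear in it.
struct FileInfo {
  bool Exists;             // False if nothing is found at the path.
  bool IsDirectory;        // Meaningful only when Exists is true.
  unsigned long long Size; // Size in bytes; meaningful only when Exists.
};

// Hypothetical interface function, implemented here in terms of stat.
FileInfo GetFileInfo(const std::string &Path) {
  FileInfo Info;
  Info.Exists = false;
  Info.IsDirectory = false;
  Info.Size = 0;

  struct stat Buf;
  if (stat(Path.c_str(), &Buf) != 0) {
    int Err = errno;
    // "Not found" is reported as data rather than thrown as an error.
    if (Err == ENOENT || Err == ENOTDIR)
      return Info;
    throw Path + ": Unable to get file information: " + std::strerror(Err);
  }

  Info.Exists = true;
  Info.IsDirectory = S_ISDIR(Buf.st_mode);
  Info.Size = static_cast<unsigned long long>(Buf.st_size);
  return Info;
}
```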
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="nodata">No Exposed Data</a></div>
+ <div class="doc_text">
+   <p>Any data defined by system libraries (i.e. not defined by lib/System) must
+   not be exposed through the lib/System interface, even if the header file for
+   that data is not exposed. As with functions, this prevents inadvertent use
+   of data that might not exist on all platforms.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="softerrors">Minimize Soft Errors</a></div>
+ <div class="doc_text">
+   <p>Operating system interfaces will generally provide error results for every
+   little thing that could go wrong. In almost all cases, you can divide these
+   error results into two groups: normal/good/soft and abnormal/bad/hard. That
+   is, some of the errors are simply information like "file not found", 
+   "insufficient privileges", etc. while other errors are much harder like
+   "out of space", "bad disk sector", or "system call interrupted". We'll call 
+   the first group "<i>soft</i>" errors and the second group "<i>hard</i>" 
+   errors.</p>
+   <p>lib/System must always attempt to minimize soft errors and always just
+   throw a std::string on hard errors. This is a design requirement because the
+   minimization of soft errors can affect the granularity and the nature of the
+   interface. In general, if you find that you're wanting to throw soft errors,
+   you must review the granularity of the interface because it is likely you're
+   trying to implement something that is too low level. The rule of thumb is to
+   provide interface functions that <em>can't</em> fail, except when faced with 
+   hard errors.</p>
+   <p>For a trivial example, suppose we wanted to add an "OpenFileForWriting" 
+   function. For many operating systems, if the file doesn't exist, attempting 
+   to open the file will produce an error.  However, lib/System should not
+   simply throw that error if it occurs because it's a soft error. The problem
+   is that the interface function, OpenFileForWriting, is too low level. It should
+   be OpenOrCreateFileForWriting. In the case of the soft "doesn't exist" error, 
+   this function would just create it and then open it for writing.</p>
+   <p>This design principle needs to be maintained in lib/System because it
+   avoids the propagation of soft error handling throughout the rest of LLVM.
+   Hard errors will generally just cause a termination for an LLVM tool so don't
+   be bashful about throwing them.</p>
+   <p>Rules of thumb:</p>
+   <ol>
+     <li>Don't throw soft errors, only hard errors.</li>
+     <li>If you're tempted to throw a soft error, re-think the interface.</li>
+     <li>Handle internally the most common normal/good/soft error conditions
+     so the rest of LLVM doesn't have to.</li>
+   </ol>
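+   <p>The <tt>OpenOrCreateFileForWriting</tt> example above could be sketched
+   on Unix roughly as follows. Returning a raw file descriptor, the 0666
+   creation mode, and the wording of the error string are assumptions made
+   for illustration, not the actual <tt>lib/System</tt> interface.</p>

```cpp
#include <cerrno>
#include <cstring>
#include <string>
#include <fcntl.h>
#include <unistd.h>

// Hypothetical interface function: opens Path for writing, creating the file
// first if it does not exist, and returns a file descriptor.
int OpenOrCreateFileForWriting(const std::string &Path) {
  // O_CREAT absorbs the soft "file doesn't exist" case inside the library,
  // so callers never have to handle it.
  int FD = open(Path.c_str(), O_WRONLY | O_CREAT, 0666);
  if (FD < 0) {
    int Err = errno; // Save errno before building the message.
    // Whatever remains is reported as a std::string. Deciding which errno
    // values are truly "hard" is a judgment this sketch does not attempt.
    throw Path + ": Unable to open file for writing: " + std::strerror(Err);
  }
  return FD;
}
```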
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="throw">Throw Only std::string</a></div>
+ <div class="doc_text">
+   <p>If an error occurs that lib/System cannot handle, the only action taken by
+   lib/System is to throw an instance of std::string. The contents of the string
+   must explain both what happened and the context in which it happened. The
+   format of the string should be a (possibly empty) list of contexts each 
+   terminated with a : and a space, followed by the error message, optionally
+   followed by a reason, and optionally followed by a suggestion.</p>
+   <p>For example, failure to open a file named "foo" could result in a message
+   like:</p>
+   <ul><li>foo: Unable to open file because it doesn't exist.</li></ul>
+   <p>The "foo:" part is the context. The "Unable to open file" part is the error
+   message. The "because it doesn't exist." part is the reason. This message has
+   no suggestion. Where possible, the implementation of lib/System should use
+   operating system specific facilities for converting the error code returned by
+   a system call into an error message. This will help to make the error message
+   more familiar to users of that type of operating system.</p>
+   <p>Note that this requirement precludes the throwing of any other exceptions.
+   For example, various C++ standard library functions can cause exceptions to be
+   thrown (e.g. out of memory situation). In all cases, if there is a possibility
+   that non-string exceptions could be thrown, the lib/System library must ensure
+   that the exceptions are translated to std::string form.</p>
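+   <p>The message format and the translation requirement might be captured by
+   two small helpers along the following lines. Both
+   <tt>MakeErrorString</tt> and <tt>TranslateExceptions</tt> are hypothetical
+   names introduced for this sketch; nothing here is claimed to exist in
+   <tt>lib/System</tt>.</p>

```cpp
#include <exception>
#include <string>
#include <vector>

// Hypothetical helper: builds "ctx1: ctx2: message reason suggestion".
// The reason and suggestion are optional, as described in the text.
std::string MakeErrorString(const std::vector<std::string> &Contexts,
                            const std::string &Message,
                            const std::string &Reason = "",
                            const std::string &Suggestion = "") {
  std::string Result;
  for (size_t i = 0; i != Contexts.size(); ++i)
    Result += Contexts[i] + ": "; // Each context ends with a colon and space.
  Result += Message;
  if (!Reason.empty())
    Result += " " + Reason;
  if (!Suggestion.empty())
    Result += " " + Suggestion;
  return Result;
}

// Hypothetical wrapper: runs Fn and rethrows any standard library exception
// as a std::string prefixed with the given context.
template <typename Func>
void TranslateExceptions(const std::string &Context, Func Fn) {
  try {
    Fn();
  } catch (const std::string &) {
    throw; // Already in the required form.
  } catch (const std::exception &E) {
    throw Context + ": " + E.what();
  }
}
```

+   <p>With these, the "foo" message above would be built from the context
+   "foo", the message "Unable to open file", and the reason "because it
+   doesn't exist."</p>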
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="throw_spec">No throw Specifications</a>
+ </div>
+ <div class="doc_text">
+   <p>None of the lib/System interface functions may be declared with C++ 
+   <tt>throw()</tt> specifications on them. This requirement makes sure that the
+   compiler does not insert additional exception handling code into the interface
+   functions. This is a performance consideration: lib/System functions are at
+   the bottom of many call chains and as such can be frequently called. We
+   need them to be as efficient as possible.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="organization">Code Organization</a></div>
+ <div class="doc_text">
+   <p>Implementations of the System Library interface are separated by their
+   general class of operating system. Currently only Unix and Win32 classes are
+   defined but more could be added for other operating system classifications.
+   To distinguish which implementation to compile, the code in lib/System uses
+   the LLVM_ON_UNIX and LLVM_ON_WIN32 #defines provided via configure through the
+ llvm/Config/config.h file. Each source file in lib/System, after implementing
+ the generic (operating system independent) functionality, needs to include the
+ correct implementation using a set of <tt>#if defined(LLVM_ON_XYZ)</tt> 
+   directives. For example, if we had lib/System/File.cpp, we'd expect to see in
+   that file:</p>
+   <pre><tt>
+   #if defined(LLVM_ON_UNIX)
+   #include "Unix/File.cpp"
+   #endif
+   #if defined(LLVM_ON_WIN32)
+   #include "Win32/File.cpp"
+   #endif
+   </tt></pre>
+   <p>The implementation in lib/System/Unix/File.cpp should handle all Unix
+   variants. The implementation in lib/System/Win32/File.cpp should handle all
+   Win32 variants.  What this does is quickly differentiate the basic class of 
+   operating system that will provide the implementation. The specific details
+   for a given platform must still be determined through the use of
+   <tt>#ifdef</tt>.</p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="semantics">Consistent Semantics</a></div>
+ <div class="doc_text">
+   <p>The implementation of a lib/System interface can vary drastically between
+   platforms. That's okay as long as the end result of the interface function 
+   is the same. For example, a function to create a directory is pretty
+   straightforward on all operating systems. System V IPC on the other hand isn't even
+   supported on all platforms. Instead of "supporting" System V IPC, lib/System
+   should provide an interface to the basic concept of inter-process 
+   communications. The implementations might use System V IPC if that was 
+   available or named pipes, or whatever gets the job done effectively for a 
+   given operating system.  In all cases, the interface and the implementation 
+   must be semantically consistent. </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="bug">Bug 351</a></div>
+ <div class="doc_text">
+   <p>See <a href="http://llvm.org/PR351">bug 351</a>
+   for further details on the progress of this work.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="mailto:rspencer at x10sys.com">Reid Spencer</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/TableGenFundamentals.html
diff -c /dev/null llvm-www/releases/1.8/docs/TableGenFundamentals.html:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/TableGenFundamentals.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,567 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>TableGen Fundamentals</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">TableGen Fundamentals</div>
+ 
+ <div class="doc_text">
+ <ul>
+   <li><a href="#introduction">Introduction</a>
+   <ol>
+     <li><a href="#concepts">Basic concepts</a></li>
+     <li><a href="#example">An example record</a></li>
+     <li><a href="#running">Running TableGen</a></li>
+   </ol></li>
+   <li><a href="#syntax">TableGen syntax</a>
+   <ol>
+     <li><a href="#primitives">TableGen primitives</a>
+     <ol>
+       <li><a href="#comments">TableGen comments</a></li>
+       <li><a href="#types">The TableGen type system</a></li>
+       <li><a href="#values">TableGen values and expressions</a></li>
+     </ol></li>
+     <li><a href="#classesdefs">Classes and definitions</a>
+     <ol>
+       <li><a href="#valuedef">Value definitions</a></li>
+       <li><a href="#recordlet">'let' expressions</a></li>
+       <li><a href="#templateargs">Class template arguments</a></li>
+     </ol></li>
+     <li><a href="#filescope">File scope entities</a>
+     <ol>
+       <li><a href="#include">File inclusion</a></li>
+       <li><a href="#globallet">'let' expressions</a></li>
+     </ol></li>
+   </ol></li>
+   <li><a href="#backends">TableGen backends</a>
+   <ol>
+     <li><a href="#">todo</a></li>
+   </ol></li>
+ </ul>
+ </div>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="introduction">Introduction</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>TableGen's purpose is to help a human develop and maintain records of
+ domain-specific information.  Because there may be a large number of these
+ records, it is specifically designed to allow writing flexible descriptions and
+ for common features of these records to be factored out.  This reduces the
+ amount of duplication in the description, reduces the chance of error, and
+ makes it easier to structure domain specific information.</p>
+ 
+ <p>The core part of TableGen <a href="#syntax">parses a file</a>, instantiates
+ the declarations, and hands the result off to a domain-specific "<a
+ href="#backends">TableGen backend</a>" for processing.  The current major user
+ of TableGen is the <a href="CodeGenerator.html">LLVM code generator</a>.</p>
+ 
+ <p>If you work on TableGen regularly and use emacs or vim, you can find an
+ emacs "TableGen mode" and a vim language file in the <tt>llvm/utils/emacs</tt>
+ and <tt>llvm/utils/vim</tt> directories of your LLVM distribution,
+ respectively.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="running">Basic concepts</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>TableGen files consist of two key parts: 'classes' and 'definitions', both
+ of which are considered 'records'.</p>
+ 
+ <p><b>TableGen records</b> have a unique name, a list of values, and a list of
+ superclasses.  The list of values is the main data that TableGen builds for each
+ record; it is this that holds the domain specific information for the
+ application.  The interpretation of this data is left to a specific <a
+ href="#backends">TableGen backend</a>, but the structure and format rules are
+ taken care of and fixed by TableGen.</p>
+ 
+ <p><b>TableGen definitions</b> are the concrete form of 'records'.  These
+ generally do not have any undefined values, and are marked with the
+ '<tt>def</tt>' keyword.</p>
+ 
+ <p><b>TableGen classes</b> are abstract records that are used to build and
+ describe other records.  These 'classes' allow the end-user to build
+ abstractions for either the domain they are targeting (such as "Register",
+ "RegisterClass", and "Instruction" in the LLVM code generator) or for the
+ implementor to help factor out common properties of records (such as "FPInst",
+ which is used to represent floating point instructions in the X86 backend).
+ TableGen keeps track of all of the classes that are used to build up a
+ definition, so the backend can find all definitions of a particular class, such
+ as "Instruction".</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="example">An example record</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>With no other arguments, TableGen parses the specified file and prints out
+ all of the classes, then all of the definitions.  This is a good way to see what
+ the various definitions expand to fully.  Running this on the <tt>X86.td</tt>
+ file prints this (at the time of this writing):</p>
+ 
+ <pre>
+ ...
+ <b>def</b> ADDrr8 {    <i>// Instruction X86Inst I2A8 Pattern</i>
+   <b>string</b> Name = "add";
+   <b>string</b> Namespace = "X86";
+   <b>list</b>&lt;Register&gt; Uses = [];
+   <b>list</b>&lt;Register&gt; Defs = [];
+   <b>bit</b> isReturn = 0;
+   <b>bit</b> isBranch = 0;
+   <b>bit</b> isCall = 0;
+   <b>bit</b> isTwoAddress = 1;
+   <b>bit</b> isTerminator = 0;
+   <b>dag</b> Pattern = (set R8, (plus R8, R8));
+   <b>bits</b>&lt;8&gt; Opcode = { 0, 0, 0, 0, 0, 0, 0, 0 };
+   Format Form = MRMDestReg;
+   <b>bits</b>&lt;5&gt; FormBits = { 0, 0, 0, 1, 1 };
+   ArgType Type = Arg8;
+   <b>bits</b>&lt;3&gt; TypeBits = { 0, 0, 1 };
+   <b>bit</b> hasOpSizePrefix = 0;
+   <b>bit</b> printImplicitUses = 0;
+   <b>bits</b>&lt;4&gt; Prefix = { 0, 0, 0, 0 };
+   FPFormat FPForm = ?;
+   <b>bits</b>&lt;3&gt; FPFormBits = { 0, 0, 0 };
+ }
+ ...
+ </pre>
+ 
+ <p>This definition corresponds to an 8-bit register-register add instruction in
+ the X86.  The string after the '<tt>def</tt>' keyword indicates the name of the
+ record ("<tt>ADDrr8</tt>" in this case), and the comment at the end of the line
+ indicates the superclasses of the definition.  The body of the record contains
+ all of the data that TableGen assembled for the record, indicating that the
+ instruction is part of the "X86" namespace, should be printed as "<tt>add</tt>"
+ in the assembly file, is a two-address instruction, has a particular
+ encoding, etc.  The contents and semantics of the information in the record are
+ specific to the needs of the X86 backend, and are only shown as an example.</p>
+ 
+ <p>As you can see, a lot of information is needed for every instruction
+ supported by the code generator, and specifying it all manually would be
+ unmaintainable, prone to bugs, and tiring to do in the first place.  Because we
+ are using TableGen, all of the information was derived from the following
+ definition:</p>
+ 
+ <pre>
+ <b>def</b> ADDrr8   : I2A8<"add", 0x00, MRMDestReg>,
+                Pattern<(set R8, (plus R8, R8))>;
+ </pre>
+ 
+ <p>This definition makes use of the custom I2A8 (two address instruction with
+ 8-bit operand) class, which is defined in the X86-specific TableGen file to
+ factor out the common features that instructions of its class share.  A key
+ feature of TableGen is that it allows the end-user to define the abstractions
+ they prefer to use when describing their information.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="running">Running TableGen</a></div>
+ 
+ <div class="doc_text">
+ 
+ <p>TableGen runs just like any other LLVM tool.  The first (optional) argument
+ specifies the file to read.  If a filename is not specified, <tt>tblgen</tt>
+ reads from standard input.</p>
+ 
+ <p>To be useful, one of the <a href="#backends">TableGen backends</a> must be
+ used.  These backends are selectable on the command line (type '<tt>tblgen
+ --help</tt>' for a list).  For example, to get a list of all of the definitions
+ that subclass a particular type (which can be useful for building up an enum
+ list of these records), use the <tt>-print-enums</tt> option:</p>
+ 
+ <pre>
+ $ tblgen X86.td -print-enums -class=Register
+ AH, AL, AX, BH, BL, BP, BX, CH, CL, CX, DH, DI, DL, DX,
+ EAX, EBP, EBX, ECX, EDI, EDX, ESI, ESP, FP0, FP1, FP2, FP3, FP4, FP5, FP6,
+ SI, SP, ST0, ST1, ST2, ST3, ST4, ST5, ST6, ST7, 
+ 
+ $ tblgen X86.td -print-enums -class=Instruction 
+ ADCrr32, ADDri16, ADDri16b, ADDri32, ADDri32b, ADDri8, ADDrr16, ADDrr32,
+ ADDrr8, ADJCALLSTACKDOWN, ADJCALLSTACKUP, ANDri16, ANDri16b, ANDri32, ANDri32b,
+ ANDri8, ANDrr16, ANDrr32, ANDrr8, BSWAPr32, CALLm32, CALLpcrel32, ...
+ </pre>
+ 
+ <p>The default backend prints out all of the records, as described <a
+ href="#example">above</a>.</p>
+ 
+ <p>If you plan to use TableGen for some purpose, you will most likely have to
+ <a href="#backends">write a backend</a> that extracts the information specific
+ to what you need and formats it in the appropriate way.</p>
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="syntax">TableGen syntax</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>TableGen doesn't care about the meaning of data (that is up to the backend
+ to define), but it does care about syntax, and it enforces a simple type system.
+ This section describes the syntax and the constructs allowed in a TableGen file.
+ </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="primitives">TableGen primitives</a></div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection"><a name="comments">TableGen comments</a></div>
+ 
+ <div class="doc_text">
+ <p>TableGen supports BCPL style "<tt>//</tt>" comments, which run to the end of
+ the line, and it also supports <b>nestable</b> "<tt>/* */</tt>" comments.</p>
+ </div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="types">The TableGen type system</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>TableGen files are strongly typed, in a simple (but complete) type-system.
+ These types are used to perform automatic conversions, check for errors, and to
+ help interface designers constrain the input that they allow.  Every <a
+ href="#valuedef">value definition</a> is required to have an associated type.
+ </p>
+ 
+ <p>TableGen supports a mixture of very low-level types (such as <tt>bit</tt>)
+ and very high-level types (such as <tt>dag</tt>).  This flexibility is what
+ allows it to describe a wide range of information conveniently and compactly.
+ The TableGen types are:</p>
+ 
+ <ul>
+ <li>"<tt><b>bit</b></tt>" - A 'bit' is a boolean value that can hold either 0 or
+ 1.</li>
+ 
+ <li>"<tt><b>int</b></tt>" - The 'int' type represents a simple 32-bit integer
+ value, such as 5.</li>
+ 
+ <li>"<tt><b>string</b></tt>" - The 'string' type represents an ordered sequence
+ of characters of arbitrary length.</li>
+ 
+ <li>"<tt><b>bits</b><n></tt>" - A 'bits' type is an arbitrary, but fixed,
+ size integer that is broken up into individual bits.  This type is useful
+ because it can handle some bits being defined while others are undefined.</li>
+ 
+ <li>"<tt><b>list</b><ty></tt>" - This type represents a list whose
+ elements are some other type.  The contained type is arbitrary: it can even be
+ another list type.</li>
+ 
+ <li>Class type - Specifying a class name in a type context means that the
+ defined value must be a subclass of the specified class.  This is useful in
+ conjunction with the "list" type, for example, to constrain the elements of the
+ list to a common base class (e.g., a <tt><b>list</b><Register></tt> can
+ only contain definitions derived from the "<tt>Register</tt>" class).</li>
+ 
+ <li>"<tt><b>code</b></tt>" - This represents a big hunk of text.  NOTE: I don't
+ remember why this is distinct from string!</li>
+ 
+ <li>"<tt><b>dag</b></tt>" - This type represents a nestable directed graph of
+ elements.</li>
+ </ul>
+ 
+ <p>To date, these types have been sufficient for describing things that
+ TableGen has been used for, but it is straightforward to extend this list if
+ needed.</p>
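+ 
+ <p>As a rough illustration, the sketch below declares one value of each type
+ listed above.  The record names (<tt>Register</tt>, <tt>R0</tt>, <tt>R1</tt>,
+ <tt>op</tt>, and <tt>TypeDemo</tt>) are hypothetical and exist only for this
+ example; they are not taken from any LLVM target description:</p>
+ 
+ <pre>
+ <b>class</b> Register;        <i>// hypothetical class with no values</i>
+ <b>def</b> R0 : Register;
+ <b>def</b> R1 : Register;
+ <b>def</b> op;                <i>// hypothetical record used as a dag operator</i>
+ 
+ <b>def</b> TypeDemo {
+   <b>bit</b> Flag = 1;
+   <b>int</b> Count = 5;
+   <b>string</b> Name = "demo";
+   <b>bits</b><4> Mask = { 1, 0, 0, 1 };
+   <b>list</b><<b>int</b>> Sizes = [8, 16, 32];
+   <b>list</b><Register> Regs = [R0, R1];  <i>// class type constrains the elements</i>
+   <b>code</b> Body = [{ return 0; }];
+   <b>dag</b> Pattern = (op R0, R1);
+ }
+ </pre>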
+ 
+ </div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="values">TableGen values and expressions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>TableGen allows for a pretty reasonable number of different expression forms
+ when building up values.  These forms allow the TableGen file to be written in a
+ natural syntax and flavor for the application.  The current expression forms
+ supported include:</p>
+ 
+ <ul>
+ <li><tt>?</tt> - uninitialized field</li>
+ <li><tt>0b1001011</tt> - binary integer value</li>
+ <li><tt>07654321</tt> - octal integer value (indicated by a leading 0)</li>
+ <li><tt>7</tt> - decimal integer value</li>
+ <li><tt>0x7F</tt> - hexadecimal integer value</li>
+ <li><tt>"foo"</tt> - string value</li>
+ <li><tt>[{ ... }]</tt> - code fragment</li>
+ <li><tt>[ X, Y, Z ]</tt> - list value.</li>
+ <li><tt>{ a, b, c }</tt> - initializer for a "bits<3>" value</li>
+ <li><tt>value</tt> - value reference</li>
+ <li><tt>value{17}</tt> - access to one bit of a value</li>
+ <li><tt>value{15-17}</tt> - access to multiple bits of a value</li>
+ <li><tt>DEF</tt> - reference to a record definition</li>
+ <li><tt>CLASS<val list></tt> - reference to a new anonymous definition of
+         CLASS with the specified template arguments.</li>
+ <li><tt>X.Y</tt> - reference to the subfield of a value</li>
+ <li><tt>list[4-7,17,2-3]</tt> - A slice of the 'list' list, including elements 
+ 4,5,6,7,17,2, and 3 from it.  Elements may be included multiple times.</li>
+ <li><tt>(DEF a, b)</tt> - a dag value.  The first element is required to be a
+ record definition; the remaining elements in the list may be arbitrary other
+ values, including nested '<tt>dag</tt>' values.</li>
+ </ul>
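+ 
+ <p>For instance, the following sketch (the record name <tt>ValueDemo</tt> and
+ its field names are made up for illustration) writes the same integer in each
+ of the literal forms listed above, assigns an integer to a "bits<4>" value,
+ and leaves one field uninitialized:</p>
+ 
+ <pre>
+ <b>def</b> ValueDemo {
+   <b>int</b> Bin = 0b1001011;    <i>// binary, 75</i>
+   <b>int</b> Oct = 0113;         <i>// octal, 75</i>
+   <b>int</b> Dec = 75;           <i>// decimal</i>
+   <b>int</b> Hex = 0x4B;         <i>// hexadecimal, 75</i>
+   <b>bits</b><4> Nibble = 7;     <i>// integer stored in a four-bit value</i>
+   <b>bit</b> Unknown = ?;        <i>// left uninitialized</i>
+ }
+ </pre>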
+ 
+ <p>Note that all of the values have rules specifying how they convert to values
+ for different types.  These rules allow you to assign a value like "7" to a
+ "bits<4>" value, for example.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="classesdefs">Classes and definitions</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>As mentioned in the <a href="#concepts">intro</a>, classes and definitions
+ (collectively known as 'records') in TableGen are the main high-level unit of
+ information that TableGen collects.  Records are defined with a <tt>def</tt> or
+ <tt>class</tt> keyword, the record name, and an optional list of "<a
+ href="#templateargs">template arguments</a>".  If the record has superclasses,
+ they are specified as a comma separated list that starts with a colon character
+ (":").  If <a href="#valuedef">value definitions</a> or <a href="#recordlet">let
+ expressions</a> are needed for the class, they are enclosed in curly braces
+ ("{}"); otherwise, the record ends with a semicolon.  Here is a simple TableGen
+ file:</p>
+ 
+ <pre>
+ <b>class</b> C { <b>bit</b> V = 1; }
+ <b>def</b> X : C;
+ <b>def</b> Y : C {
+   <b>string</b> Greeting = "hello";
+ }
+ </pre>
+ 
+ <p>This example defines two definitions, <tt>X</tt> and <tt>Y</tt>, both of
+ which derive from the <tt>C</tt> class.  Because of this, they both get the
+ <tt>V</tt> bit value.  The <tt>Y</tt> definition also gets the
+ <tt>Greeting</tt> member.</p>
+ 
+ <p>In general, classes are useful for collecting together the commonality
+ between a group of records and isolating it in a single place.  Also, classes
+ permit the specification of default values for their subclasses, allowing the
+ subclasses to override them as they wish.</p>
+ 
+ </div>
+ 
+ <!---------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="valuedef">Value definitions</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>Value definitions define named entries in records.  A value must be defined
+ before it can be referred to as the operand for another value definition or
+ before the value is reset with a <a href="#recordlet">let expression</a>.  A
+ value is defined by specifying a <a href="#types">TableGen type</a> and a name.
+ If an initial value is available, it may be specified after the name with an
+ equal sign.  Value definitions require terminating semicolons.</p>
+ </div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="recordlet">'let' expressions</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>A record-level let expression is used to change the value of a value
+ definition in a record.  This is primarily useful when a superclass defines a
+ value that a derived class or definition wants to override.  Let expressions
+ consist of the '<tt>let</tt>' keyword followed by a value name, an equal sign
+ ("="), and a new value.  For example, a new class could be added to the example
+ above, redefining the <tt>V</tt> field for all of its subclasses:</p>
+ 
+ <pre>
+ <b>class</b> D : C { let V = 0; }
+ <b>def</b> Z : D;
+ </pre>
+ 
+ <p>In this case, the <tt>Z</tt> definition will have a zero value for its "V"
+ value, despite the fact that it derives (indirectly) from the <tt>C</tt> class,
+ because the <tt>D</tt> class overrode its value.</p>
+ 
+ </div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="templateargs">Class template arguments</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>TableGen permits the definition of parameterized classes as well as normal
+ concrete classes.  Parameterized TableGen classes specify a list of variable
+ bindings (which may optionally have defaults) that are bound when used.  Here is
+ a simple example:</p>
+ 
+ <pre>
+ <b>class</b> FPFormat<<b>bits</b><3> val> {
+   <b>bits</b><3> Value = val;
+ }
+ <b>def</b> NotFP      : FPFormat<0>;
+ <b>def</b> ZeroArgFP  : FPFormat<1>;
+ <b>def</b> OneArgFP   : FPFormat<2>;
+ <b>def</b> OneArgFPRW : FPFormat<3>;
+ <b>def</b> TwoArgFP   : FPFormat<4>;
+ <b>def</b> SpecialFP  : FPFormat<5>;
+ </pre>
+ 
+ <p>In this case, template arguments are used as a space efficient way to specify
+ a list of "enumeration values", each with a "Value" field set to the specified
+ integer.</p>
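+ 
+ <p>Template arguments may also carry defaults.  The following is a minimal
+ sketch of that form; the class <tt>Reg</tt>, its arguments, and the two
+ definitions are hypothetical and are not part of any LLVM target:</p>
+ 
+ <pre>
+ <b>class</b> Reg<<b>string</b> n, <b>int</b> size = 32> {
+   <b>string</b> Name = n;
+   <b>int</b> Size = size;
+ }
+ <b>def</b> WideReg   : Reg<"wide">;        <i>// size takes its default of 32</i>
+ <b>def</b> NarrowReg : Reg<"narrow", 16>;  <i>// size given explicitly</i>
+ </pre>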
+ 
+ <p>The more esoteric forms of <a href="#values">TableGen expressions</a> are
+ useful in conjunction with template arguments.  As an example:</p>
+ 
+ <pre>
+ <b>class</b> ModRefVal<<b>bits</b><2> val> {
+   <b>bits</b><2> Value = val;
+ }
+ 
+ <b>def</b> None   : ModRefVal<0>;
+ <b>def</b> Mod    : ModRefVal<1>;
+ <b>def</b> Ref    : ModRefVal<2>;
+ <b>def</b> ModRef : ModRefVal<3>;
+ 
+ <b>class</b> Value<ModRefVal MR> {
+   <i>// decode some information into a more convenient format, while providing
+   // a nice interface to the user of the "Value" class.</i>
+   <b>bit</b> isMod = MR.Value{0};
+   <b>bit</b> isRef = MR.Value{1};
+ 
+   <i>// other stuff...</i>
+ }
+ 
+ <i>// Example uses</i>
+ <b>def</b> bork : Value<Mod>;
+ <b>def</b> zork : Value<Ref>;
+ <b>def</b> hork : Value<ModRef>;
+ </pre>
+ 
+ <p>This is obviously a contrived example, but it shows how template arguments
+ can be used to decouple the interface provided to the user of the class from the
+ actual internal data representation expected by the class.  In this case,
+ running <tt>tblgen</tt> on the example prints the following definitions:</p>
+ 
+ <pre>
+ <b>def</b> bork {      <i>// Value</i>
+   bit isMod = 1;
+   bit isRef = 0;
+ }
+ <b>def</b> hork {      <i>// Value</i>
+   bit isMod = 1;
+   bit isRef = 1;
+ }
+ <b>def</b> zork {      <i>// Value</i>
+   bit isMod = 0;
+   bit isRef = 1;
+ }
+ </pre>
+ 
+ <p> This shows that TableGen was able to dig into the argument and extract a
+ piece of information that was requested by the designer of the "Value" class.
+ For more realistic examples, please see existing users of TableGen, such as the
+ X86 backend.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="filescope">File scope entities</a>
+ </div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="include">File inclusion</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>TableGen supports the '<tt>include</tt>' token, which textually substitutes
+ the specified file in place of the include directive.  The filename should be
+ specified as a double quoted string immediately after the '<tt>include</tt>'
+ keyword.  Example:</p>
+ 
+ <pre>
+ <b>include</b> "foo.td"
+ </pre>
+ 
+ </div>
+ 
+ <!-- -------------------------------------------------------------------------->
+ <div class="doc_subsubsection">
+   <a name="globallet">'let' expressions</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p> "let" expressions at file scope are similar to <a href="#recordlet">"let"
+ expressions within a record</a>, except they can specify a value binding for
+ multiple records at a time, and may be useful in certain other cases.
+ File-scope let expressions are really just another way that TableGen allows the
+ end-user to factor out commonality from the records.</p>
+ 
+ <p>File-scope "let" expressions take a comma-separated list of bindings to
+ apply, and one or more records to bind the values in.  Here are some
+ examples:</p>
+ 
+ <pre>
+ <b>let</b> isTerminator = 1, isReturn = 1 <b>in</b>
+   <b>def</b> RET : X86Inst<"ret", 0xC3, RawFrm, NoArg>;
+ 
+ <b>let</b> isCall = 1 <b>in</b>
+   <i>// All calls clobber the non-callee saved registers...</i>
+   <b>let</b> Defs = [EAX, ECX, EDX, FP0, FP1, FP2, FP3, FP4, FP5, FP6] in {
+     <b>def</b> CALLpcrel32 : X86Inst<"call", 0xE8, RawFrm, NoArg>;
+     <b>def</b> CALLr32     : X86Inst<"call", 0xFF, MRMS2r, Arg32>;
+     <b>def</b> CALLm32     : X86Inst<"call", 0xFF, MRMS2m, Arg32>;
+   }
+ </pre>
+ 
+ <p>File-scope "let" expressions are often useful when a couple of definitions
+ need to be added to several records, and the records do not otherwise need to be
+ opened, as is the case with the CALL* instructions above.</p>
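+ 
+ <p>To make the relationship to record-level "let" concrete, here is a small
+ sketch.  The class <tt>Inst</tt> and the four definitions are hypothetical;
+ the assumption is that both spellings leave <tt>isCall</tt> set to 1 in each
+ definition:</p>
+ 
+ <pre>
+ <b>class</b> Inst { <b>bit</b> isCall = 0; }
+ 
+ <i>// One file-scope let covering two records...</i>
+ <b>let</b> isCall = 1 <b>in</b> {
+   <b>def</b> CallA : Inst;
+   <b>def</b> CallB : Inst;
+ }
+ 
+ <i>// ...versus opening each record to override the value individually.</i>
+ <b>def</b> CallC : Inst { <b>let</b> isCall = 1; }
+ <b>def</b> CallD : Inst { <b>let</b> isCall = 1; }
+ </pre>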
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section"><a name="backends">TableGen backends</a></div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>How they work, how to write one.  This section should not contain details
+ about any particular backend, except maybe -print-enums as an example.  This
+ should highlight the APIs in <tt>TableGen/Record.h</tt>.</p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/TestingGuide.html
diff -c /dev/null llvm-www/releases/1.8/docs/TestingGuide.html:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/TestingGuide.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,620 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>LLVM Test Suite Guide</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+       
+ <div class="doc_title">
+   LLVM Test Suite Guide
+ </div>
+ 
+ <ol>
+   <li><a href="#overview">Overview</a></li>
+   <li><a href="#Requirements">Requirements</a></li>
+   <li><a href="#quick">Quick Start</a></li>
+   <li><a href="#org">LLVM Test Suite Organization</a>
+     <ul>
+       <li><a href="#codefragments">Code Fragments</a></li>
+       <li><a href="#wholeprograms">Whole Programs</a></li>
+     </ul>
+   </li>
+   <li><a href="#tree">LLVM Test Suite Tree</a></li>
+   <li><a href="#dgstructure">DejaGNU Structure</a></li>
+   <li><a href="#progstructure"><tt>llvm-test</tt> Structure</a></li>
+   <li><a href="#run">Running the LLVM Tests</a>
+     <ul>
+       <li><a href="#customtest">Writing custom tests for llvm-test</a></li>
+     </ul>
+   </li>
+   <li><a href="#nightly">Running the nightly tester</a></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by John T. Criswell, <a
+   href="http://llvm.x10sys.com/rspencer">Reid Spencer</a>, and Tanya Lattner</p>
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="overview">Overview</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>This document is the reference manual for the LLVM test suite.  It documents
+ the structure of the LLVM test suite, the tools needed to use it, and how to add
+ and run tests.</p>
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="Requirements">Requirements</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>In order to use the LLVM test suite, you will need all of the software
+ required to build LLVM, plus the following:</p>
+ 
+ <dl>
+ <dt><a href="http://www.gnu.org/software/dejagnu/">DejaGNU</a></dt>
+ <dd>The Features and Regression tests are organized and run by DejaGNU.</dd>
+ <dt><a href="http://expect.nist.gov/">Expect</a></dt>
+ <dd>Expect is required by DejaGNU.</dd>
+ <dt><a href="http://www.tcl.tk/software/tcltk/">tcl</a></dt>
+ <dd>Tcl is required by DejaGNU. </dd>
+ 
+ <dt><a href="http://www.netlib.org/f2c">F2C</a></dt>
+ <dd>For now, LLVM does not have a Fortran front-end, but using F2C, we can run
+ Fortran benchmarks.  F2C support must be enabled via <tt>configure</tt> if not
+ installed in a standard place.  F2C requires three items: the <tt>f2c</tt>
+ executable, <tt>f2c.h</tt> to compile the generated code, and <tt>libf2c.a</tt>
+ to link generated code.  By default, given an F2C directory <tt>$DIR</tt>, the
+ configure script will search <tt>$DIR/bin</tt> for <tt>f2c</tt>,
+ <tt>$DIR/include</tt> for <tt>f2c.h</tt>, and <tt>$DIR/lib</tt> for
+ <tt>libf2c.a</tt>.  The default <tt>$DIR</tt> values are: <tt>/usr</tt>,
+ <tt>/usr/local</tt>, <tt>/sw</tt>, and <tt>/opt</tt>.  If you installed F2C in a
+ different location, you must tell <tt>configure</tt>:
+ 
+ <ul>
+ <li><tt>./configure --with-f2c=$DIR</tt><br>
+ This will specify a new <tt>$DIR</tt> for the above-described search
+ process.  This will only work if the binary, header, and library are in their
+ respective subdirectories of <tt>$DIR</tt>.</li>
+ 
+ <li><tt>./configure --with-f2c-bin=/binary/path --with-f2c-inc=/include/path
+ --with-f2c-lib=/lib/path</tt><br>
+ This allows you to specify the F2C components separately.  Note: if you choose
+ this route, you MUST specify all three components, and you need only specify
+ <em>directories</em> where the files are located; do NOT include the
+ filenames themselves on the <tt>configure</tt> line.</li>
+ </ul></dd>
+ </dl>
+ 
+ <p>Darwin (Mac OS X) developers can simplify the installation of Expect and tcl
+ by using fink.  <tt>fink install expect</tt> will install both. Alternatively,
+ Darwinports users can use <tt>sudo port install expect</tt> to install Expect
+ and tcl.</p>
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="quick">Quick Start</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>The tests are located in two separate CVS modules. The basic feature and 
+ regression tests are in the main "llvm" module under the directory 
+ <tt>llvm/test</tt>. A more comprehensive test suite that includes whole 
+ programs in C and C++ is in the <tt>llvm-test</tt> module. This module should 
+ be checked out to the <tt>llvm/projects</tt> directory. When you
+ <tt>configure</tt> the <tt>llvm</tt> module, the <tt>llvm-test</tt> module
+ will be automatically configured. Alternatively, you can configure the
+  <tt>llvm-test</tt> module manually.</p>
+ <p>To run all of the simple tests in LLVM using DejaGNU, use the master Makefile
+  in the <tt>llvm/test</tt> directory:</p>
+ <pre>
+ % gmake -C llvm/test
+ </pre>
+ or<br>
+ <pre>
+ % gmake check
+ </pre>
+ 
+ <p>To run only a subdirectory of tests in llvm/test using DejaGNU (e.g.,
+ Regression/Transforms), just set the TESTSUITE variable to the path of the
+ subdirectory (relative to <tt>llvm/test</tt>):</p>
+ <pre>
+ % gmake -C llvm/test TESTSUITE=Regression/Transforms
+ </pre>
+ 
+ <p><b>Note: If you are running the tests with <tt>objdir != srcdir</tt>, you
+ must have run the complete testsuite before you can specify a
+ subdirectory.</b></p>
+ 
+ <p>To run the comprehensive test suite (tests that compile and execute whole 
+ programs), run the <tt>llvm-test</tt> tests:</p>
+ 
+ <pre>
+ % cd llvm/projects
+ % cvs co llvm-test
+ % cd llvm-test
+ % ./configure --with-llvmsrc=$LLVM_SRC_ROOT --with-llvmobj=$LLVM_OBJ_ROOT
+ % gmake
+ </pre>
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="org">LLVM Test Suite Organization</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM test suite contains two major categories of tests: code
+ fragments and whole programs. Code fragments are in the <tt>llvm</tt> module
+ under the <tt>llvm/test</tt> directory. The whole programs
+ test suite is in the <tt>llvm-test</tt> module under the main directory.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="codefragments">Code Fragments</a></div>
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_text">
+ 
+ <p>Code fragments are small pieces of code that test a specific feature of LLVM
+ or trigger a specific bug in LLVM.  They are usually written in LLVM assembly
+ language, but can be written in other languages if the test targets a particular
+ language front end.</p>
+ 
+ <p>Code fragments are not complete programs, and they are never executed to
+ determine correct behavior.</p> 
+ 
+ <p>These code fragment tests are located in the <tt>llvm/test/Features</tt> and 
+ <tt>llvm/test/Regression</tt> directories.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection"><a name="wholeprograms">Whole Programs</a></div>
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_text">
+ 
+ <p>Whole Programs are pieces of code which can be compiled and linked into a
+ stand-alone program that can be executed.  These programs are generally written
+ in high level languages such as C or C++, but sometimes they are written
+ straight in LLVM assembly.</p>
+ 
+ <p>These programs are compiled and then executed using several different
+ methods (native compiler, LLVM C backend, LLVM JIT, LLVM native code generation,
+ etc).  The output of these programs is compared to ensure that LLVM is compiling
+ the program correctly.</p>
+ 
+ <p>In addition to compiling and executing programs, whole program tests serve as
+ a way of benchmarking LLVM performance, both in terms of the efficiency of the
+ programs generated as well as the speed with which LLVM compiles, optimizes, and
+ generates code.</p>
+ 
+ <p>All "whole program" tests are located in the <tt>llvm-test</tt> CVS
+ module.</p> 
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="tree">LLVM Test Suite Tree</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>Each type of test in the LLVM test suite has its own directory. The major
+ subtrees of the test suite directory tree are as follows:</p>
+     
+ <ul>
+ <li><tt>llvm/test/Features</tt>
+ <p>This directory contains sample code that tests various features of the
+ LLVM language.  These pieces of sample code are run through various
+ assembler, disassembler, and optimizer passes.</p>
+ </li>
+ 
+ <li><tt>llvm/test/Regression</tt>
+ <p>This directory contains regression tests for LLVM.  When a bug is found
+ in LLVM, a regression test containing just enough code to reproduce the
+ problem should be written and placed somewhere underneath this directory.
+ In most cases, this will be a small piece of LLVM assembly language code,
+ often distilled from an actual application or benchmark.</p>
+ </li>
+ 
+ <li><tt>llvm-test</tt>
+ <p>The <tt>llvm-test</tt> CVS module contains programs that can be compiled 
+ with LLVM and executed.  These programs are compiled using the native compiler
+ and various LLVM backends.  The output from the program compiled with the 
+ native compiler is assumed correct; the results from the other programs are
+ compared to the native program output and pass if they match.</p>
+ 
+ <p>In addition to testing correctness, the <tt>llvm-test</tt> directory also
+ performs timing tests of various LLVM optimizations.  It also records
+ compilation times for the compilers and the JIT.  This information can be
+ used to compare the effectiveness of LLVM's optimizations and code
+ generation.</p></li>
+ 
+ <li><tt>llvm-test/SingleSource</tt>
+ <p>The SingleSource directory contains test programs that are only a single 
+ source file in size.  These are usually small benchmark programs or small 
+ programs that calculate a particular value.  Several such programs are grouped 
+ together in each directory.</p></li>
+ 
+ <li><tt>llvm-test/MultiSource</tt>
+ <p>The MultiSource directory contains subdirectories which contain entire 
+ programs with multiple source files.  Large benchmarks and whole applications 
+ go here.</p></li>
+ 
+ <li><tt>llvm-test/External</tt>
+ <p>The External directory contains Makefiles for building code that is external
+ to (i.e., not distributed with) LLVM.  The most prominent members of this
+ directory are the SPEC 95 and SPEC 2000 benchmark suites.  The presence and
+ location of these external programs is configured by the llvm-test
+ <tt>configure</tt> script.</p></li>
+       
+ </ul>
+ 
+ </div>
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="dgstructure">DejaGNU Structure</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ <p>The LLVM test suite is partially driven by DejaGNU and partially
+ driven by GNU Make. Specifically, the Features and Regression tests
+ are all driven by DejaGNU. The <tt>llvm-test</tt>
+ module is currently driven by a set of Makefiles.</p>
+ 
+ <p>The DejaGNU structure is very simple, but does require some
+ information to be set. This information is gathered via <tt>configure</tt> and
+ is written to a file, <tt>site.exp</tt> in <tt>llvm/test</tt>. The
+ <tt>llvm/test</tt>
+ Makefile does this work for you.</p>
+ 
+ <p>In order for DejaGNU to work, each directory of tests must have a
+ <tt>dg.exp</tt> file. This file is a program written in tcl that calls
+ the <tt>llvm-runtests</tt> procedure on each test file. The
+ llvm-runtests procedure is defined in
+ <tt>llvm/test/lib/llvm-dg.exp</tt>. Any directory that contains only
+ directories does not need the <tt>dg.exp</tt> file.</p>
+ 
+ <p>In order for a test to be run, it must contain information within
+ the test file on how to run the test. These are called <tt>RUN</tt>
+ lines. Run lines are specified in the comments of the test program
+ using the keyword <tt>RUN</tt> followed by a colon, and lastly the
+ commands to execute. These commands will be executed in a bash script,
+ so any bash syntax is acceptable. You can specify as many RUN lines as
+ necessary.  Each RUN line translates to one line in the resulting bash
+ script. Below is an example of legal RUN lines in a <tt>.ll</tt>
+ file:</p>
+ <pre>
+ ; RUN: llvm-as < %s | llvm-dis > %t1
+ ; RUN: llvm-dis < %s.bc-13 > %t2
+ ; RUN: diff %t1 %t2
+ </pre>
+ <p>There are a few patterns within a <tt>RUN</tt> line that the
+ <tt>llvm-runtests</tt> procedure looks for and replaces with the appropriate
+ value:</p>
+ 
+ <dl style="margin-left: 25px">
+ <dt>%p</dt> 
+ <dd>The path to the source directory. This is for locating
+ any supporting files that are not generated by the test, but used by
+ the test.</dd> 
+ <dt>%s</dt> 
+ <dd>The test file.</dd> 
+ 
+ <dt>%t</dt>
+ <dd>Temporary filename: testscript.test_filename.tmp, where
+ test_filename is the name of the test file. All temporary files are
+ placed in the Output directory within the directory in which the test is
+ located.</dd> 
+ 
+ <dt>%prcontext</dt> 
+ <dd>Path to a script that performs grep -C. Use this since not all
+ platforms support grep -C.</dd>
+ 
+ <dt>%llvmgcc</dt> <dd>Full path to the llvm-gcc executable.</dd>
+ <dt>%llvmgxx</dt> <dd>Full path to the llvm-g++ executable.</dd>
+ </dl>
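+ 
+ <p>As a rough illustration of the substitution step, the following Python
+ sketch pulls the <tt>RUN</tt> lines out of a test and expands <tt>%s</tt>,
+ <tt>%t</tt> and <tt>%p</tt>. It is not the Tcl code in
+ <tt>llvm/test/lib/llvm-dg.exp</tt>; the function names and the way the
+ temporary file name is passed in are assumptions made for this example.</p>

```python
import re


def extract_run_lines(text):
    """Return the command part of every 'RUN:' line found in the test text."""
    return [m.group(1).strip() for m in re.finditer(r"RUN:[ \t]*(.*)", text)]


def substitute(command, test_path, src_dir, tmp_base):
    """Expand %s, %t and %p in one RUN command (hypothetical helper).

    Longer patterns such as %prcontext are left alone: the lookahead
    stops %p from matching when a letter follows it.
    """
    table = {"%s": test_path, "%t": tmp_base, "%p": src_dir}
    return re.sub(r"%[stp](?![A-Za-z])", lambda m: table[m.group(0)], command)


if __name__ == "__main__":
    test_text = "; RUN: llvm-as < %s | llvm-dis > %t1\n; RUN: diff %t1 %t2\n"
    for cmd in extract_run_lines(test_text):
        print(substitute(cmd, "foo.ll", "/src/test", "Output/foo.ll.tmp"))
```

+ <p>Each expanded command would then become one line of the generated bash
+ script, as described above.</p>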
+ 
+ <p>There are also several scripts in the llvm/test/Scripts directory
+ that you might find useful when writing <tt>RUN</tt> lines.</p>
+ 
+ <p>Lastly, you can easily mark a test that is expected to fail on a
+ specific platform or with a specific version of llvmgcc by using the
+ <tt>XFAIL</tt> keyword. XFAIL lines are
+ specified in the comments of the test program using <tt>XFAIL</tt>,
+ followed by a colon, and one or more regular expressions (separated by
+ commas) that will match against the target triplet or llvmgcc version for the
+ machine. You can use * to match all targets. You can specify the major or full
+ version (e.g. 3.4) for llvmgcc. Here is an example of an
+ <tt>XFAIL</tt> line:</p>
+ <pre>
+ ; XFAIL: darwin,sun,llvmgcc4
+ </pre>
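+ 
+ <p>The matching rule can be pictured with the Python sketch below. It is
+ only an approximation written for this guide: the function name is
+ hypothetical, and treating an <tt>llvmgcc</tt> entry as a prefix match on
+ the version string is an assumption, not a description of the DejaGNU
+ support code.</p>

```python
import re


def is_expected_failure(xfail_line, target_triplet, llvmgcc_version):
    """Decide whether an XFAIL line applies to this machine (sketch only)."""
    # Everything after the first colon is a comma-separated pattern list.
    patterns = [p.strip() for p in xfail_line.split(":", 1)[1].split(",")]
    for pattern in patterns:
        if pattern == "*":
            return True  # '*' matches all targets
        if pattern.startswith("llvmgcc"):
            # Assumption: 'llvmgcc4' means "llvmgcc version starts with 4".
            if llvmgcc_version.startswith(pattern[len("llvmgcc"):]):
                return True
        elif re.search(pattern, target_triplet):
            return True
    return False


if __name__ == "__main__":
    line = "; XFAIL: darwin,sun,llvmgcc4"
    print(is_expected_failure(line, "powerpc-apple-darwin8.7.0", "3.4"))
    print(is_expected_failure(line, "i686-pc-linux-gnu", "3.4"))
```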
+ 
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="progstructure"><tt>llvm-test</tt> 
+ Structure</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>As mentioned previously, the <tt>llvm-test</tt> module provides three types
+ of tests: MultiSource, SingleSource, and External.  Each tree is then subdivided
+ into several categories, including applications, benchmarks, regression tests,
+ code that is strange grammatically, etc.  These organizations should be
+ relatively self-explanatory.</p>
+ 
+ <p>In addition to the regular "whole program"  tests, the <tt>llvm-test</tt>
+ module also provides a mechanism for compiling the programs in different ways.
+ If the variable TEST is defined on the gmake command line, the test system will
+ include a Makefile named <tt>TEST.&lt;value of TEST variable&gt;.Makefile</tt>.
+ This Makefile can modify build rules to yield different results.</p>
+ 
+ <p>For example, the LLVM nightly tester uses <tt>TEST.nightly.Makefile</tt> to
+ create the nightly test reports.  To run the nightly tests, run <tt>gmake
+ TEST=nightly</tt>.</p>
+ 
+ <p>There are several TEST Makefiles available in the tree.  Some of them are
+ designed for internal LLVM research and will not work outside of the LLVM
+ research group.  They may still be valuable, however, as a guide to writing your
+ own TEST Makefile for any optimization or analysis passes that you develop with
+ LLVM.</p>
+ 
+ <p>Note, when configuring the <tt>llvm-test</tt> module, you might want to
+ specify the following configuration options:</p>
+ <dl>
+   <dt><i>--enable-spec2000</i></dt>
+   <dt><i>--enable-spec2000=&lt;<tt>directory</tt>&gt;</i></dt>
+   <dd>
+     Enable the use of SPEC2000 when testing LLVM.  This is disabled by default
+     (unless <tt>configure</tt> finds SPEC2000 installed).  By specifying
+     <tt>directory</tt>, you can tell configure where to find the SPEC2000
+     benchmarks.  If <tt>directory</tt> is left unspecified, <tt>configure</tt>
+     uses the default value
+     <tt>/home/vadve/shared/benchmarks/speccpu2000/benchspec</tt>.
+   </dd>
+   <dt><i>--enable-spec95</i></dt>
+   <dt><i>--enable-spec95=&lt;<tt>directory</tt>&gt;</i></dt>
+   <dd>
+     Enable the use of SPEC95 when testing LLVM.  It is similar to the
+     <i>--enable-spec2000</i> option.
+   </dd>
+   <dt><i>--enable-povray</i></dt>
+   <dt><i>--enable-povray=&lt;<tt>directory</tt>&gt;</i></dt>
+   <dd>
+     Enable the use of Povray as an external test.  Versions of Povray written
+     in C should work.  This option is similar to the <i>--enable-spec2000</i>
+     option.
+   </dd>
+ </dl>
+ </div>
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="run">Running the LLVM Tests</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>First, all tests are executed within the LLVM object directory tree.  They
+ <i>are not</i> executed inside of the LLVM source tree. This is because the
+ test suite creates temporary files during execution.</p>
+ 
+ <p>The master Makefile in llvm/test is capable of running only the DejaGNU
+ driven tests. By default, it will run all of these tests.</p>
+ 
+ <p>To run only the DejaGNU driven tests, run <tt>gmake</tt> at the
+ command line in <tt>llvm/test</tt>.  To run a specific directory of tests, use
+ the TESTSUITE variable.
+ </p>
+ 
+ <p>For example, to run the Regression tests, type 
+ <tt>gmake TESTSUITE=Regression</tt> in <tt>llvm/test</tt>.</p>
+ 
+ <p>Note that there are no Makefiles in <tt>llvm/test/Features</tt> and
+ <tt>llvm/test/Regression</tt>. You must use DejaGNU from the <tt>llvm/test</tt>
+ directory to run them.</p>
+ 
+ <p>To run the <tt>llvm-test</tt> suite, you need to use the following steps:
+ </p>
+ <ol>
+   <li>cd into the llvm/projects directory</li>
+   <li>check out the <tt>llvm-test</tt> module with:<br/>
+   <tt>cvs -d :pserver:anon@llvm.org:/var/cvs/llvm co -PR llvm-test</tt><br> 
+   This will get the test suite into <tt>llvm/projects/llvm-test</tt></li>
+   <li>configure the test suite. You can do this one of two ways:
+   <ol>
+     <li>Use the regular llvm configure:<br/>
+     <tt>cd $LLVM_OBJ_ROOT ; $LLVM_SRC_ROOT/configure</tt><br/>
+     This will ensure that the <tt>projects/llvm-test</tt> directory is also
+     properly configured.</li>
+     <li>Use the <tt>configure</tt> script found in the <tt>llvm-test</tt> source
+     directory:<br/>
+     <tt>$LLVM_SRC_ROOT/projects/llvm-test/configure
+      --with-llvmsrc=$LLVM_SRC_ROOT --with-llvmobj=$LLVM_OBJ_ROOT</tt>
+     </li>
+   </ol></li>
+   <li>gmake</li>
+ </ol>
+ <p>Note that the second and third steps only need to be done once. After you
+ have the suite checked out and configured, you don't need to do it again (unless
+ the test code or configure script changes).</p>
+ 
+ <p>To make a specialized test (use one of the
+ <tt>llvm-test/TEST.&lt;type&gt;.Makefile</tt>s), just run:<br/>
+ <tt>gmake TEST=&lt;type&gt; test</tt><br/>For example, you could run the
+ nightly tester tests using the following commands:</p>
+ 
+ <pre>
+  % cd llvm/projects/llvm-test
+  % gmake TEST=nightly test
+ </pre>
+ 
+ <p>Regardless of which test you're running, the results are printed on standard
+ output and standard error.  You can redirect these results to a file if you
+ choose.</p>
+ 
+ <p>Some tests are known to fail.  Some are bugs that we have not fixed yet;
+ others are features that we haven't added yet (or may never add).  In DejaGNU,
+ the result for such tests will be XFAIL (eXpected FAILure).  In this way, you
+ can tell the difference between an expected and unexpected failure.</p>
+ 
+ <p>The tests in <tt>llvm-test</tt> have no such feature at this time. If the
+ test passes, only warnings and other miscellaneous output will be generated.  If
+ a test fails, a large &lt;program&gt; FAILED message will be displayed.  This
+ will help you separate benign warnings from actual test failures.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsection">
+ <a name="customtest">Writing custom tests for llvm-test</a></div>
+ <!-- _______________________________________________________________________ -->
+ 
+ <div class="doc_text">
+ 
+ <p>Assuming you can run llvm-test (e.g. "<tt>gmake TEST=nightly report</tt>"
+ should work), it is really easy to run optimizations or code generator
+ components against every program in the tree, collecting statistics or running
+ custom checks for correctness.  At base, this is how the nightly tester works:
+ it's just one example of a general framework.</p>
+ 
+ <p>Let's say that you have an LLVM optimization pass, and you want to see how
+ many times it triggers.  The first thing you should do is add an LLVM
+ <a href="ProgrammersManual.html#Statistic">statistic</a> to your pass, which
+ will tally counts of things you care about.</p>
+ 
+ <p>Following this, you can set up a test and a report that collects these and
+ formats them for easy viewing.  This consists of two files, an
+ "<tt>llvm-test/TEST.XXX.Makefile</tt>" fragment (where XXX is the name of your
+ test) and an "<tt>llvm-test/TEST.XXX.report</tt>" file that indicates how to
+ format the output into a table.  There are many example reports of various
+ levels of sophistication included with llvm-test, and the framework is very
+ general.</p>
+ 
+ <p>If you are interested in testing an optimization pass, check out the
+ "libcalls" test as an example.  It can be run like this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ % cd llvm/projects/llvm-test/MultiSource/Benchmarks  # or some other level
+ % make TEST=libcalls report
+ </pre>
+ </div>
+ 
+ <p>This will do a bunch of stuff, then eventually print a table like this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ Name                                  | total | #exit |
+ ...
+ FreeBench/analyzer/analyzer           | 51    | 6     | 
+ FreeBench/fourinarow/fourinarow       | 1     | 1     | 
+ FreeBench/neural/neural               | 19    | 9     | 
+ FreeBench/pifft/pifft                 | 5     | 3     | 
+ MallocBench/cfrac/cfrac               | 1     | *     | 
+ MallocBench/espresso/espresso         | 52    | 12    | 
+ MallocBench/gs/gs                     | 4     | *     | 
+ Prolangs-C/TimberWolfMC/timberwolfmc  | 302   | *     | 
+ Prolangs-C/agrep/agrep                | 33    | 12    | 
+ Prolangs-C/allroots/allroots          | *     | *     | 
+ Prolangs-C/assembler/assembler        | 47    | *     | 
+ Prolangs-C/bison/mybison              | 74    | *     | 
+ ...
+ </pre>
+ </div>
+ 
+ <p>This is basically grepping the <tt>-stats</tt> output and displaying it in a table.
+ You can also use the "TEST=libcalls report.html" target to get the table in HTML
+ form, similarly for report.csv and report.tex.</p>
+ 
+ <p>The source for this is in llvm-test/TEST.libcalls.*.  The format is pretty
+ simple: the Makefile indicates how to run the test (in this case, 
+ "<tt>opt -simplify-libcalls -stats</tt>"), and the report contains one line for
+ each column of the output.  The first value is the header for the column and the
+ second is the regex to grep the output of the command for.  There are lots of
+ example reports that can do fancy stuff.</p>
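+ 
+ <p>To make the "header plus regex per column" idea concrete, here is a small
+ Python sketch of how such a table could be assembled. It is not the
+ llvm-test report machinery: the function names, the sample statistics text
+ and the regular expressions are all made up for illustration, and showing
+ <tt>*</tt> for a column with no match is an assumption based on the table
+ above.</p>

```python
import re


def build_row(name, stats_output, columns):
    """Return [name, cell, ...]; a column whose regex finds nothing gets '*'."""
    row = [name]
    for _header, pattern in columns:
        match = re.search(pattern, stats_output)
        row.append(match.group(1) if match else "*")
    return row


def format_table(columns, rows):
    """Join the header and the rows with ' | ' separators, one row per line."""
    header = ["Name"] + [h for h, _ in columns]
    return "\n".join(" | ".join(cells) for cells in [header] + rows)


# Hypothetical column definitions: (column header, regex with one group).
COLUMNS = [
    ("total", r"(\d+) example - total things"),
    ("#exit", r"(\d+) example - exit things"),
]

if __name__ == "__main__":
    sample = "51 example - total things\n6 example - exit things\n"
    print(format_table(COLUMNS, [build_row("prog", sample, COLUMNS)]))
```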
+ 
+ </div>
+ 
+ 
+ <!--=========================================================================-->
+ <div class="doc_section"><a name="nightly">Running the nightly tester</a></div>
+ <!--=========================================================================-->
+ 
+ <div class="doc_text">
+ 
+ <p>
+ The <a href="http://llvm.org/testresults/">LLVM Nightly Testers</a>
+ automatically check out an LLVM tree, build it, run the "nightly" 
+ program test (described above), run  all of the feature and regression tests, 
+ and then delete the checked out tree.  This tester is designed to ensure that 
+ programs don't break as well as keep track of LLVM's progress over time.</p>
+ 
+ <p>If you'd like to set up an instance of the nightly tester to run on your
+ machine, take a look at the comments at the top of the
+ <tt>utils/NightlyTest.pl</tt> file.  We usually run it from a crontab entry
+ that looks like this:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ 5 3 * * *  $HOME/llvm/utils/NightlyTest.pl -parallel $CVSROOT \
+            $HOME/buildtest $HOME/cvs/testresults
+ </pre>
+ </div>
+ 
+ <p>Or, you can create a shell script to encapsulate the running of the script.
+ The optimized x86 Linux nightly test is run from just such a script:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ #!/bin/bash
+ BASE=/proj/work/llvm/nightlytest
+ export CVSROOT=:pserver:anon@llvm.org:/var/cvs/llvm
+ export BUILDDIR=$BASE/build 
+ export WEBDIR=$BASE/testresults 
+ export LLVMGCCDIR=/proj/work/llvm/cfrontend/install
+ export PATH=/proj/install/bin:$LLVMGCCDIR/bin:$PATH
+ export LD_LIBRARY_PATH=/proj/install/lib
+ cd $BASE
+ cp /proj/work/llvm/llvm/utils/NightlyTest.pl .
+ nice ./NightlyTest.pl -nice -release -verbose -parallel -enable-linscan \
+    -noexternals &gt; output.log 2&gt;&amp;1
+ mail -s 'X86 nightly tester results' \
+    <a href="http://mail.cs.uiuc.edu/mailman/listinfo/llvm-testresults">llvm-testresults@cs.uiuc.edu</a> &lt; output.log
+ </pre>
+ </div>
+ 
+ <p>Take a look at the <tt>NightlyTest.pl</tt> file to see what all of the flags
+ and strings do.  If you start running the nightly tests, please let us know and
+ we'll link your page to the global tester page.  Thanks!</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   John T. Criswell, Reid Spencer, and Tanya Lattner<br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br/>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/UsingLibraries.html
diff -c /dev/null llvm-www/releases/1.8/docs/UsingLibraries.html:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/UsingLibraries.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,398 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+ 	<title>Using The LLVM Libraries</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ <div class="doc_title">Using The LLVM Libraries</div>
+ <ol>
+   <li><a href="#abstract">Abstract</a></li>
+   <li><a href="#introduction">Introduction</a></li>
+   <li><a href="#descriptions">Library Descriptions</a></li>
+   <li><a href="#dependencies">Library Dependencies</a></li>
+   <li><a href="#rot">Linkage Rules Of Thumb</a>
+     <ol>
+       <li><a href="#always">Always link LLVMCore, LLVMSupport, LLVMSystem</a></li>
+       <li><a href="#onlyone">Never link both archive and re-linked library</a></li>
+     </ol>
+   </li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:rspencer@x10sys.com">Reid Spencer</a></p>
+ </div>
+ 
+ <p class="doc_warning">Warning: This document is out of date; please see <a href="CommandGuide/html/llvm-config.html">llvm-config</a> for more information.</p>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_section"><a name="abstract">Abstract</a></div>
+ <div class="doc_text">
+   <p>Amongst other things, LLVM is a toolkit for building compilers, linkers,
+   runtime executives, virtual machines, and other program execution related
+   tools. In addition to the LLVM tool set, the functionality of LLVM is
+   available through a set of libraries.  To use LLVM as a toolkit for
+   constructing tools, a developer needs to understand what is contained in the
+   various libraries, what they depend on, and how to use them.  This document 
+   describes the contents of the libraries and how and when to use them.
+ </p>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_section"> <a name="introduction">Introduction</a></div>
+ <div class="doc_text">
+   <p>If you're writing a compiler, virtual machine, or any other utility based 
+   on LLVM, you'll need to figure out which of the many library files you will 
+   need to link with to be successful. An understanding of the contents of these 
+   files and their inter-relationships will be useful in coming up with an optimal 
+   specification for the libraries to link with. The purpose of this document is 
+   to reduce some of the trial and error that the author experienced in using 
+   LLVM.</p>
+   <p>LLVM produces two types of libraries: archives (ending in <tt>.a</tt>) and
+   objects (ending in <tt>.o</tt>). However, both are libraries. Libraries ending
+   in <tt>.o</tt> are known as re-linked libraries because they contain all the
+   compilation units of the library linked together as a single <tt>.o</tt> file.
+   Furthermore, many of the libraries have <em>both</em> forms of library. The
+   re-linked libraries are used whenever you want to include all symbols from the
+   library. The archive libraries are used whenever you want to only resolve
+   outstanding symbols at that point in the link without including everything in
+   the library. </p>
+   <p>When linking your tools, you will use the <tt>LLVMLIBS</tt> make variable
+   (see the <a href="MakefileGuide.html#LLVMLIBS">Makefile Guide</a> for 
+   details). This variable specifies which LLVM libraries to link into your tool 
+   and the order in which they will be linked. You specify re-linked libraries by
+   naming the library without a suffix. You specify archive libraries by naming
+   the library with a <tt>.a</tt> suffix but without the <tt>lib</tt> prefix. The
+   order in which the libraries appear in the <tt>LLVMLIBS</tt> variable
+   definition is the order in which they will be linked. Getting this order
+   correct for your tool can sometimes be challenging.</p>
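+   <p>The naming rule can be summarised with a short Python sketch. The
+   function name <tt>llvmlibs_entry</tt> is hypothetical and exists only for
+   this example; it simply turns a library file name into the spelling the
+   text above says <tt>LLVMLIBS</tt> expects.</p>

```python
def llvmlibs_entry(filename):
    """Map a library file name to its LLVMLIBS spelling (illustrative only)."""
    if filename.endswith(".o"):
        # Re-linked library: named without a suffix.
        return filename[: -len(".o")]
    if filename.startswith("lib") and filename.endswith(".a"):
        # Archive library: keep the .a suffix, drop the lib prefix.
        return filename[len("lib"):]
    raise ValueError("not a re-linked (.o) or archive (lib*.a) library: " + filename)


if __name__ == "__main__":
    files = ["LLVMCore.o", "libLLVMSupport.a", "libLLVMSystem.a"]
    print(" ".join(llvmlibs_entry(f) for f in files))
```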
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_section"><a name="descriptions"></a>Library Descriptions</div>
+ <div class="doc_text">
+   <p>The table below categorizes each library:</p>
+ <table style="text-align:left">
+   <tr><th>Library</th><th>Forms</th><th>Description</th></tr>
+   <tr><th colspan="3">Core Libraries</th></tr>
+   <tr><td>LLVMArchive</td><td><tt>.a</tt></td>
+     <td>LLVM archive reading and writing</td></tr>
+   <tr><td>LLVMAsmParser</td><td><tt>.o</tt></td>
+     <td>LLVM assembly parsing</td></tr>
+   <tr><td>LLVMBCReader</td><td><tt>.o</tt></td>
+     <td>LLVM bytecode reading</td></tr>
+   <tr><td>LLVMBCWriter</td><td><tt>.o</tt></td>
+     <td>LLVM bytecode writing</td></tr>
+   <tr><td>LLVMCore</td><td><tt>.o</tt></td>
+     <td>LLVM core intermediate representation</td></tr>
+   <tr><td>LLVMDebugger</td><td><tt>.o</tt></td>
+     <td>Source level debugging support</td></tr>
+   <tr><td>LLVMLinker</td><td><tt>.a</tt></td>
+     <td>Bytecode and archive linking interface</td></tr>
+   <tr><td>LLVMSupport</td><td><tt>.a .o</tt></td>
+     <td>General support utilities</td></tr>
+   <tr><td>LLVMSystem</td><td><tt>.a .o</tt></td>
+     <td>Operating system abstraction layer</td></tr>
+ 
+   <tr><th colspan="3">Analysis Libraries</th></tr>
+   <tr><td>LLVMAnalysis</td><td><tt>.a .o</tt></td>
+     <td>Various analysis passes.</td></tr>
+   <tr><td>LLVMDataStructure</td><td><tt>.a .o</tt></td>
+     <td>Data structure analysis passes.</td></tr>
+   <tr><td>LLVMipa</td><td><tt>.a .o</tt></td>
+     <td>Inter-procedural analysis passes.</td></tr>
+ 
+   <tr><th colspan="3">Transformation Libraries</th></tr>
+   <tr><td>LLVMInstrumentation</td><td><tt>.a .o</tt></td>
+     <td>Instrumentation passes.</td></tr>
+   <tr><td>LLVMipo</td><td><tt>.a .o</tt></td>
+     <td>All inter-procedural optimization passes.</td></tr>
+   <tr><td>LLVMScalarOpts</td><td><tt>.a .o</tt></td>
+     <td>All scalar optimization passes.</td></tr>
+   <tr><td>LLVMTransforms</td><td><tt>.a .o</tt></td>
+     <td>Uncategorized transformation passes.</td></tr>
+   <tr><td>LLVMTransformUtils</td><td><tt>.a .o</tt></td>
+     <td>Transformation utilities.</td></tr>
+ 
+   <tr><th colspan="3">Code Generation Libraries </th></tr>
+   <tr><td>LLVMCodeGen</td><td><tt>.o</tt></td>
+     <td>Native code generation infrastructure</td></tr>
+ 
+   <tr><th colspan="3">Target Libraries</th></tr>
+   <tr><td>LLVMAlpha</td><td><tt>.o</tt></td>
+     <td>Code generation for Alpha architecture</td></tr>
+   <tr><td>LLVMCBackend</td><td><tt>.o</tt></td>
+     <td>'C' language code generator.</td></tr>
+   <tr><td>LLVMIA64</td><td><tt>.o</tt></td>
+     <td>Code generation for IA64 architecture</td></tr>
+   <tr><td>LLVMPowerPC</td><td><tt>.o</tt></td>
+     <td>Code generation for PowerPC architecture</td></tr>
+   <tr><td>LLVMSelectionDAG</td><td><tt>.o</tt></td>
+     <td>Aggressive instruction selector for directed acyclic graphs</td></tr>
+   <tr><td>LLVMSparc</td><td><tt>.o</tt></td>
+     <td>Code generation for Sparc architecture</td></tr>
+   <tr><td>LLVMTarget</td><td><tt>.a .o</tt></td>
+     <td>Generic code generation utilities.</td></tr>
+   <tr><td>LLVMX86</td><td><tt>.o</tt></td>
+     <td>Code generation for Intel x86 architecture</td></tr>
+ 
+   <tr><th colspan="3">Runtime Libraries</th></tr>
+   <tr><td>LLVMInterpreter</td><td><tt>.o</tt></td>
+     <td>Bytecode Interpreter</td></tr>
+   <tr><td>LLVMJIT</td><td><tt>.o</tt></td>
+     <td>Bytecode JIT Compiler</td></tr>
+   <tr><td>LLVMExecutionEngine</td><td><tt>.o</tt></td>
+     <td>Virtual machine engine</td></tr>
+ </table>
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_section"><a name="dependencies"></a>Library Dependencies</div>
+ <div class="doc_text">
+   <p>Below are two dependency graphs and a list that show the relationships
+   between the various LLVM archive libraries and object files.  This information 
+   can be automatically generated with the <tt>GenLibDeps.pl</tt> utility found
+   in the <tt>llvm/utils</tt> directory.</p>
+   <!-- =======NOTE: =========================================================-->
+   <!-- === The following graphs and <dl> list are generated automatically ===-->
+   <!-- === by the util named GenLibDeps.pl in the llvm/utils directory.   ===-->
+   <!-- === This should be updated whenever new libraries are added,       ===-->
+   <!-- === removed, or changed                                            ===-->
+   <!-- =======NOTE: =========================================================-->
+   <h2>Dependency Relationships Of Libraries</h2>
+   <p>This graph shows the dependency of archive libraries on other archive 
+   libraries or objects. Where a library has both archive and object forms, only
+   the archive form is shown.</p>
+   <img src="img/libdeps.gif" alt="Library Dependencies"/>
+   <h2>Dependency Relationships Of Object Files</h2>
+   <p>This graph shows the dependency of object files on archive libraries or 
+   other objects. Where a library has both object and archive forms, only the 
+   dependency to the archive form is shown.</p> 
+   <img src="img/objdeps.gif" alt="Object File Dependencies"/>
+   <p>The following list shows the dependency relationships between libraries in
+   textual form. The information is the same as shown on the graphs but arranged
+   alphabetically.</p>
+ <dl>
+   <dt><b>libLLVMAnalysis.a</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMArchive.a</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>LLVMBCReader.o</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMInstrumentation.a</b></dt><dd><ul>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTransformUtils.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMLinker.a</b></dt><dd><ul>
+     <li>libLLVMArchive.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>LLVMBCReader.o</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMScalarOpts.a</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>libLLVMTransformUtils.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMSupport.a</b></dt><dd><ul>
+     <li>libLLVMSystem.a</li>
+     <li>LLVMbzip2.o</li>
+   </ul></dd>
+   <dt><b>libLLVMSystem.a</b></dt><dd><ul>
+   </ul></dd>
+   <dt><b>libLLVMTarget.a</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMSelectionDAG.o</li>
+   </ul></dd>
+   <dt><b>libLLVMTransformUtils.a</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMipa.a</li>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMTransforms.a</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>libLLVMTransformUtils.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMipa.a</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>libLLVMipo.a</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMipa.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>libLLVMTransformUtils.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMAlpha.o</b></dt><dd><ul>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMSelectionDAG.o</li>
+   </ul></dd>
+   <dt><b>LLVMAsmParser.o</b></dt><dd><ul>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMBCReader.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMBCWriter.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMCBackend.o</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMipa.a</li>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMCodeGen.o</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMCore.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+   </ul></dd>
+   <dt><b>LLVMDataStructure.o</b></dt><dd><ul>
+     <li>libLLVMAnalysis.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMDebugger.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>LLVMBCReader.o</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMExecutionEngine.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMIA64.o</b></dt><dd><ul>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMSelectionDAG.o</li>
+   </ul></dd>
+   <dt><b>LLVMInterpreter.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMExecutionEngine.o</li>
+   </ul></dd>
+   <dt><b>LLVMJIT.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMExecutionEngine.o</li>
+   </ul></dd>
+   <dt><b>LLVMPowerPC.o</b></dt><dd><ul>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMSelectionDAG.o</li>
+   </ul></dd>
+   <dt><b>LLVMSelectionDAG.o</b></dt><dd><ul>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMSystem.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>libLLVMTransformUtils.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+   </ul></dd>
+   <dt><b>LLVMSparc.o</b></dt><dd><ul>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMSelectionDAG.o</li>
+   </ul></dd>
+   <dt><b>LLVMX86.o</b></dt><dd><ul>
+     <li>libLLVMScalarOpts.a</li>
+     <li>libLLVMSupport.a</li>
+     <li>libLLVMTarget.a</li>
+     <li>LLVMCodeGen.o</li>
+     <li>LLVMCore.o</li>
+     <li>LLVMSelectionDAG.o</li>
+   </ul></dd>
+   <dt><b>LLVMbzip2.o</b></dt><dd><ul>
+   </ul></dd>
+ </dl>
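+   <p>Because several of the dependencies above are circular (for example,
+   <tt>libLLVMScalarOpts.a</tt> and <tt>libLLVMTransformUtils.a</tt> each
+   list the other), a simple way to answer "what else do I need?" is to
+   compute the transitive closure of the list. The Python sketch below does
+   this for a small excerpt of the list; the function name and the
+   dictionary layout are assumptions made for the example and have nothing
+   to do with how <tt>GenLibDeps.pl</tt> works.</p>

```python
# Excerpt of the dependency list above, keyed by library file name.
DEPS = {
    "LLVMBCReader.o": ["libLLVMSupport.a", "libLLVMSystem.a", "LLVMCore.o"],
    "LLVMCore.o": ["libLLVMSupport.a"],
    "libLLVMSupport.a": ["libLLVMSystem.a", "LLVMbzip2.o"],
    "libLLVMSystem.a": [],
    "LLVMbzip2.o": [],
}


def transitive_deps(library, deps):
    """Return every library reachable from 'library'; cycles are tolerated."""
    seen = set()
    stack = [library]
    while stack:
        for dep in deps.get(stack.pop(), []):
            if dep not in seen:  # visit each library only once
                seen.add(dep)
                stack.append(dep)
    return seen


if __name__ == "__main__":
    for name in sorted(transitive_deps("LLVMBCReader.o", DEPS)):
        print(name)
```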
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_section"><a name="rot">Linkage Rules Of Thumb</a></div>
+ <div class="doc_text">
+ 	<p>This section contains various "rules of thumb" about what files you
+ 	should link into your programs.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="always">Always Link LLVMCore, LLVMSupport,
+     and LLVMSystem</a></div>
+ <div class="doc_text">
+   <p>No matter what you do with LLVM, the last three entries in the value of 
+   your LLVMLIBS make variable should always be: 
+   <tt>LLVMCore LLVMSupport.a LLVMSystem.a</tt>. There are no <tt>LLVM</tt> 
+   programs that don't depend on these three.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <div class="doc_subsection"><a name="onlyone">Never link both archive and
+     re-linked library</a></div>
+ <div class="doc_text">
+   <p>There is never any point to linking both the re-linked (<tt>.o</tt>) and
+   the archive (<tt>.a</tt>) versions of a library. Since the re-linked version
+   includes the entire library, the archive version will not resolve any symbols.
+   You could even end up with a link error if you place the archive version before
+   the re-linked version on the linker's command line.</p>
+ </div>
+ <!-- ======================================================================= -->
+ <hr>
+ <div class="doc_footer">
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+     src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"/></a>
+   <a href="http://validator.w3.org/check/referer"><img
+     src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+   <a href="mailto:rspencer@x10sys.com">Reid Spencer</a>
+ </address>
+ <a href="http://llvm.org">The LLVM Compiler Infrastructure</a> 
+ <br>Last modified: $Date: 2006/08/09 05:56:40 $ </div>
+ </body>
+ </html>
+ <!-- vim: sw=2 ts=2 ai
+ -->

Index: llvm-www/releases/1.8/docs/WritingAnLLVMBackend.html
diff -c /dev/null llvm-www/releases/1.8/docs/WritingAnLLVMBackend.html:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/WritingAnLLVMBackend.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,260 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Writing an LLVM backend</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ 
+ <body>
+ 
+ <div class="doc_title">
+   Writing an LLVM backend
+ </div>
+ 
+ <ol>
+   <li><a href="#intro">Introduction</a>
+   <li><a href="#backends">Writing a backend</a>
+   <ol>
+     <li><a href="#machine">Machine backends</a>
+     <ol>
+       <li><a href="#machineTOC">Outline</a></li>
+       <li><a href="#machineDetails">Implementation details</a></li>
+     </ol></li>  
+     <li><a href="#lang">Language backends</a></li>
+   </ol></li>
+   <li><a href="#related">Related reading material</a>
+ </ol>
+ 
+ <div class="doc_author">    
+   <p>Written by <a href="http://misha.brukman.net">Misha Brukman</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="intro">Introduction</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>This document describes techniques for writing backends for LLVM which
+ convert the LLVM representation to machine assembly code or other languages.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="backends">Writing a backend</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="machine">Machine backends</a>
+ </div>
+     
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="machineTOC">Outline</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>In general, you want to follow the format of SPARC, X86 or PowerPC (in
+ <tt>lib/Target</tt>).  SPARC is the simplest backend, and is RISC, so if
+ you're working on a RISC target, it is a good one to start with.</p>
+ 
+ <p>To create a static compiler (one that emits text assembly), you need to
+ implement the following:</p>
+ 
+ <ul>
+ <li>Describe the register set.
+   <ul>
+   <li>Create a <a href="TableGenFundamentals.html">TableGen</a> description of
+       the register set and register classes</li>
+   <li>Implement a subclass of <tt><a
+       href="CodeGenerator.html#mregisterinfo">MRegisterInfo</a></tt></li>
+   </ul></li>
+ <li>Describe the instruction set.
+   <ul>
+   <li>Create a <a href="TableGenFundamentals.html">TableGen</a> description of
+       the instruction set</li>
+   <li>Implement a subclass of <tt><a
+       href="CodeGenerator.html#targetinstrinfo">TargetInstrInfo</a></tt></li>
+   </ul></li>
+ <li>Describe the target machine.
+   <ul>
+   <li>Create a <a href="TableGenFundamentals.html">TableGen</a> description of
+       the target that describes the pointer size and references the instruction
+       set</li>
+   <li>Implement a subclass of <tt><a
+       href="CodeGenerator.html#targetmachine">TargetMachine</a></tt>, which
+       configures <tt><a href="CodeGenerator.html#targetdata">TargetData</a></tt>
+       correctly</li>
+   <li>Register your new target using the <tt>RegisterTarget</tt>
+   template:<br><br>
+ <div class="doc_code"><pre>
+ RegisterTarget<<em>MyTargetMachine</em>> M("short_name", "  Target name");
+ </pre></div>
+       <br>Here, <em>MyTargetMachine</em> is the name of your implemented
+       subclass of <tt><a
+       href="CodeGenerator.html#targetmachine">TargetMachine</a></tt>,
+       <em>short_name</em> is the value passed after <tt>-march=</tt> to
+       select your target in <tt>llc</tt> and <tt>lli</tt>, and the last
+       string is the description of your target that appears in the
+       <tt>-help</tt> listing.</li>
+   </ul></li>
+ <li>Implement the assembly printer for the architecture.
+   <ul>
+   <li>Define all of the assembly strings for your target, adding them to the
+       instructions in your *InstrInfo.td file.</li>
+   <li>Implement the <tt>llvm::AsmPrinter</tt> interface.</li>
+   </ul>
+ </li>
+ <li>Implement an instruction selector for the architecture.
+   <ul>
+   <li>The recommended method is the <a href="CodeGenerator.html#instselect">
+       pattern-matching DAG-to-DAG instruction selector</a> (for example, see
+       the PowerPC backend in PPCISelDAGtoDAG.cpp).  Parts of instruction
+       selector creation can be performed by adding patterns to the instructions
+       in your <tt>.td</tt> file.</li>
+   </ul>
+ </li>
+ <li>Optionally, add subtarget support.
+ <ul>
+   <li>If your target has multiple subtargets (e.g. variants with different
+       capabilities), implement the <tt>llvm::TargetSubtarget</tt> interface
+       for your architecture.  This allows you to add <tt>-mcpu=</tt> and 
+       <tt>-mattr=</tt> options.</li>
+ </ul>
+ <li>Optionally, add JIT support.
+   <ul>
+   <li>Create a subclass of <tt><a
+       href="CodeGenerator.html#targetjitinfo">TargetJITInfo</a></tt></li>
+   <li>Create a machine code emitter that will be used to emit binary code
+       directly into memory, given <tt>MachineInstr</tt>s</li>
+   </ul>
+ </ul>
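+ 
+ <p>The registration step in the list above relies on a common C++ idiom: a
+ file-scope object whose constructor records an entry in a global table before
+ <tt>main</tt> runs.  The following is a minimal, self-contained sketch of that
+ idiom under stated assumptions; it is not LLVM's actual implementation.  Every
+ name in it (<tt>TargetMachineSketch</tt>, <tt>TargetEntry</tt>,
+ <tt>RegisterTargetSketch</tt>, <tt>targetRegistry</tt>,
+ <tt>createTargetFor</tt>) is hypothetical and chosen only for illustration.</p>
+ 

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical stand-in for a target machine base class (not LLVM's).
struct TargetMachineSketch {
  virtual ~TargetMachineSketch() {}
  virtual std::string name() const = 0;
};

// One registry entry: the help text plus a factory function.
struct TargetEntry {
  std::string description;
  TargetMachineSketch *(*create)();
};

// A function-local static avoids depending on global initialization order.
std::map<std::string, TargetEntry> &targetRegistry() {
  static std::map<std::string, TargetEntry> registry;
  return registry;
}

// Constructing one of these objects records the target under its short name.
template <typename TargetMachineT>
struct RegisterTargetSketch {
  RegisterTargetSketch(const char *shortName, const char *description) {
    assert(shortName && *shortName && "short name must not be empty");
    TargetEntry entry = { description, &RegisterTargetSketch::create };
    targetRegistry()[shortName] = entry;
  }
  static TargetMachineSketch *create() { return new TargetMachineT(); }
};

// A made-up target, registered by a file-scope object.
struct MyTargetMachine : TargetMachineSketch {
  std::string name() const override { return "MyTarget"; }
};

static RegisterTargetSketch<MyTargetMachine> M("mytarget",
                                               "  My hypothetical target");

// Look up a target by short name; returns a null pointer if it is unknown.
// The caller owns the returned object.
TargetMachineSketch *createTargetFor(const std::string &shortName) {
  auto it = targetRegistry().find(shortName);
  return it == targetRegistry().end() ? nullptr : it->second.create();
}
```

+ <p>In this sketch, <tt>createTargetFor("mytarget")</tt> plays the role of the
+ lookup a tool would perform for the string given after <tt>-march=</tt>; an
+ unknown name yields a null pointer.</p>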
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="machineDetails">Implementation details</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ 
+ <li><p><b>TableGen register info description</b> - describe a class which
+ will store the register's number in the binary encoding of the instruction
+ (e.g., for JIT purposes).</p>
+ 
+ <p>You also need to define register classes to contain these registers, such as
+ the integer register class and floating-point register class, so that you can
+ allocate virtual registers to instructions from these sets, and let the
+ target-independent register allocator automatically choose the actual
+ architected registers.</p>
+ 
+ <div class="doc_code">
+ <pre>
+ // class Register is defined in Target.td
+ <b>class</b> <em>Target</em>Reg<string name> : Register<name> {
+   <b>let</b> Namespace = "<em>Target</em>";
+ }
+ 
+ <b>class</b> IntReg<<b>bits</b><5> num, string name> : <em>Target</em>Reg<name> {
+   <b>field</b> <b>bits</b><5> Num = num;
+ }
+ 
+ <b>def</b> R0 : IntReg<0, "%R0">;
+ ...
+ 
+ // class RegisterClass is defined in Target.td
+ <b>def</b> IReg : RegisterClass<i64, 64, [R0, ... ]>;
+ </pre>
+ </div>
+ </li>
+ 
+ <li><p><b>TableGen instruction info description</b> - break up instructions into
+ classes; usually the manufacturer has already done this (see the instruction
+ manual).  Define a class for each instruction category.  Define each opcode as a
+ subclass of the category, with appropriate parameters such as the fixed binary
+ encoding of opcodes and extended opcodes, and map the register bits to the bits
+ of the instruction which they are encoded in (for the JIT).  Also specify how
+ the instruction should be printed so it can use the automatic assembly printer,
+ e.g.:</p>
+ 
+ <div class="doc_code">
+ <pre>
+ // class Instruction is defined in Target.td
+ <b>class</b> Form<<b>bits</b><6> opcode, <b>dag</b> OL, <b>string</b> asmstr> : Instruction {
+   <b>field</b> <b>bits</b><42> Inst;
+ 
+   <b>let</b> Namespace = "<em>Target</em>";
+   <b>let</b> Inst{0-5} = opcode;
+   <b>let</b> OperandList = OL;
+   <b>let</b> AsmString = asmstr;
+ }
+ 
+ <b>def</b> ADD : Form<42, (ops IReg:$rD, IReg:$rA, IReg:$rB), "add $rD, $rA, $rB">;
+ </pre>
+ </div>
+ </li>
+ 
+ </ul>
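+ 
+ <p>To make the idea of mapping register bits onto instruction bits more
+ concrete, here is a small C++ sketch that packs a 6-bit opcode and three 5-bit
+ register numbers into an instruction word, which is the kind of work a JIT code
+ emitter has to do.  The field positions are an assumption made purely for this
+ example (opcode in bits 0-5, followed by <tt>rD</tt>, <tt>rA</tt> and
+ <tt>rB</tt>); a real target takes them from its instruction manual.  The names
+ <tt>encodeForm</tt> and <tt>field</tt> are hypothetical.</p>
+ 

```cpp
#include <cassert>
#include <cstdint>

// Assumed layout for illustration only: a 6-bit opcode in the low bits,
// followed by three 5-bit register fields.
const unsigned OpcodeBits = 6;
const unsigned RegBits = 5;
const unsigned RDShift = OpcodeBits;          // rD occupies bits 6-10
const unsigned RAShift = RDShift + RegBits;   // rA occupies bits 11-15
const unsigned RBShift = RAShift + RegBits;   // rB occupies bits 16-20

// Pack the opcode and the three register numbers into one instruction word.
uint64_t encodeForm(unsigned opcode, unsigned rD, unsigned rA, unsigned rB) {
  assert(opcode < (1u << OpcodeBits) && "opcode must fit in 6 bits");
  assert(rD < (1u << RegBits) && rA < (1u << RegBits) &&
         rB < (1u << RegBits) && "register numbers must fit in 5 bits");
  return uint64_t(opcode) | (uint64_t(rD) << RDShift) |
         (uint64_t(rA) << RAShift) | (uint64_t(rB) << RBShift);
}

// Extract `width` bits starting at bit `shift`.
unsigned field(uint64_t inst, unsigned shift, unsigned width) {
  return unsigned((inst >> shift) & ((uint64_t(1) << width) - 1));
}
```

+ <p>With this layout, an <tt>add</tt> with opcode 42 and registers 1, 2 and 3
+ is encoded by <tt>encodeForm(42, 1, 2, 3)</tt>, and <tt>field</tt> recovers
+ each operand from the resulting word.</p>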
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="lang">Language backends</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>For now, just take a look at <tt>lib/Target/CBackend</tt> for an example of
+ how the C backend is written.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="related">Related reading material</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <ul>
+ <li><a href="CodeGenerator.html">Code generator</a> -
+     describes some of the classes in code generation at a high level, but
+     it is not (yet) complete</li>
+ <li><a href="TableGenFundamentals.html">TableGen fundamentals</a> -
+     describes how to use TableGen to describe your target information
+     succinctly</li>
+ <li><a href="HowToSubmitABug.html#codegen">Debugging code generation with
+     bugpoint</a> - shows bugpoint usage scenarios to simplify backend
+     development</li>
+ </ul>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="http://misha.brukman.net">Misha Brukman</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a>
+   <br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/WritingAnLLVMPass.html
diff -c /dev/null llvm-www/releases/1.8/docs/WritingAnLLVMPass.html:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/WritingAnLLVMPass.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1600 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
+   <title>Writing an LLVM Pass</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">
+   Writing an LLVM Pass
+ </div>
+ 
+ <ol>
+   <li><a href="#introduction">Introduction - What is a pass?</a></li>
+   <li><a href="#quickstart">Quick Start - Writing hello world</a>
+     <ul>
+     <li><a href="#makefile">Setting up the build environment</a></li>
+     <li><a href="#basiccode">Basic code required</a></li>
+     <li><a href="#running">Running a pass with <tt>opt</tt>
+                  or <tt>analyze</tt></a></li>
+     </ul></li>
+   <li><a href="#passtype">Pass classes and requirements</a>
+      <ul>
+      <li><a href="#ImmutablePass">The <tt>ImmutablePass</tt> class</a></li>
+      <li><a href="#ModulePass">The <tt>ModulePass</tt> class</a>
+         <ul>
+         <li><a href="#runOnModule">The <tt>runOnModule</tt> method</a></li>
+         </ul></li>
+      <li><a href="#CallGraphSCCPass">The <tt>CallGraphSCCPass</tt> class</a>
+         <ul>
+         <li><a href="#doInitialization_scc">The <tt>doInitialization(Module
+                                            &)</tt> method</a></li>
+         <li><a href="#runOnSCC">The <tt>runOnSCC</tt> method</a></li>
+         <li><a href="#doFinalization_scc">The <tt>doFinalization(Module
+                                            &)</tt> method</a></li>
+         </ul></li>
+      <li><a href="#FunctionPass">The <tt>FunctionPass</tt> class</a>
+         <ul>
+         <li><a href="#doInitialization_mod">The <tt>doInitialization(Module
+                                             &)</tt> method</a></li>
+         <li><a href="#runOnFunction">The <tt>runOnFunction</tt> method</a></li>
+         <li><a href="#doFinalization_mod">The <tt>doFinalization(Module
+                                             &)</tt> method</a></li>
+         </ul></li>
+      <li><a href="#BasicBlockPass">The <tt>BasicBlockPass</tt> class</a>
+         <ul>
+         <li><a href="#doInitialization_fn">The <tt>doInitialization(Function
+                                              &)</tt> method</a></li>
+         <li><a href="#runOnBasicBlock">The <tt>runOnBasicBlock</tt>
+                                        method</a></li>
+         <li><a href="#doFinalization_fn">The <tt>doFinalization(Function
+                                          &)</tt> method</a></li>
+         </ul></li>
+      <li><a href="#MachineFunctionPass">The <tt>MachineFunctionPass</tt>
+                                         class</a>
+         <ul>
+         <li><a href="#runOnMachineFunction">The
+             <tt>runOnMachineFunction(MachineFunction &)</tt> method</a></li>
+         </ul></li>
+      </ul>
+   <li><a href="#registration">Pass Registration</a>
+      <ul>
+      <li><a href="#print">The <tt>print</tt> method</a></li>
+      </ul></li>
+   <li><a href="#interaction">Specifying interactions between passes</a>
+      <ul>
+      <li><a href="#getAnalysisUsage">The <tt>getAnalysisUsage</tt> 
+                                      method</a></li>
+      <li><a href="#AU::addRequired">The <tt>AnalysisUsage::addRequired<></tt> and <tt>AnalysisUsage::addRequiredTransitive<></tt> methods</a></li>
+      <li><a href="#AU::addPreserved">The <tt>AnalysisUsage::addPreserved<></tt> method</a></li>
+      <li><a href="#AU::examples">Example implementations of <tt>getAnalysisUsage</tt></a></li>
+      <li><a href="#getAnalysis">The <tt>getAnalysis<></tt> and <tt>getAnalysisToUpdate<></tt> methods</a></li>
+      </ul></li>
+   <li><a href="#analysisgroup">Implementing Analysis Groups</a>
+      <ul>
+      <li><a href="#agconcepts">Analysis Group Concepts</a></li>
+      <li><a href="#registerag">Using <tt>RegisterAnalysisGroup</tt></a></li>
+      </ul></li>
+   <li><a href="#passStatistics">Pass Statistics</a>
+   <li><a href="#passmanager">What PassManager does</a>
+     <ul>
+     <li><a href="#releaseMemory">The <tt>releaseMemory</tt> method</a></li>
+     </ul></li>
+   <li><a href="#debughints">Using GDB with dynamically loaded passes</a>
+     <ul>
+     <li><a href="#breakpoint">Setting a breakpoint in your pass</a></li>
+     <li><a href="#debugmisc">Miscellaneous Problems</a></li>
+     </ul></li>
+   <li><a href="#future">Future extensions planned</a>
+     <ul>
+     <li><a href="#SMP">Multithreaded LLVM</a></li>
+     <li><a href="#PassFunctionPass"><tt>ModulePass</tt>es requiring 
+                                     <tt>FunctionPass</tt>es</a></li>
+     </ul></li>
+ </ol>
+ 
+ <div class="doc_author">
+   <p>Written by <a href="mailto:sabre at nondot.org">Chris Lattner</a></p>
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="introduction">Introduction - What is a pass?</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The LLVM Pass Framework is an important part of the LLVM system, because LLVM
+ passes are where most of the interesting parts of the compiler exist.  Passes
+ perform the transformations and optimizations that make up the compiler, they
+ build the analysis results that are used by these transformations, and they are,
+ above all, a structuring technique for compiler code.</p>
+ 
+ <p>All LLVM passes are subclasses of the <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1Pass.html">Pass</a></tt>
+ class, which implement functionality by overriding virtual methods inherited
+ from <tt>Pass</tt>.  Depending on how your pass works, you should inherit from
+ the <tt><a href="#ModulePass">ModulePass</a></tt>, <tt><a
+ href="#CallGraphSCCPass">CallGraphSCCPass</a></tt>, <tt><a
+ href="#FunctionPass">FunctionPass</a></tt>, or <tt><a
+ href="#BasicBlockPass">BasicBlockPass</a></tt> classes, which gives the system
+ more information about what your pass does, and how it can be combined with
+ other passes.  One of the main features of the LLVM Pass Framework is that it
+ schedules passes to run in an efficient way based on the constraints that your
+ pass meets (which are indicated by the class it derives from).</p>
+ 
+ <p>We start by showing you how to construct a pass, everything from setting up
+ the code, to compiling, loading, and executing it.  After the basics are down,
+ more advanced features are discussed.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="quickstart">Quick Start - Writing hello world</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Here we describe how to write the "hello world" of passes.  The "Hello" pass
+ is designed to simply print out the name of non-external functions that exist in
+ the program being compiled.  It does not modify the program at all, it just
+ inspects it.  The source code and files for this pass are available in the LLVM
+ source tree in the <tt>lib/Transforms/Hello</tt> directory.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="makefile">Setting up the build environment</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+   <p>First, you need to create a new directory somewhere in the LLVM source 
+   base.  For this example, we'll assume that you made 
+   <tt>lib/Transforms/Hello</tt>.  Next, you must set up a build script 
+   (Makefile) that will compile the source code for the new pass.  To do this, 
+   copy the following into <tt>Makefile</tt>:</p>
+   <hr/>
+ 
+ <pre>
+ # Makefile for hello pass
+ 
+ # Path to top level of LLVM hierarchy
+ LEVEL = ../../..
+ 
+ # Name of the library to build
+ LIBRARYNAME = Hello
+ 
+ # Build a dynamically linkable shared object
+ SHARED_LIBRARY = 1
+ 
+ # Make the shared library become a loadable module so the tools can 
+ # dlopen/dlsym on the resulting library.
+ LOADABLE_MODULE = 1
+ 
+ # Include the makefile implementation stuff
+ include $(LEVEL)/Makefile.common
+ </pre>
+ 
+ <p>This makefile specifies that all of the <tt>.cpp</tt> files in the current
+ directory are to be compiled and linked together into a
+ <tt>Debug/lib/Hello.so</tt> shared object that can be dynamically loaded by
+ the <tt>opt</tt> or <tt>analyze</tt> tools via their <tt>-load</tt> options.  
+ If your operating system uses a suffix other than <tt>.so</tt> (such as Windows or
+ Mac OS X), the appropriate extension will be used.</p>
+ 
+ <p>Now that we have the build scripts set up, we just need to write the code for
+ the pass itself.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="basiccode">Basic code required</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Now that we have a way to compile our new pass, we just have to write it.
+ Start out with:</p>
+ 
+ <pre>
+ <b>#include</b> "<a href="http://llvm.org/doxygen/Pass_8h-source.html">llvm/Pass.h</a>"
+ <b>#include</b> "<a href="http://llvm.org/doxygen/Function_8h-source.html">llvm/Function.h</a>"
+ </pre>
+ 
+ <p>These are needed because we are writing a <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1Pass.html">Pass</a></tt>, and
+ we are operating on <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1Function.html">Function</a></tt>s.</p>
+ 
+ <p>Next we have:</p>
+ <pre>
+ <b>using namespace llvm;</b>
+ </pre>
+ <p>... which is required because the functions from the include files 
+ live in the llvm namespace.
+ </p>
+ 
+ <p>Next we have:</p>
+ 
+ <pre>
+ <b>namespace</b> {
+ </pre>
+ 
+ <p>... which starts out an anonymous namespace.  Anonymous namespaces are to C++
+ what the "<tt>static</tt>" keyword is to C (at global scope).  It makes the
+ things declared inside of the anonymous namespace only visible to the current
+ file.  If you're not familiar with them, consult a decent C++ book for more
+ information.</p>
+ 
+ <p>Next, we declare our pass itself:</p>
+ 
+ <pre>
+   <b>struct</b> Hello : <b>public</b> <a href="#FunctionPass">FunctionPass</a> {
+ </pre>
+ 
+ <p>This declares a "<tt>Hello</tt>" class that is a subclass of <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1FunctionPass.html">FunctionPass</a></tt>.
+ The different builtin pass subclasses are described in detail <a
+ href="#passtype">later</a>, but for now, know that <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a>es operate on one function at a
+ time.</p>
+ 
+ <pre>
+     <b>virtual bool</b> <a href="#runOnFunction">runOnFunction</a>(Function &F) {
+       std::cerr << "<i>Hello: </i>" << F.getName() << "\n";
+       <b>return false</b>;
+     }
+   };  <i>// end of struct Hello</i>
+ </pre>
+ 
+ <p>We declare a "<a href="#runOnFunction"><tt>runOnFunction</tt></a>" method,
+ which overrides an abstract virtual method inherited from <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a>.  This is where we are supposed
+ to do our thing, so we just print out our message with the name of each
+ function.</p>
+ 
+ <pre>
+   RegisterOpt<Hello> X("<i>hello</i>", "<i>Hello World Pass</i>");
+ }  <i>// end of anonymous namespace</i>
+ </pre>
+ 
+ <p>Lastly, we register our class <tt>Hello</tt>, giving it a command line
+ argument "<tt>hello</tt>", and a name "<tt>Hello World Pass</tt>".  There are
+ several different ways of <a href="#registration">registering your pass</a>,
+ depending on what it is to be used for.  For "optimizations" we use the
+ <tt>RegisterOpt</tt> template.</p>
+ 
+ <p>As a whole, the <tt>.cpp</tt> file looks like:</p>
+ 
+ <pre>
+ <b>#include</b> "<a href="http://llvm.org/doxygen/Pass_8h-source.html">llvm/Pass.h</a>"
+ <b>#include</b> "<a href="http://llvm.org/doxygen/Function_8h-source.html">llvm/Function.h</a>"
+ 
+ <b>using namespace llvm;</b>
+ 
+ <b>namespace</b> {
+   <b>struct Hello</b> : <b>public</b> <a href="#FunctionPass">FunctionPass</a> {
+     <b>virtual bool</b> <a href="#runOnFunction">runOnFunction</a>(Function &F) {
+       std::cerr << "<i>Hello: </i>" << F.getName() << "\n";
+       <b>return false</b>;
+     }
+   };
+   
+   RegisterOpt<Hello> X("<i>hello</i>", "<i>Hello World Pass</i>");
+ }
+ </pre>
+ 
+ <p>Now that it's all together, compile the file with a simple "<tt>gmake</tt>"
+ command in the local directory and you should get a new
+ "<tt>Debug/lib/Hello.so</tt>" file.  Note that everything in this file is
+ contained in an anonymous namespace: this reflects the fact that passes are
+ self-contained units that do not need external interfaces (although they can have
+ them) to be useful.</p>
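+ 
+ <p>The division of labour between a pass and the framework that drives it can
+ be modelled in plain C++ without any LLVM headers.  The sketch below is an
+ illustration only, not the real pass infrastructure:
+ <tt>FunctionSketch</tt>, <tt>ModuleSketch</tt>, <tt>FunctionPassSketch</tt>,
+ <tt>HelloSketch</tt>, <tt>runFunctionPass</tt> and <tt>helloOutput</tt> are
+ hypothetical names.  It assumes that the driver skips external functions and
+ combines the boolean results of <tt>runOnFunction</tt> to learn whether
+ anything was modified.</p>
+ 

```cpp
#include <cassert>
#include <ostream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-ins for Function and Module.
struct FunctionSketch {
  std::string name;
  bool isExternal;  // true for a declaration without a body
};
typedef std::vector<FunctionSketch> ModuleSketch;

// Hypothetical stand-in for the FunctionPass interface.
struct FunctionPassSketch {
  virtual ~FunctionPassSketch() {}
  // Returns true if the function was modified, false otherwise.
  virtual bool runOnFunction(FunctionSketch &F) = 0;
};

// An inspect-only pass: it prints each function name and changes nothing.
struct HelloSketch : FunctionPassSketch {
  std::ostream &Out;
  explicit HelloSketch(std::ostream &out) : Out(out) {}
  bool runOnFunction(FunctionSketch &F) override {
    assert(!F.isExternal && "driver is expected to skip external functions");
    Out << "Hello: " << F.name << "\n";
    return false;
  }
};

// The driver: run the pass on every function that has a body and report
// whether any invocation modified the program.
bool runFunctionPass(FunctionPassSketch &P, ModuleSketch &M) {
  bool changed = false;
  for (FunctionSketch &F : M)
    if (!F.isExternal)
      changed |= P.runOnFunction(F);
  return changed;
}

// Convenience helper: run HelloSketch over a module and collect its output.
std::string helloOutput(ModuleSketch &M) {
  std::ostringstream out;
  HelloSketch hello(out);
  runFunctionPass(hello, M);
  return out.str();
}
```

+ <p>Because <tt>HelloSketch</tt> only inspects its input, it always returns
+ false, so the driver reports that nothing changed.</p>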
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="running">Running a pass with <tt>opt</tt> or <tt>analyze</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Now that you have a brand new shiny shared object file, we can use the
+ <tt>opt</tt> command to run an LLVM program through your pass.  Because you
+ registered your pass with the <tt>RegisterOpt</tt> template, you will be able to
+ use the <tt>opt</tt> tool to access it, once loaded.</p>
+ 
+ <p>To test it, follow the example at the end of the <a
+ href="GettingStarted.html">Getting Started Guide</a> to compile "Hello World" to
+ LLVM.  We can now run the bytecode file (<tt>hello.bc</tt>) for the program
+ through our transformation like this (of course, any bytecode file will
+ work):</p>
+ 
+ <pre>
+ $ opt -load ../../../Debug/lib/Hello.so -hello < hello.bc > /dev/null
+ Hello: __main
+ Hello: puts
+ Hello: main
+ </pre>
+ 
+ <p>The '<tt>-load</tt>' option specifies that '<tt>opt</tt>' should load your
+ pass as a shared object, which makes '<tt>-hello</tt>' a valid command line
+ argument (which is one reason you need to <a href="#registration">register your
+ pass</a>).  Because the hello pass does not modify the program in any
+ interesting way, we just throw away the result of <tt>opt</tt> (sending it to
+ <tt>/dev/null</tt>).</p>
+ 
+ <p>To see what happened to the other string you registered, try running
+ <tt>opt</tt> with the <tt>--help</tt> option:</p>
+ 
+ <pre>
+ $ opt -load ../../../Debug/lib/Hello.so --help
+ OVERVIEW: llvm .bc -> .bc modular optimizer
+ 
+ USAGE: opt [options] <input bytecode>
+ 
+ OPTIONS:
+   Optimizations available:
+ ...
+     -funcresolve    - Resolve Functions
+     -gcse           - Global Common Subexpression Elimination
+     -globaldce      - Dead Global Elimination
+     <b>-hello          - Hello World Pass</b>
+     -indvars        - Canonicalize Induction Variables
+     -inline         - Function Integration/Inlining
+     -instcombine    - Combine redundant instructions
+ ...
+ </pre>
+ 
+ <p>The pass name gets added as the information string for your pass, giving some
+ documentation to users of <tt>opt</tt>.  Now that you have a working pass, you
+ would go ahead and make it do the cool transformations you want.  Once you get
+ it all working and tested, it may become useful to find out how fast your pass
+ is.  The <a href="#passmanager"><tt>PassManager</tt></a> provides a nice command
+ line option (<tt>--time-passes</tt>) that allows you to get information about
+ the execution time of your pass along with the other passes you queue up.  For
+ example:</p>
+ 
+ <pre>
+ $ opt -load ../../../Debug/lib/Hello.so -hello -time-passes < hello.bc > /dev/null
+ Hello: __main
+ Hello: puts
+ Hello: main
+ ===============================================================================
+                       ... Pass execution timing report ...
+ ===============================================================================
+   Total Execution Time: 0.02 seconds (0.0479059 wall clock)
+ 
+    ---User Time---   --System Time--   --User+System--   ---Wall Time---  --- Pass Name ---
+    0.0100 (100.0%)   0.0000 (  0.0%)   0.0100 ( 50.0%)   0.0402 ( 84.0%)  Bytecode Writer
+    0.0000 (  0.0%)   0.0100 (100.0%)   0.0100 ( 50.0%)   0.0031 (  6.4%)  Dominator Set Construction
+    0.0000 (  0.0%)   0.0000 (  0.0%)   0.0000 (  0.0%)   0.0013 (  2.7%)  Module Verifier
+  <b>  0.0000 (  0.0%)   0.0000 (  0.0%)   0.0000 (  0.0%)   0.0033 (  6.9%)  Hello World Pass</b>
+    0.0100 (100.0%)   0.0100 (100.0%)   0.0200 (100.0%)   0.0479 (100.0%)  TOTAL
+ </pre>
+ 
+ <p>As you can see, our implementation above is pretty fast :).  The additional
+ passes listed are automatically inserted by the '<tt>opt</tt>' tool to verify
+ that the LLVM code emitted by your pass is still valid, well-formed LLVM that
+ hasn't been broken somehow.</p>
+ 
+ <p>Now that you have seen the basics of the mechanics behind passes, we can talk
+ about some more details of how they work and how to use them.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="passtype">Pass classes and requirements</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>One of the first things that you should do when designing a new pass is to
+ decide what class you should subclass for your pass.  The <a
+ href="#basiccode">Hello World</a> example uses the <tt><a
+ href="#FunctionPass">FunctionPass</a></tt> class for its implementation, but we
+ did not discuss why or when this should occur.  Here we talk about the classes
+ available, from the most general to the most specific.</p>
+ 
+ <p>When choosing a superclass for your Pass, you should choose the <b>most
+ specific</b> class possible, while still being able to meet the requirements
+ listed.  This gives the LLVM Pass Infrastructure information necessary to
+ optimize how passes are run, so that the resultant compiler isn't unnecessarily
+ slow.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ImmutablePass">The <tt>ImmutablePass</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The most plain and boring type of pass is the "<tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1ImmutablePass.html">ImmutablePass</a></tt>"
+ class.  This pass type is used for passes that do not have to be run, do not
+ change state, and never need to be updated.  This is not a normal type of
+ transformation or analysis, but can provide information about the current
+ compiler configuration.</p>
+ 
+ <p>Although this pass class is very infrequently used, it is important for
+ providing information about the current target machine being compiled for, and
+ other static information that can affect the various transformations.</p>
+ 
+ <p><tt>ImmutablePass</tt>es never invalidate other transformations, are never
+ invalidated, and are never "run".</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="ModulePass">The <tt>ModulePass</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The "<tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1ModulePass.html">ModulePass</a></tt>"
+ class is the most general of all superclasses that you can use.  Deriving from
+ <tt>ModulePass</tt> indicates that your pass uses the entire program as a unit,
+ referring to function bodies in no predictable order, or adding and removing
+ functions.  Because nothing is known about the behavior of <tt>ModulePass</tt>
+ subclasses, no optimization can be done for their execution.</p>
+ 
+ <p>To write a correct <tt>ModulePass</tt> subclass, derive from
+ <tt>ModulePass</tt> and override the <tt>runOnModule</tt> method with the
+ following signature:</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="runOnModule">The <tt>runOnModule</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> runOnModule(Module &M) = 0;
+ </pre>
+ 
+ <p>The <tt>runOnModule</tt> method performs the interesting work of the pass.
+ It should return true if the module was modified by the transformation and
+ false otherwise.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="CallGraphSCCPass">The <tt>CallGraphSCCPass</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The "<tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1CallGraphSCCPass.html">CallGraphSCCPass</a></tt>"
+ is used by passes that need to traverse the program bottom-up on the call graph
+ (callees before callers).  Deriving from CallGraphSCCPass provides some
+ mechanics for building and traversing the CallGraph, but also allows the system
+ to optimize execution of <tt>CallGraphSCCPass</tt>es.  If your pass meets the
+ requirements outlined below, and doesn't meet the requirements of a <tt><a
+ href="#FunctionPass">FunctionPass</a></tt> or <tt><a
+ href="#BasicBlockPass">BasicBlockPass</a></tt>, you should derive from
+ <tt>CallGraphSCCPass</tt>.</p>
+ 
+ <p><b>TODO</b>: explain briefly what SCC, Tarjan's algo, and B-U mean.</p>
+ 
+ <p>To be explicit, <tt>CallGraphSCCPass</tt> subclasses are:</p>
+ 
+ <ol>
+ 
+ <li>... <em>not allowed</em> to modify any <tt>Function</tt>s that are not in
+ the current SCC.</li>
+ 
+ <li>... <em>not allowed</em> to inspect any <tt>Function</tt>s other than those
+ in the current SCC and the direct callees of the SCC.</li>
+ 
+ <li>... <em>required</em> to preserve the current CallGraph object, updating it
+ to reflect any changes made to the program.</li>
+ 
+ <li>... <em>not allowed</em> to add or remove SCCs from the current Module,
+ though they may change the contents of an SCC.</li>
+ 
+ <li>... <em>allowed</em> to add or remove global variables from the current
+ Module.</li>
+ 
+ <li>... <em>allowed</em> to maintain state across invocations of
+     <a href="#runOnSCC"><tt>runOnSCC</tt></a> (including global data).</li>
+ </ol>
+ 
+ <p>Implementing a <tt>CallGraphSCCPass</tt> is slightly tricky in some cases
+ because it has to handle SCCs with more than one node in them.  All of the virtual
+ methods described below should return true if they modified the program, or
+ false if they didn't.</p>
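+ 
+ <p>For readers unfamiliar with the terms used in this section: an SCC
+ (strongly connected component) of the call graph is a maximal set of functions
+ that can all reach one another through calls, for example a group of mutually
+ recursive functions.  Tarjan's algorithm finds the SCCs with a single
+ depth-first search and emits each SCC only after every SCC it calls into,
+ which is exactly the bottom-up (callees before callers) order.  The following
+ self-contained sketch illustrates this on a graph of integer node ids.  It is
+ not the code LLVM uses; <tt>CallGraphSketch</tt> and <tt>TarjanSCC</tt> are
+ hypothetical names, and the recursion assumes a graph small enough not to
+ exhaust the stack.</p>
+ 

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical call graph: node i calls every node listed in graph[i].
typedef std::vector<std::vector<int> > CallGraphSketch;

class TarjanSCC {
  const CallGraphSketch &G;
  std::vector<int> index;     // DFS visit number, -1 if not yet visited
  std::vector<int> lowlink;   // smallest index reachable within the DFS stack
  std::vector<bool> onStack;
  std::vector<int> stack;
  int next;

  void visit(int v) {
    index[v] = lowlink[v] = next++;
    stack.push_back(v);
    onStack[v] = true;
    for (int w : G[v]) {
      if (index[w] < 0) {          // unvisited callee: recurse into it
        visit(w);
        lowlink[v] = std::min(lowlink[v], lowlink[w]);
      } else if (onStack[w]) {     // call back into the component being built
        lowlink[v] = std::min(lowlink[v], index[w]);
      }
    }
    if (lowlink[v] == index[v]) {  // v is the root of a complete SCC
      std::vector<int> scc;
      int w;
      do {
        w = stack.back();
        stack.pop_back();
        onStack[w] = false;
        scc.push_back(w);
      } while (w != v);
      std::sort(scc.begin(), scc.end());  // stable order within one SCC
      sccs.push_back(scc);
    }
  }

public:
  // SCCs in the order they were completed: callees before callers.
  std::vector<std::vector<int> > sccs;

  explicit TarjanSCC(const CallGraphSketch &graph)
      : G(graph), index(graph.size(), -1), lowlink(graph.size(), 0),
        onStack(graph.size(), false), next(0) {
    for (int v = 0; v < (int)G.size(); ++v)
      if (index[v] < 0)
        visit(v);
  }
};
```

+ <p>In a graph where node 0 calls nodes 1 and 3, nodes 1 and 2 call each other,
+ and node 2 also calls node 3, the sketch yields the leaf {3} first, then the
+ mutually recursive pair {1, 2}, and finally the caller {0}.</p>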
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="doInitialization_scc">The <tt>doInitialization(Module &)</tt>
+   method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> doInitialization(Module &M);
+ </pre>
+ 
+ <p>The <tt>doInitialization</tt> method is allowed to do most of the things that
+ <tt>CallGraphSCCPass</tt>'s are not allowed to do.  It can add and remove
+ functions, get pointers to functions, etc.  The <tt>doInitialization</tt> method
+ is designed to do simple initialization that does not depend on
+ the SCCs being processed.  The <tt>doInitialization</tt> method call is not
+ scheduled to overlap with any other pass executions (thus it should be very
+ fast).</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="runOnSCC">The <tt>runOnSCC</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> runOnSCC(const std::vector<CallGraphNode *> &SCCM) = 0;
+ </pre>
+ 
+ <p>The <tt>runOnSCC</tt> method performs the interesting work of the pass, and
+ should return true if the module was modified by the transformation, false
+ otherwise.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="doFinalization_scc">The <tt>doFinalization(Module
+    &)</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> doFinalization(Module &M);
+ </pre>
+ 
+ <p>The <tt>doFinalization</tt> method is an infrequently used method that is
+ called when the pass framework has finished calling <a
+ href="#runOnSCC"><tt>runOnSCC</tt></a> for every SCC in the
+ program being compiled.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="FunctionPass">The <tt>FunctionPass</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>In contrast to <tt>ModulePass</tt> subclasses, <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1Pass.html">FunctionPass</a></tt>
+ subclasses do have a predictable, local behavior that can be expected by the
+ system.  All <tt>FunctionPass</tt>es execute on each function in the program
+ independent of all of the other functions in the program.
+ <tt>FunctionPass</tt>'s do not require that they are executed in a particular
+ order, and <tt>FunctionPass</tt>'s do not modify external functions.</p>
+ 
+ <p>To be explicit, <tt>FunctionPass</tt> subclasses are not allowed to:</p>
+ 
+ <ol>
+ <li>Modify a Function other than the one currently being processed.</li>
+ <li>Add or remove Function's from the current Module.</li>
+ <li>Add or remove global variables from the current Module.</li>
+ <li>Maintain state across invocations of
+     <a href="#runOnFunction"><tt>runOnFunction</tt></a> (including global data).</li>
+ </ol>
+ 
+ <p>Implementing a <tt>FunctionPass</tt> is usually straightforward (see the <a
+ href="#basiccode">Hello World</a> pass for an example).  <tt>FunctionPass</tt>'s
+ may override three virtual methods to do their work.  All of these methods
+ should return true if they modified the program, or false if they didn't.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="doInitialization_mod">The <tt>doInitialization(Module &)</tt>
+   method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> doInitialization(Module &M);
+ </pre>
+ 
+ <p>The <tt>doInitialization</tt> method is allowed to do most of the things that
+ <tt>FunctionPass</tt>'s are not allowed to do.  It can add and remove
+ functions, get pointers to functions, etc.  The <tt>doInitialization</tt> method
+ is designed to do simple initialization that does not depend on
+ the functions being processed.  The <tt>doInitialization</tt> method call is not
+ scheduled to overlap with any other pass executions (thus it should be very
+ fast).</p>
+ 
+ <p>A good example of how this method should be used is the <a
+ href="http://llvm.org/doxygen/LowerAllocations_8cpp-source.html">LowerAllocations</a>
+ pass.  This pass converts <tt>malloc</tt> and <tt>free</tt> instructions into
+ platform dependent <tt>malloc()</tt> and <tt>free()</tt> function calls.  It
+ uses the <tt>doInitialization</tt> method to get a reference to the malloc and
+ free functions that it needs, adding prototypes to the module if necessary.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="runOnFunction">The <tt>runOnFunction</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> runOnFunction(Function &F) = 0;
+ </pre>
+ 
+ <p>The <tt>runOnFunction</tt> method must be implemented by your subclass to do
+ the transformation or analysis work of your pass.  As usual, a true value should
+ be returned if the function is modified.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="doFinalization_mod">The <tt>doFinalization(Module
+   &)</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> doFinalization(Module &M);
+ </pre>
+ 
+ <p>The <tt>doFinalization</tt> method is an infrequently used method that is
+ called when the pass framework has finished calling <a
+ href="#runOnFunction"><tt>runOnFunction</tt></a> for every function in the
+ program being compiled.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="BasicBlockPass">The <tt>BasicBlockPass</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p><tt>BasicBlockPass</tt>'s are just like <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a>'s, except that they must limit
+ their scope of inspection and modification to a single basic block at a time.
+ As such, they are <b>not</b> allowed to do any of the following:</p>
+ 
+ <ol>
+ <li>Modify or inspect any basic blocks outside of the current one</li>
+ <li>Maintain state across invocations of
+     <a href="#runOnBasicBlock"><tt>runOnBasicBlock</tt></a></li>
+ <li>Modify the control flow graph (by altering terminator instructions)</li>
+ <li>Any of the things forbidden for
+     <a href="#FunctionPass"><tt>FunctionPass</tt></a>es.</li>
+ </ol>
+ 
+ <p><tt>BasicBlockPass</tt>es are useful for traditional local and "peephole"
+ optimizations.  They may override the same <a
+ href="#doInitialization_mod"><tt>doInitialization(Module &)</tt></a> and <a
+ href="#doFinalization_mod"><tt>doFinalization(Module &)</tt></a> methods that <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a>'s have, and they also have the
+ following virtual methods that may be implemented:</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="doInitialization_fn">The <tt>doInitialization(Function
+   &)</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> doInitialization(Function &F);
+ </pre>
+ 
+ <p>The <tt>doInitialization</tt> method is allowed to do most of the things that
+ <tt>BasicBlockPass</tt>'s are not allowed to do, but that
+ <tt>FunctionPass</tt>'s can.  The <tt>doInitialization</tt> method is designed
+ to do simple initialization that does not depend on the
+ BasicBlocks being processed.  The <tt>doInitialization</tt> method call is not
+ scheduled to overlap with any other pass executions (thus it should be very
+ fast).</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="runOnBasicBlock">The <tt>runOnBasicBlock</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> runOnBasicBlock(BasicBlock &BB) = 0;
+ </pre>
+ 
+ <p>Override this function to do the work of the <tt>BasicBlockPass</tt>.  This
+ function is not allowed to inspect or modify basic blocks other than the
+ parameter, and is not allowed to modify the CFG.  A true value must be returned
+ if the basic block is modified.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="doFinalization_fn">The <tt>doFinalization(Function &)</tt> 
+   method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> doFinalization(Function &F);
+ </pre>
+ 
+ <p>The <tt>doFinalization</tt> method is an infrequently used method that is
+ called when the pass framework has finished calling <a
+ href="#runOnBasicBlock"><tt>runOnBasicBlock</tt></a> for every BasicBlock in the
+ function being processed.  This can be used to perform per-function
+ finalization.</p>
+ 
+ </div>
+ 
+ <!-- ======================================================================= -->
+ <div class="doc_subsection">
+   <a name="MachineFunctionPass">The <tt>MachineFunctionPass</tt> class</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>A <tt>MachineFunctionPass</tt> is a part of the LLVM code generator that
+ executes on the machine-dependent representation of each LLVM function in the
+ program.  A <tt>MachineFunctionPass</tt> is also a <tt>FunctionPass</tt>, so all
+ the restrictions that apply to a <tt>FunctionPass</tt> also apply to it.
+ <tt>MachineFunctionPass</tt>es also have additional restrictions. In particular,
+ <tt>MachineFunctionPass</tt>es are not allowed to do any of the following:</p>
+ 
+ <ol>
+ <li>Modify any LLVM Instructions, BasicBlocks or Functions.</li>
+ <li>Modify a MachineFunction other than the one currently being processed.</li>
+ <li>Add or remove MachineFunctions from the current Module.</li>
+ <li>Add or remove global variables from the current Module.</li>
+ <li>Maintain state across invocations of <a
+ href="#runOnMachineFunction"><tt>runOnMachineFunction</tt></a> (including global
+ data)</li>
+ </ol>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="runOnMachineFunction">The <tt>runOnMachineFunction(MachineFunction
+   &MF)</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual bool</b> runOnMachineFunction(MachineFunction &MF) = 0;
+ </pre>
+ 
+ <p><tt>runOnMachineFunction</tt> can be considered the main entry point of a
+ <tt>MachineFunctionPass</tt>; that is, you should override this method to do the
+ work of your <tt>MachineFunctionPass</tt>.</p>
+ 
+ <p>The <tt>runOnMachineFunction</tt> method is called on every
+ <tt>MachineFunction</tt> in a <tt>Module</tt>, so that the
+ <tt>MachineFunctionPass</tt> may perform optimizations on the machine-dependent
+ representation of the function. If you want to get at the LLVM <tt>Function</tt>
+ for the <tt>MachineFunction</tt> you're working on, use
+ <tt>MachineFunction</tt>'s <tt>getFunction()</tt> accessor method -- but
+ remember, you may not modify the LLVM <tt>Function</tt> or its contents from a
+ <tt>MachineFunctionPass</tt>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="registration">Pass registration</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>In the <a href="#basiccode">Hello World</a> example pass we illustrated how
+ pass registration works, and discussed some of the reasons that it is used and
+ what it does.  Here we discuss how and why passes are registered.</p>
+ 
+ <p>Passes can be registered in several different ways.  Depending on the general
+ classification of the pass, you should use one of the following templates to
+ register the pass:</p>
+ 
+ <ul>
+ <li><b><tt>RegisterOpt</tt></b> - This template should be used when you are
+ registering a pass that logically should be available for use in the
+ '<tt>opt</tt>' utility.</li>
+ 
+ <li><b><tt>RegisterAnalysis</tt></b> - This template should be used when you are
+ registering a pass that logically should be available for use in the
+ '<tt>analyze</tt>' utility.</li>
+ 
+ <li><b><tt>RegisterPass</tt></b> - This is the generic form of the
+ <tt>Register*</tt> templates that should be used if you want your pass listed by
+ multiple or no utilities.  This template takes an extra third argument that
+ specifies which tools it should be listed in.  See the <a
+ href="http://llvm.org/doxygen/PassSupport_8h-source.html">PassSupport.h</a>
+ file for more information.</li>
+ 
+ </ul>
+ 
+ <p>Regardless of how you register your pass, you must specify at least two
+ parameters.  The first parameter is the command line name of the pass, which is
+ used to specify that the pass should be added to a program (for
+ example <tt>opt</tt> or <tt>analyze</tt>).  The second parameter is the descriptive
+ name of the pass, which is used for the <tt>--help</tt> output of programs, as
+ well as for debug output generated by the <tt>--debug-pass</tt> option.</p>
+ 
+ <p>If a pass is registered to be used by the <tt>analyze</tt> utility, you
+ should implement the virtual <tt>print</tt> method:</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="print">The <tt>print</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual void</b> print(std::ostream &O, <b>const</b> Module *M) <b>const</b>;
+ </pre>
+ 
+ <p>The <tt>print</tt> method must be implemented by "analyses" in order to print
+ a human readable version of the analysis results.  This is useful for debugging
+ an analysis itself, as well as for other people to figure out how an analysis
+ works.  The <tt>analyze</tt> tool uses this method to generate its output.</p>
+ 
+ <p>The <tt>ostream</tt> parameter specifies the stream to write the results on,
+ and the <tt>Module</tt> parameter gives a pointer to the top level module of the
+ program that has been analyzed.  Note however that this pointer may be null in
+ certain circumstances (such as calling <tt>Pass::dump()</tt> from a
+ debugger), so it should only be used to enhance debug output and should not be
+ depended on.</p>
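+ 
+ <p>The null-pointer caveat can be illustrated with a small stand-alone sketch.
+ <tt>ToyModule</tt> and <tt>ToyCountAnalysis</tt> are hypothetical stand-ins
+ invented here (they are not LLVM classes); only the shape of the
+ <tt>print</tt> signature is taken from the text above.  The point is that the
+ module pointer decorates the output but is never needed to produce it.</p>

```cpp
#include <cassert>
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical stand-ins for illustration; not the LLVM classes.
struct ToyModule {
  std::string Name;
};

struct ToyCountAnalysis {
  unsigned NumCalls;   // Result computed by the (omitted) analysis.

  // Same shape as the print method described above: M may be null, so it
  // is only used to decorate the output, never to produce it.
  void print(std::ostream &O, const ToyModule *M) const {
    O << "Call count";
    if (M)
      O << " for module '" << M->Name << "'";
    O << ": " << NumCalls << "\n";
  }
};
```

+ <p>Calling <tt>print</tt> with a null module still reports the result, just
+ without the module name, so the same method works from a debugger.</p>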
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="interaction">Specifying interactions between passes</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>One of the main responsibilities of the <tt>PassManager</tt> is to make sure
+ that passes interact with each other correctly.  Because <tt>PassManager</tt>
+ tries to <a href="#passmanager">optimize the execution of passes</a> it must
+ know how the passes interact with each other and what dependencies exist between
+ the various passes.  To track this, each pass can declare the set of passes that
+ are required to be executed before the current pass, and the passes which are
+ invalidated by the current pass.</p>
+ 
+ <p>Typically this functionality is used to require that analysis results are
+ computed before your pass is run.  Running arbitrary transformation passes can
+ invalidate the computed analysis results, which is what the invalidation set
+ specifies.  If a pass does not implement the <tt><a
+ href="#getAnalysisUsage">getAnalysisUsage</a></tt> method, it defaults to not
+ having any prerequisite passes, and invalidating <b>all</b> other passes.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="getAnalysisUsage">The <tt>getAnalysisUsage</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual void</b> getAnalysisUsage(AnalysisUsage &Info) <b>const</b>;
+ </pre>
+ 
+ <p>By implementing the <tt>getAnalysisUsage</tt> method, the required and
+ invalidated sets may be specified for your transformation.  The implementation
+ should fill in the <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1AnalysisUsage.html">AnalysisUsage</a></tt>
+ object with information about which passes are required and not invalidated.  To
+ do this, a pass may call any of the following methods on the AnalysisUsage
+ object:</p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="AU::addRequired">The <tt>AnalysisUsage::addRequired<></tt> and <tt>AnalysisUsage::addRequiredTransitive<></tt> methods</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ If your pass requires a previous pass to be executed (an analysis for example),
+ it can use one of these methods to arrange for it to be run before your pass.
+ LLVM has many different types of analyses and passes that can be required,
+ spanning the range from <tt>DominatorSet</tt> to <tt>BreakCriticalEdges</tt>.
+ Requiring <tt>BreakCriticalEdges</tt>, for example, guarantees that there will
+ be no critical edges in the CFG when your pass is run.
+ </p>
+ 
+ <p>
+ Some analyses chain to other analyses to do their job.  For example, an <a
+ href="AliasAnalysis.html">AliasAnalysis</a> implementation is required to <a
+ href="AliasAnalysis.html#chaining">chain</a> to other alias analysis passes.  In
+ cases where analyses chain, the <tt>addRequiredTransitive</tt> method should be
+ used instead of the <tt>addRequired</tt> method.  This informs the PassManager
+ that the transitively required pass should be alive as long as the requiring
+ pass is.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="AU::addPreserved">The <tt>AnalysisUsage::addPreserved<></tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ <p>
+ One of the jobs of the PassManager is to optimize how and when analyses are run.
+ In particular, it attempts to avoid recomputing data unless it needs to.  For
+ this reason, passes are allowed to declare that they preserve (i.e., they don't
+ invalidate) an existing analysis if it's available.  For example, a simple
+ constant folding pass would not modify the CFG, so it can't possibly affect the
+ results of dominator analysis.  By default, all passes are assumed to invalidate
+ all others.
+ </p>
+ 
+ <p>
+ The <tt>AnalysisUsage</tt> class provides several methods which are useful in
+ certain circumstances that are related to <tt>addPreserved</tt>.  In particular,
+ the <tt>setPreservesAll</tt> method can be called to indicate that the pass does
+ not modify the LLVM program at all (which is true for analyses), and the
+ <tt>setPreservesCFG</tt> method can be used by transformations that change
+ instructions in the program but do not modify the CFG or terminator instructions
+ (note that this property is implicitly set for <a
+ href="#BasicBlockPass">BasicBlockPass</a>'s).
+ </p>
+ 
+ <p>
+ <tt>addPreserved</tt> is particularly useful for transformations like
+ <tt>BreakCriticalEdges</tt>.  This pass knows how to update a small set of loop
+ and dominator related analyses if they exist, so it can preserve them, despite
+ the fact that it hacks on the CFG.
+ </p>
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="AU::examples">Example implementations of <tt>getAnalysisUsage</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <i>// This is an example implementation from an analysis, which does not modify
+   // the program at all, yet has a prerequisite.</i>
+   <b>void</b> <a href="http://llvm.org/doxygen/classllvm_1_1PostDominanceFrontier.html">PostDominanceFrontier</a>::getAnalysisUsage(AnalysisUsage &AU) <b>const</b> {
+     AU.setPreservesAll();
+     AU.addRequired<<a href="http://llvm.org/doxygen/classllvm_1_1PostDominatorTree.html">PostDominatorTree</a>>();
+   }
+ </pre>
+ 
+ <p>and:</p>
+ 
+ <pre>
+   <i>// This example modifies the program, but does not modify the CFG</i>
+   <b>void</b> <a href="http://llvm.org/doxygen/structLICM.html">LICM</a>::getAnalysisUsage(AnalysisUsage &AU) <b>const</b> {
+     AU.setPreservesCFG();
+     AU.addRequired<<a href="http://llvm.org/doxygen/classllvm_1_1LoopInfo.html">LoopInfo</a>>();
+   }
+ </pre>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="getAnalysis">The <tt>getAnalysis<></tt> and <tt>getAnalysisToUpdate<></tt> methods</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>Pass::getAnalysis<></tt> method is automatically inherited by
+ your class, providing you with access to the passes that you declared that you
+ required with the <a href="#getAnalysisUsage"><tt>getAnalysisUsage</tt></a>
+ method.  It takes a single template argument that specifies which pass class you
+ want, and returns a reference to that pass.  For example:</p>
+ 
+ <pre>
+    bool LICM::runOnFunction(Function &F) {
+      LoopInfo &LI = getAnalysis<LoopInfo>();
+      ...
+    }
+ </pre>
+ 
+ <p>This method call returns a reference to the pass desired.  You may get a
+ runtime assertion failure if you attempt to get an analysis that you did not
+ declare as required in your <a
+ href="#getAnalysisUsage"><tt>getAnalysisUsage</tt></a> implementation.  This
+ method can be called by your <tt>run*</tt> method implementation, or by any
+ other local method invoked by your <tt>run*</tt> method.</p>
+ 
+ <p>
+ If your pass is capable of updating analyses if they exist (e.g.,
+ <tt>BreakCriticalEdges</tt>, as described above), you can use the
+ <tt>getAnalysisToUpdate</tt> method, which returns a pointer to the analysis if
+ it is active.  For example:</p>
+ 
+ <pre>
+   ...
+   if (DominatorSet *DS = getAnalysisToUpdate<DominatorSet>()) {
+     <i>// A DominatorSet is active.  This code will update it.</i>
+   }
+   ...
+ </pre>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="analysisgroup">Implementing Analysis Groups</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Now that we understand the basics of how passes are defined, how they are
+ used, and how they are required from other passes, it's time to get a little bit
+ fancier.  All of the pass relationships that we have seen so far are very
+ simple: one pass depends on one other specific pass to be run before it can run.
+ For many applications, this is great, for others, more flexibility is
+ required.</p>
+ 
+ <p>In particular, some analyses are defined such that there is a single simple
+ interface to the analysis results, but multiple ways of calculating them.
+ Consider alias analysis for example.  The most trivial alias analysis returns
+ "may alias" for any alias query.  The most sophisticated analysis is a
+ flow-sensitive, context-sensitive interprocedural analysis that can take a
+ significant amount of time to execute (and obviously, there is a lot of room
+ between these two extremes for other implementations).  To cleanly support
+ situations like this, the LLVM Pass Infrastructure supports the notion of
+ Analysis Groups.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="agconcepts">Analysis Group Concepts</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>An Analysis Group is a single simple interface that may be implemented by
+ multiple different passes.  Analysis Groups can be given human readable names
+ just like passes, but unlike passes, they need not derive from the <tt>Pass</tt>
+ class.  An analysis group may have one or more implementations, one of which is
+ the "default" implementation.</p>
+ 
+ <p>Analysis groups are used by client passes just like other passes are: the
+ <tt>AnalysisUsage::addRequired()</tt> and <tt>Pass::getAnalysis()</tt> methods.
+ In order to resolve this requirement, the <a href="#passmanager">PassManager</a>
+ scans the available passes to see if any implementations of the analysis group
+ are available.  If none is available, the default implementation is created for
+ the pass to use.  All standard rules for <A href="#interaction">interaction
+ between passes</a> still apply.</p>
+ 
+ <p>Although <a href="#registration">Pass Registration</a> is optional for normal
+ passes, all analysis group implementations must be registered, and must use the
+ <A href="#registerag"><tt>RegisterAnalysisGroup</tt></a> template to join the
+ implementation pool.  Also, a default implementation of the interface
+ <b>must</b> be registered with <A
+ href="#registerag"><tt>RegisterAnalysisGroup</tt></a>.</p>
+ 
+ <p>As a concrete example of an Analysis Group in action, consider the <a
+ href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html">AliasAnalysis</a>
+ analysis group.  The default implementation of the alias analysis interface (the
+ <tt><a
+ href="http://llvm.org/doxygen/structBasicAliasAnalysis.html">basicaa</a></tt>
+ pass) just does a few simple checks that don't require significant analysis to
+ compute (such as: two different globals can never alias each other, etc).
+ Passes that use the <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html">AliasAnalysis</a></tt>
+ interface (for example the <tt><a
+ href="http://llvm.org/doxygen/structGCSE.html">gcse</a></tt> pass), do
+ not care which implementation of alias analysis is actually provided, they just
+ use the designated interface.</p>
+ 
+ <p>From the user's perspective, commands work just like normal.  Issuing the
+ command '<tt>opt -gcse ...</tt>' will cause the <tt>basicaa</tt> class to be
+ instantiated and added to the pass sequence.  Issuing the command '<tt>opt
+ -somefancyaa -gcse ...</tt>' will cause the <tt>gcse</tt> pass to use the
+ <tt>somefancyaa</tt> alias analysis (which doesn't actually exist, it's just a
+ hypothetical example) instead.</p>
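+ 
+ <p>The lookup behavior described above can be modeled in a few lines.  This is
+ only a toy: <tt>ToyGroup</tt> and <tt>resolveGroup</tt> are invented names,
+ passes are represented by their command line names, and the rule used when
+ several group members are scheduled (the most recently scheduled one wins) is
+ an assumption made for the sketch, not something this document specifies.</p>

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy model only: these names are hypothetical, not LLVM classes.
struct ToyGroup {
  std::vector<std::string> Implementations; // Every registered member.
  std::string Default;                      // The member marked as default.
};

// Pick the implementation a client pass would use: a scheduled pass that
// belongs to the group if there is one (assumption: the latest such pass),
// otherwise the group's default implementation.
inline std::string resolveGroup(const ToyGroup &Group,
                                const std::vector<std::string> &Scheduled) {
  for (std::size_t i = Scheduled.size(); i != 0; --i)
    for (std::size_t j = 0; j != Group.Implementations.size(); ++j)
      if (Scheduled[i - 1] == Group.Implementations[j])
        return Scheduled[i - 1];
  return Group.Default;
}

inline ToyGroup makeAliasAnalysisGroup() {
  ToyGroup G;
  G.Implementations.push_back("basicaa");
  G.Implementations.push_back("somefancyaa");
  G.Default = "basicaa";
  return G;
}
```

+ <p>With nothing scheduled ahead of <tt>gcse</tt>, the model falls back to
+ <tt>basicaa</tt>; with <tt>somefancyaa</tt> scheduled first, that
+ implementation is chosen instead, matching the two commands above.</p>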
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="registerag">Using <tt>RegisterAnalysisGroup</tt></a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The <tt>RegisterAnalysisGroup</tt> template is used to register the analysis
+ group itself as well as add pass implementations to the analysis group.  First,
+ an analysis should be registered, with a human readable name provided for it.
+ Unlike registration of passes, there is no command line argument to be specified
+ for the Analysis Group Interface itself, because it is "abstract":</p>
+ 
+ <pre>
+   <b>static</b> RegisterAnalysisGroup<<a href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html">AliasAnalysis</a>> A("<i>Alias Analysis</i>");
+ </pre>
+ 
+ <p>Once the analysis is registered, passes can declare that they are valid
+ implementations of the interface by using the following code:</p>
+ 
+ <pre>
+ <b>namespace</b> {
+   //<i> Analysis Group implementations <b>must</b> be registered normally...</i>
+   RegisterOpt<FancyAA>
+   B("<i>somefancyaa</i>", "<i>A more complex alias analysis implementation</i>");
+ 
+   //<i> Declare that we implement the AliasAnalysis interface</i>
+   RegisterAnalysisGroup<<a href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html">AliasAnalysis</a>, FancyAA> C;
+ }
+ </pre>
+ 
+ <p>This just shows a class <tt>FancyAA</tt> that is registered normally, then
+ uses the <tt>RegisterAnalysisGroup</tt> template to "join" the <tt><a
+ href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html">AliasAnalysis</a></tt>
+ analysis group.  Every implementation of an analysis group should join using
+ this template.  A single pass may join multiple different analysis groups with
+ no problem.</p>
+ 
+ <pre>
+ <b>namespace</b> {
+   //<i> Analysis Group implementations <b>must</b> be registered normally...</i>
+   RegisterOpt<<a href="http://llvm.org/doxygen/structBasicAliasAnalysis.html">BasicAliasAnalysis</a>>
+   D("<i>basicaa</i>", "<i>Basic Alias Analysis (default AA impl)</i>");
+ 
+   //<i> Declare that we implement the AliasAnalysis interface</i>
+   RegisterAnalysisGroup<<a href="http://llvm.org/doxygen/classllvm_1_1AliasAnalysis.html">AliasAnalysis</a>, <a href="http://llvm.org/doxygen/structBasicAliasAnalysis.html">BasicAliasAnalysis</a>, <b>true</b>> E;
+ }
+ </pre>
+ 
+ <p>Here we show how the default implementation is specified (using the extra
+ argument to the <tt>RegisterAnalysisGroup</tt> template).  There must be exactly
+ one default implementation available at all times for an Analysis Group to be
+ used.  Here we declare that the <tt><a
+ href="http://llvm.org/doxygen/structBasicAliasAnalysis.html">BasicAliasAnalysis</a></tt>
+ pass is the default implementation for the interface.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="passStatistics">Pass Statistics</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ <p>The <a
+ href="http://llvm.org/doxygen/Statistic_8h-source.html"><tt>Statistic</tt></a>
+ class is designed to be an easy way to expose various success
+ metrics from passes.  These statistics are printed at the end of a
+ run, when the <tt>-stats</tt> option is given on the command
+ line. See the <a href="http://llvm.org/docs/ProgrammersManual.html#Statistic">Statistics section</a> in the Programmer's Manual for details.</p>
+ 
+ </div>
+ 
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="passmanager">What PassManager does</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>The <a
+ href="http://llvm.org/doxygen/PassManager_8h-source.html"><tt>PassManager</tt></a>
+ <a
+ href="http://llvm.org/doxygen/classllvm_1_1PassManager.html">class</a>
+ takes a list of passes, ensures their <a href="#interaction">prerequisites</a>
+ are set up correctly, and then schedules passes to run efficiently.  All of the
+ LLVM tools that run passes use the <tt>PassManager</tt> for execution of these
+ passes.</p>
+ 
+ <p>The <tt>PassManager</tt> does two main things to try to reduce the execution
+ time of a series of passes:</p>
+ 
+ <ol>
+ <li><b>Share analysis results</b> - The PassManager attempts to avoid
+ recomputing analysis results as much as possible.  This means keeping track of
+ which analyses are available already, which analyses get invalidated, and which
+ analyses need to be run for a pass.  An important part of this work is that the
+ <tt>PassManager</tt> tracks the exact lifetime of all analysis results, allowing
+ it to <a href="#releaseMemory">free memory</a> allocated to holding analysis
+ results as soon as they are no longer needed.</li>
+ 
+ <li><b>Pipeline the execution of passes on the program</b> - The
+ <tt>PassManager</tt> attempts to get better cache and memory usage behavior out
+ of a series of passes by pipelining the passes together.  This means that, given
+ a series of consecutive <a href="#FunctionPass"><tt>FunctionPass</tt></a>'s, it
+ will execute all of the <a href="#FunctionPass"><tt>FunctionPass</tt></a>'s on
+ the first function, then all of the <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a>es on the second function,
+ etc... until the entire program has been run through the passes.
+ 
+ <p>This improves the cache behavior of the compiler, because it is only touching
+ the LLVM program representation for a single function at a time, instead of
+ traversing the entire program.  It reduces the memory consumption of the compiler,
+ because, for example, only one <a
+ href="http://llvm.org/doxygen/classllvm_1_1DominatorSet.html"><tt>DominatorSet</tt></a>
+ needs to be calculated at a time.  This also makes possible some <a
+ href="#SMP">interesting enhancements</a> in the future.</p></li>
+ 
+ </ol>
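+ 
+ <p>The pipelining described above can be pictured with a small toy model.  The
+ sketch below is not LLVM code: <tt>pipelinedOrder</tt> and
+ <tt>unpipelinedOrder</tt> are hypothetical helpers that work on plain names
+ (written with C++11 features for brevity), and they only illustrate the order
+ in which a series of consecutive function passes would visit the functions of
+ a program under each scheme.</p>
+ 
```cpp
#include <string>
#include <vector>

// Toy model only: passes and functions are plain names, not LLVM objects.

// Pipelined schedule: run every pass on one function before moving on to
// the next function.
std::vector<std::string> pipelinedOrder(
    const std::vector<std::string> &passes,
    const std::vector<std::string> &functions) {
  std::vector<std::string> order;
  for (const std::string &fn : functions)
    for (const std::string &pass : passes)
      order.push_back(pass + "(" + fn + ")");
  return order;
}

// Unpipelined schedule, for contrast: each pass sweeps the whole program
// before the next pass starts.
std::vector<std::string> unpipelinedOrder(
    const std::vector<std::string> &passes,
    const std::vector<std::string> &functions) {
  std::vector<std::string> order;
  for (const std::string &pass : passes)
    for (const std::string &fn : functions)
      order.push_back(pass + "(" + fn + ")");
  return order;
}
```
+ 
+ <p>With the passes <tt>gcse</tt> and <tt>licm</tt> and the functions
+ <tt>main</tt> and <tt>puts</tt>, the pipelined order in this model is
+ gcse(main), licm(main), gcse(puts), licm(puts): each function is visited by
+ every pass before the next function is touched.</p>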
+ 
+ <p>The effectiveness of the <tt>PassManager</tt> is influenced directly by how
+ much information it has about the behaviors of the passes it is scheduling.  For
+ example, the "preserved" set is intentionally conservative in the face of an
+ unimplemented <a href="#getAnalysisUsage"><tt>getAnalysisUsage</tt></a> method.
+ Not implementing it when it should be implemented will have the effect of not
+ allowing any analysis results to live across the execution of your pass.</p>
+ 
+ <p>The <tt>PassManager</tt> class exposes a <tt>--debug-pass</tt> command line
+ option that is useful for debugging pass execution, seeing how things work, and
+ diagnosing when you should be preserving more analyses than you currently are.
+ (To get information about all of the variants of the <tt>--debug-pass</tt>
+ option, just type '<tt>opt --help-hidden</tt>'.)</p>
+ 
+ <p>By using the <tt>--debug-pass=Structure</tt> option, for example, we can see
+ how our <a href="#basiccode">Hello World</a> pass interacts with other passes.
+ Let's try it out with the <tt>gcse</tt> and <tt>licm</tt> passes:</p>
+ 
+ <pre>
+ $ opt -load ../../../Debug/lib/Hello.so -gcse -licm --debug-pass=Structure < hello.bc > /dev/null
+ Module Pass Manager
+   Function Pass Manager
+     Dominator Set Construction
+     Immediate Dominators Construction
+     Global Common Subexpression Elimination
+ --  Immediate Dominators Construction
+ --  Global Common Subexpression Elimination
+     Natural Loop Construction
+     Loop Invariant Code Motion
+ --  Natural Loop Construction
+ --  Loop Invariant Code Motion
+     Module Verifier
+ --  Dominator Set Construction
+ --  Module Verifier
+   Bytecode Writer
+ --Bytecode Writer
+ </pre>
+ 
+ <p>This output shows us when passes are constructed and when the analysis
+ results are known to be dead (prefixed with '<tt>--</tt>').  Here we see that
+ GCSE uses dominator and immediate dominator information to do its job.  The LICM
+ pass uses natural loop information, which uses dominator sets, but not immediate
+ dominators.  Because immediate dominators are no longer useful after the GCSE
+ pass, they are immediately destroyed.  The dominator sets are then reused to
+ compute natural loop information, which is then used by the LICM pass.</p>
+ 
+ <p>After the LICM pass, the module verifier (which the '<tt>opt</tt>' tool adds
+ automatically) runs and uses the dominator set to check that the resultant LLVM
+ code is well formed.  After it finishes, the dominator set information is
+ destroyed, having been computed once and shared by three passes.</p>
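+ 
+ <p>The lifetime tracking visible in this output boils down to finding, for each
+ analysis, the last pass in the schedule that uses it.  The following is a
+ minimal sketch under stated assumptions, not the <tt>PassManager</tt>
+ implementation: <tt>ToyPass</tt> and <tt>lastUse</tt> are hypothetical names,
+ and for simplicity each toy pass lists every analysis it depends on, including
+ ones it only reaches through another analysis.</p>
+ 
```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for a scheduled pass: a name plus the names of the
// analyses it uses (directly or through another analysis).
struct ToyPass {
  std::string name;
  std::vector<std::string> uses;
};

// For each analysis, return the index of the last pass in the schedule that
// uses it.  An analysis can be freed right after that pass has run.
std::map<std::string, std::size_t> lastUse(
    const std::vector<ToyPass> &schedule) {
  std::map<std::string, std::size_t> last;
  for (std::size_t i = 0; i < schedule.size(); ++i)
    for (const std::string &analysis : schedule[i].uses)
      last[analysis] = i;  // a later use overwrites an earlier one
  return last;
}
```
+ 
+ <p>Fed a schedule that mirrors the run above (GCSE using dominator sets and
+ immediate dominators, LICM using natural loops and dominator sets, the
+ verifier using dominator sets), this model reports that immediate dominators
+ are last used by the first pass, natural loops by the second, and dominator
+ sets by the third.</p>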
+ 
+ <p>Let's see how this changes when we run the <a href="#basiccode">Hello
+ World</a> pass in between the two passes:</p>
+ 
+ <pre>
+ $ opt -load ../../../Debug/lib/Hello.so -gcse -hello -licm --debug-pass=Structure < hello.bc > /dev/null
+ Module Pass Manager
+   Function Pass Manager
+     Dominator Set Construction
+     Immediate Dominators Construction
+     Global Common Subexpression Elimination
+ <b>--  Dominator Set Construction</b>
+ --  Immediate Dominators Construction
+ --  Global Common Subexpression Elimination
+ <b>    Hello World Pass
+ --  Hello World Pass
+     Dominator Set Construction</b>
+     Natural Loop Construction
+     Loop Invariant Code Motion
+ --  Natural Loop Construction
+ --  Loop Invariant Code Motion
+     Module Verifier
+ --  Dominator Set Construction
+ --  Module Verifier
+   Bytecode Writer
+ --Bytecode Writer
+ Hello: __main
+ Hello: puts
+ Hello: main
+ </pre>
+ 
+ <p>Here we see that the <a href="#basiccode">Hello World</a> pass has killed the
+ Dominator Set pass, even though it doesn't modify the code at all!  To fix this,
+ we need to add the following <a
+ href="#getAnalysisUsage"><tt>getAnalysisUsage</tt></a> method to our pass:</p>
+ 
+ <pre>
+     <i>// We don't modify the program, so we preserve all analyses</i>
+     <b>virtual void</b> getAnalysisUsage(AnalysisUsage &AU) <b>const</b> {
+       AU.setPreservesAll();
+     }
+ </pre>
+ 
+ <p>Now when we run our pass, we get this output:</p>
+ 
+ <pre>
+ $ opt -load ../../../Debug/lib/Hello.so -gcse -hello -licm --debug-pass=Structure < hello.bc > /dev/null
+ Pass Arguments:  -gcse -hello -licm
+ Module Pass Manager
+   Function Pass Manager
+     Dominator Set Construction
+     Immediate Dominators Construction
+     Global Common Subexpression Elimination
+ --  Immediate Dominators Construction
+ --  Global Common Subexpression Elimination
+     Hello World Pass
+ --  Hello World Pass
+     Natural Loop Construction
+     Loop Invariant Code Motion
+ --  Loop Invariant Code Motion
+ --  Natural Loop Construction
+     Module Verifier
+ --  Dominator Set Construction
+ --  Module Verifier
+   Bytecode Writer
+ --Bytecode Writer
+ Hello: __main
+ Hello: puts
+ Hello: main
+ </pre>
+ 
+ <p>This shows that we no longer accidentally invalidate dominator information,
+ and therefore do not have to compute it twice.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="releaseMemory">The <tt>releaseMemory</tt> method</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <pre>
+   <b>virtual void</b> releaseMemory();
+ </pre>
+ 
+ <p>The <tt>PassManager</tt> automatically determines when to compute analysis
+ results, and how long to keep them around for.  Because the lifetime of the pass
+ object itself is effectively the entire duration of the compilation process, we
+ need some way to free analysis results when they are no longer useful.  The
+ <tt>releaseMemory</tt> virtual method is the way to do this.</p>
+ 
+ <p>If you are writing an analysis or any other pass that retains a significant
+ amount of state (for use by another pass which "requires" your pass and uses the
+ <a href="#getAnalysis">getAnalysis</a> method) you should implement
+ <tt>releaseMemory</tt> to, well, release the memory allocated to maintain this
+ internal state.  This method is called after the <tt>run*</tt> method for the
+ class, before the next call of <tt>run*</tt> in your pass.</p>
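+ 
+ <p>As a rough illustration of the idea, here is a self-contained toy class that
+ keeps per-function state and drops it on request.  It is only a sketch:
+ <tt>ToyInstructionCount</tt>, <tt>record</tt> and <tt>cachedEntries</tt> are
+ hypothetical names, and the class deliberately does not derive from the real
+ LLVM <tt>Pass</tt> hierarchy.</p>
+ 
```cpp
#include <cstddef>
#include <map>
#include <string>

// Hypothetical stand-in for an analysis that caches results between runs.
// It is not an LLVM pass; it only mimics the shape of one.
class ToyInstructionCount {
  std::map<std::string, unsigned> Counts;  // state retained for clients

public:
  // Plays the role of a run* method: store a result for one function.
  void record(const std::string &function, unsigned count) {
    Counts[function] = count;
  }

  // Number of cached results currently held.
  std::size_t cachedEntries() const { return Counts.size(); }

  // Plays the role of releaseMemory: drop all cached state so the object
  // can stay alive without holding on to stale results.
  void releaseMemory() { Counts.clear(); }
};
```
+ 
+ <p>In this model, calling <tt>releaseMemory</tt> after the results have been
+ consumed leaves the object alive but empty, which is the behavior the
+ <tt>PassManager</tt> relies on for long-lived pass objects.</p>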
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="debughints">Using GDB with dynamically loaded passes</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Unfortunately, using GDB with dynamically loaded passes is not as easy as it
+ should be.  First of all, you can't set a breakpoint in a shared object that has
+ not been loaded yet, and second of all there are problems with inlined functions
+ in shared objects.  Here are some suggestions for debugging your pass with
+ GDB.</p>
+ 
+ <p>For the sake of discussion, I'm going to assume that you are debugging a
+ transformation invoked by <tt>opt</tt>, although nothing described here depends
+ on that.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="breakpoint">Setting a breakpoint in your pass</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>The first thing to do is start <tt>gdb</tt> on the <tt>opt</tt> process:</p>
+ 
+ <pre>
+ $ <b>gdb opt</b>
+ GNU gdb 5.0
+ Copyright 2000 Free Software Foundation, Inc.
+ GDB is free software, covered by the GNU General Public License, and you are
+ welcome to change it and/or distribute copies of it under certain conditions.
+ Type "show copying" to see the conditions.
+ There is absolutely no warranty for GDB.  Type "show warranty" for details.
+ This GDB was configured as "sparc-sun-solaris2.6"...
+ (gdb)
+ </pre>
+ 
+ <p>Note that <tt>opt</tt> has a lot of debugging information in it, so it takes
+ time to load.  Be patient.  Since we cannot set a breakpoint in our pass yet
+ (the shared object isn't loaded until runtime), we must execute the process, and
+ have it stop before it invokes our pass, but after it has loaded the shared
+ object.  The most foolproof way of doing this is to set a breakpoint in
+ <tt>PassManager::run</tt> and then run the process with the arguments you
+ want:</p>
+ 
+ <pre>
+ (gdb) <b>break PassManager::run</b>
+ Breakpoint 1 at 0x2413bc: file Pass.cpp, line 70.
+ (gdb) <b>run test.bc -load $(LLVMTOP)/llvm/Debug/lib/[libname].so -[passoption]</b>
+ Starting program: opt test.bc -load $(LLVMTOP)/llvm/Debug/lib/[libname].so -[passoption]
+ Breakpoint 1, PassManager::run (this=0xffbef174, M=@0x70b298) at Pass.cpp:70
+ 70      bool PassManager::run(Module &M) { return PM->run(M); }
+ (gdb)
+ </pre>
+ 
+ <p>Once <tt>opt</tt> stops in the <tt>PassManager::run</tt> method, you are
+ now free to set breakpoints in your pass so that you can trace through execution
+ or do other standard debugging stuff.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="debugmisc">Miscellaneous Problems</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Once you have the basics down, there are a couple of problems that GDB has,
+ some with solutions, some without.</p>
+ 
+ <ul>
+ <li>Inline functions have bogus stack information.  In general, GDB does a
+ pretty good job getting stack traces and stepping through inline functions.
+ When a pass is dynamically loaded, however, it somehow completely loses this
+ capability.  The only solution I know of is to de-inline a function (move it
+ from the body of a class to a .cpp file).</li>
+ 
+ <li>Restarting the program breaks breakpoints.  After following the information
+ above, you have succeeded in getting some breakpoints planted in your pass.  Next
+ thing you know, you restart the program (i.e., you type '<tt>run</tt>' again),
+ and you start getting errors about breakpoints being unsettable.  The only way I
+ have found to "fix" this problem is to <tt>delete</tt> the breakpoints that are
+ already set in your pass, run the program, and re-set the breakpoints once
+ execution stops in <tt>PassManager::run</tt>.</li>
+ 
+ </ul>
+ 
+ <p>Hopefully these tips will help with common case debugging situations.  If
+ you'd like to contribute some tips of your own, just contact <a
+ href="mailto:sabre at nondot.org">Chris</a>.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <div class="doc_section">
+   <a name="future">Future extensions planned</a>
+ </div>
+ <!-- *********************************************************************** -->
+ 
+ <div class="doc_text">
+ 
+ <p>Although the LLVM Pass Infrastructure is very capable as it stands, and does
+ some nifty stuff, there are things we'd like to add in the future.  Here is
+ where we are going:</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+   <a name="SMP">Multithreaded LLVM</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Multiple CPU machines are becoming more common and compilation can never be
+ fast enough: obviously we should allow for a multithreaded compiler.  Because of
+ the semantics defined for passes above (specifically they cannot maintain state
+ across invocations of their <tt>run*</tt> methods), a nice clean way to
+ implement a multithreaded compiler would be for the <tt>PassManager</tt> class
+ to create multiple instances of each pass object, and allow the separate
+ instances to be hacking on different parts of the program at the same time.</p>
+ 
+ <p>This implementation would prevent each of the passes from having to implement
+ multithreaded constructs, requiring only the LLVM core to have locking in a few
+ places (for global resources).  Although this is a simple extension, we simply
+ haven't had time (or multiprocessor machines, thus a reason) to implement this.
+ Despite that, we have kept the LLVM passes SMP ready, and you should too.</p>
+ 
+ </div>
+ 
+ <!-- _______________________________________________________________________ -->
+ <div class="doc_subsubsection">
+ <a name="PassFunctionPass"><tt>ModulePass</tt>es requiring <tt>FunctionPass</tt>es</a>
+ </div>
+ 
+ <div class="doc_text">
+ 
+ <p>Currently it is illegal for a <a href="#ModulePass"><tt>ModulePass</tt></a>
+ to require a <a href="#FunctionPass"><tt>FunctionPass</tt></a>.  This is because
+ there is only one instance of the <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a> object ever created, thus nowhere
+ to store information for all of the functions in the program at the same time.
+ Although this has come up a couple of times before, this has always been worked
+ around by factoring one big complicated pass into a global and an
+ interprocedural part, both of which are distinct.  In the future, it would be
+ nice to support this directly.</p>
+ 
+ <p>Note that it is no problem for a <a
+ href="#FunctionPass"><tt>FunctionPass</tt></a> to require the results of a <a
+ href="#ModulePass"><tt>ModulePass</tt></a>, only the other way around.</p>
+ 
+ </div>
+ 
+ <!-- *********************************************************************** -->
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!" /></a>
+ 
+   <a href="mailto:sabre at nondot.org">Chris Lattner</a><br>
+   <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
+ </body>
+ </html>

Index: llvm-www/releases/1.8/docs/doxygen.cfg.in
diff -c /dev/null llvm-www/releases/1.8/docs/doxygen.cfg.in:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/doxygen.cfg.in	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,1230 ----
+ # Doxyfile 1.4.4
+ 
+ # This file describes the settings to be used by the documentation system
+ # doxygen (www.doxygen.org) for a project
+ #
+ # All text after a hash (#) is considered a comment and will be ignored
+ # The format is:
+ #       TAG = value [value, ...]
+ # For lists items can also be appended using:
+ #       TAG += value [value, ...]
+ # Values that contain spaces should be placed between quotes (" ")
+ 
+ #---------------------------------------------------------------------------
+ # Project related configuration options
+ #---------------------------------------------------------------------------
+ 
+ # The PROJECT_NAME tag is a single word (or a sequence of words surrounded 
+ # by quotes) that should identify the project.
+ 
+ PROJECT_NAME           = LLVM
+ 
+ # The PROJECT_NUMBER tag can be used to enter a project or revision number. 
+ # This could be handy for archiving the generated documentation or 
+ # if some version control system is used.
+ 
+ PROJECT_NUMBER         = @PACKAGE_VERSION@
+ 
+ # The OUTPUT_DIRECTORY tag is used to specify the (relative or absolute) 
+ # base path where the generated documentation will be put. 
+ # If a relative path is entered, it will be relative to the location 
+ # where doxygen was started. If left blank the current directory will be used.
+ 
+ OUTPUT_DIRECTORY       = @abs_top_builddir@/docs/doxygen
+ 
+ # If the CREATE_SUBDIRS tag is set to YES, then doxygen will create 
+ # 4096 sub-directories (in 2 levels) under the output directory of each output 
+ # format and will distribute the generated files over these directories. 
+ # Enabling this option can be useful when feeding doxygen a huge amount of 
+ # source files, where putting all generated files in the same directory would 
+ # otherwise cause performance problems for the file system.
+ 
+ CREATE_SUBDIRS         = NO
+ 
+ # The OUTPUT_LANGUAGE tag is used to specify the language in which all 
+ # documentation generated by doxygen is written. Doxygen will use this 
+ # information to generate all constant output in the proper language. 
+ # The default language is English, other supported languages are: 
+ # Brazilian, Catalan, Chinese, Chinese-Traditional, Croatian, Czech, Danish, 
+ # Dutch, Finnish, French, German, Greek, Hungarian, Italian, Japanese, 
+ # Japanese-en (Japanese with English messages), Korean, Korean-en, Norwegian, 
+ # Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovene, Spanish, 
+ # Swedish, and Ukrainian.
+ 
+ OUTPUT_LANGUAGE        = English
+ 
+ # This tag can be used to specify the encoding used in the generated output. 
+ # The encoding is not always determined by the language that is chosen, 
+ # but also whether or not the output is meant for Windows or non-Windows users. 
+ # In case there is a difference, setting the USE_WINDOWS_ENCODING tag to YES 
+ # forces the Windows encoding (this is the default for the Windows binary), 
+ # whereas setting the tag to NO uses a Unix-style encoding (the default for 
+ # all platforms other than Windows).
+ 
+ USE_WINDOWS_ENCODING   = NO
+ 
+ # If the BRIEF_MEMBER_DESC tag is set to YES (the default) Doxygen will 
+ # include brief member descriptions after the members that are listed in 
+ # the file and class documentation (similar to JavaDoc). 
+ # Set to NO to disable this.
+ 
+ BRIEF_MEMBER_DESC      = YES
+ 
+ # If the REPEAT_BRIEF tag is set to YES (the default) Doxygen will prepend 
+ # the brief description of a member or function before the detailed description. 
+ # Note: if both HIDE_UNDOC_MEMBERS and BRIEF_MEMBER_DESC are set to NO, the 
+ # brief descriptions will be completely suppressed.
+ 
+ REPEAT_BRIEF           = YES
+ 
+ # This tag implements a quasi-intelligent brief description abbreviator 
+ # that is used to form the text in various listings. Each string 
+ # in this list, if found as the leading text of the brief description, will be 
+ # stripped from the text and the result after processing the whole list, is 
+ # used as the annotated text. Otherwise, the brief description is used as-is. 
+ # If left blank, the following values are used ("$name" is automatically 
+ # replaced with the name of the entity): "The $name class" "The $name widget" 
+ # "The $name file" "is" "provides" "specifies" "contains" 
+ # "represents" "a" "an" "the"
+ 
+ ABBREVIATE_BRIEF       = 
+ 
+ # If the ALWAYS_DETAILED_SEC and REPEAT_BRIEF tags are both set to YES then 
+ # Doxygen will generate a detailed section even if there is only a brief 
+ # description.
+ 
+ ALWAYS_DETAILED_SEC    = NO
+ 
+ # If the INLINE_INHERITED_MEMB tag is set to YES, doxygen will show all 
+ # inherited members of a class in the documentation of that class as if those 
+ # members were ordinary class members. Constructors, destructors and assignment 
+ # operators of the base classes will not be shown.
+ 
+ INLINE_INHERITED_MEMB  = NO
+ 
+ # If the FULL_PATH_NAMES tag is set to YES then Doxygen will prepend the full 
+ # path before files name in the file list and in the header files. If set 
+ # to NO the shortest path that makes the file name unique will be used.
+ 
+ FULL_PATH_NAMES        = NO
+ 
+ # If the FULL_PATH_NAMES tag is set to YES then the STRIP_FROM_PATH tag 
+ # can be used to strip a user-defined part of the path. Stripping is 
+ # only done if one of the specified strings matches the left-hand part of 
+ # the path. The tag can be used to show relative paths in the file list. 
+ # If left blank the directory from which doxygen is run is used as the 
+ # path to strip.
+ 
+ STRIP_FROM_PATH        = ../..
+ 
+ # The STRIP_FROM_INC_PATH tag can be used to strip a user-defined part of 
+ # the path mentioned in the documentation of a class, which tells 
+ # the reader which header file to include in order to use a class. 
+ # If left blank only the name of the header file containing the class 
+ # definition is used. Otherwise one should specify the include paths that 
+ # are normally passed to the compiler using the -I flag.
+ 
+ STRIP_FROM_INC_PATH    = 
+ 
+ # If the SHORT_NAMES tag is set to YES, doxygen will generate much shorter 
+ # (but less readable) file names. This can be useful if your file system 
+ # doesn't support long names, as on DOS, Mac, or CD-ROM.
+ 
+ SHORT_NAMES            = NO
+ 
+ # If the JAVADOC_AUTOBRIEF tag is set to YES then Doxygen 
+ # will interpret the first line (until the first dot) of a JavaDoc-style 
+ # comment as the brief description. If set to NO, the JavaDoc 
+ # comments will behave just like the Qt-style comments (thus requiring an 
+ # explicit @brief command for a brief description).
+ 
+ JAVADOC_AUTOBRIEF      = NO
+ 
+ # The MULTILINE_CPP_IS_BRIEF tag can be set to YES to make Doxygen 
+ # treat a multi-line C++ special comment block (i.e. a block of //! or /// 
+ # comments) as a brief description. This used to be the default behaviour. 
+ # The new default is to treat a multi-line C++ comment block as a detailed 
+ # description. Set this tag to YES if you prefer the old behaviour instead.
+ 
+ MULTILINE_CPP_IS_BRIEF = NO
+ 
+ # If the DETAILS_AT_TOP tag is set to YES then Doxygen 
+ # will output the detailed description near the top, like JavaDoc.
+ # If set to NO, the detailed description appears after the member 
+ # documentation.
+ 
+ DETAILS_AT_TOP         = NO
+ 
+ # If the INHERIT_DOCS tag is set to YES (the default) then an undocumented 
+ # member inherits the documentation from any documented member that it 
+ # re-implements.
+ 
+ INHERIT_DOCS           = YES
+ 
+ # If member grouping is used in the documentation and the DISTRIBUTE_GROUP_DOC 
+ # tag is set to YES, then doxygen will reuse the documentation of the first 
+ # member in the group (if any) for the other members of the group. By default 
+ # all members of a group must be documented explicitly.
+ 
+ DISTRIBUTE_GROUP_DOC   = NO
+ 
+ # If the SEPARATE_MEMBER_PAGES tag is set to YES, then doxygen will produce 
+ # a new page for each member. If set to NO, the documentation of a member will 
+ # be part of the file/class/namespace that contains it.
+ 
+ SEPARATE_MEMBER_PAGES  = NO
+ 
+ # The TAB_SIZE tag can be used to set the number of spaces in a tab. 
+ # Doxygen uses this value to replace tabs by spaces in code fragments.
+ 
+ TAB_SIZE               = 2
+ 
+ # This tag can be used to specify a number of aliases that act 
+ # as commands in the documentation. An alias has the form "name=value". 
+ # For example adding "sideeffect=\par Side Effects:\n" will allow you to 
+ # put the command \sideeffect (or @sideeffect) in the documentation, which 
+ # will result in a user-defined paragraph with heading "Side Effects:". 
+ # You can put \n's in the value part of an alias to insert newlines.
+ 
+ ALIASES                = 
+ 
+ # Set the OPTIMIZE_OUTPUT_FOR_C tag to YES if your project consists of C 
+ # sources only. Doxygen will then generate output that is more tailored for C. 
+ # For instance, some of the names that are used will be different. The list 
+ # of all members will be omitted, etc.
+ 
+ OPTIMIZE_OUTPUT_FOR_C  = NO
+ 
+ # Set the OPTIMIZE_OUTPUT_JAVA tag to YES if your project consists of Java sources 
+ # only. Doxygen will then generate output that is more tailored for Java. 
+ # For instance, namespaces will be presented as packages, qualified scopes 
+ # will look different, etc.
+ 
+ OPTIMIZE_OUTPUT_JAVA   = NO
+ 
+ # Set the SUBGROUPING tag to YES (the default) to allow class member groups of 
+ # the same type (for instance a group of public functions) to be put as a 
+ # subgroup of that type (e.g. under the Public Functions section). Set it to 
+ # NO to prevent subgrouping. Alternatively, this can be done per class using 
+ # the \nosubgrouping command.
+ 
+ SUBGROUPING            = YES
+ 
+ #---------------------------------------------------------------------------
+ # Build related configuration options
+ #---------------------------------------------------------------------------
+ 
+ # If the EXTRACT_ALL tag is set to YES doxygen will assume all entities in 
+ # documentation are documented, even if no documentation was available. 
+ # Private class members and static file members will be hidden unless 
+ # the EXTRACT_PRIVATE and EXTRACT_STATIC tags are set to YES
+ 
+ EXTRACT_ALL            = YES
+ 
+ # If the EXTRACT_PRIVATE tag is set to YES all private members of a class 
+ # will be included in the documentation.
+ 
+ EXTRACT_PRIVATE        = NO
+ 
+ # If the EXTRACT_STATIC tag is set to YES all static members of a file 
+ # will be included in the documentation.
+ 
+ EXTRACT_STATIC         = YES
+ 
+ # If the EXTRACT_LOCAL_CLASSES tag is set to YES classes (and structs) 
+ # defined locally in source files will be included in the documentation. 
+ # If set to NO only classes defined in header files are included.
+ 
+ EXTRACT_LOCAL_CLASSES  = YES
+ 
+ # This flag is only useful for Objective-C code. When set to YES local 
+ # methods, which are defined in the implementation section but not in 
+ # the interface are included in the documentation. 
+ # If set to NO (the default) only methods in the interface are included.
+ 
+ EXTRACT_LOCAL_METHODS  = NO
+ 
+ # If the HIDE_UNDOC_MEMBERS tag is set to YES, Doxygen will hide all 
+ # undocumented members of documented classes, files or namespaces. 
+ # If set to NO (the default) these members will be included in the 
+ # various overviews, but no documentation section is generated. 
+ # This option has no effect if EXTRACT_ALL is enabled.
+ 
+ HIDE_UNDOC_MEMBERS     = NO
+ 
+ # If the HIDE_UNDOC_CLASSES tag is set to YES, Doxygen will hide all 
+ # undocumented classes that are normally visible in the class hierarchy. 
+ # If set to NO (the default) these classes will be included in the various 
+ # overviews. This option has no effect if EXTRACT_ALL is enabled.
+ 
+ HIDE_UNDOC_CLASSES     = NO
+ 
+ # If the HIDE_FRIEND_COMPOUNDS tag is set to YES, Doxygen will hide all 
+ # friend (class|struct|union) declarations. 
+ # If set to NO (the default) these declarations will be included in the 
+ # documentation.
+ 
+ HIDE_FRIEND_COMPOUNDS  = NO
+ 
+ # If the HIDE_IN_BODY_DOCS tag is set to YES, Doxygen will hide any 
+ # documentation blocks found inside the body of a function. 
+ # If set to NO (the default) these blocks will be appended to the 
+ # function's detailed documentation block.
+ 
+ HIDE_IN_BODY_DOCS      = NO
+ 
+ # The INTERNAL_DOCS tag determines if documentation 
+ # that is typed after a \internal command is included. If the tag is set 
+ # to NO (the default) then the documentation will be excluded. 
+ # Set it to YES to include the internal documentation.
+ 
+ INTERNAL_DOCS          = NO
+ 
+ # If the CASE_SENSE_NAMES tag is set to NO then Doxygen will only generate 
+ # file names in lower-case letters. If set to YES upper-case letters are also 
+ # allowed. This is useful if you have classes or files whose names only differ 
+ # in case and if your file system supports case sensitive file names. Windows 
+ # and Mac users are advised to set this option to NO.
+ 
+ CASE_SENSE_NAMES       = YES
+ 
+ # If the HIDE_SCOPE_NAMES tag is set to NO (the default) then Doxygen 
+ # will show members with their full class and namespace scopes in the 
+ # documentation. If set to YES the scope will be hidden.
+ 
+ HIDE_SCOPE_NAMES       = NO
+ 
+ # If the SHOW_INCLUDE_FILES tag is set to YES (the default) then Doxygen 
+ # will put a list of the files that are included by a file in the documentation 
+ # of that file.
+ 
+ SHOW_INCLUDE_FILES     = YES
+ 
+ # If the INLINE_INFO tag is set to YES (the default) then a tag [inline] 
+ # is inserted in the documentation for inline members.
+ 
+ INLINE_INFO            = YES
+ 
+ # If the SORT_MEMBER_DOCS tag is set to YES (the default) then doxygen 
+ # will sort the (detailed) documentation of file and class members 
+ # alphabetically by member name. If set to NO the members will appear in 
+ # declaration order.
+ 
+ SORT_MEMBER_DOCS       = YES
+ 
+ # If the SORT_BRIEF_DOCS tag is set to YES then doxygen will sort the 
+ # brief documentation of file, namespace and class members alphabetically 
+ # by member name. If set to NO (the default) the members will appear in 
+ # declaration order.
+ 
+ SORT_BRIEF_DOCS        = NO
+ 
+ # If the SORT_BY_SCOPE_NAME tag is set to YES, the class list will be 
+ # sorted by fully-qualified names, including namespaces. If set to 
+ # NO (the default), the class list will be sorted only by class name, 
+ # not including the namespace part. 
+ # Note: This option is not very useful if HIDE_SCOPE_NAMES is set to YES.
+ # Note: This option applies only to the class list, not to the 
+ # alphabetical list.
+ 
+ SORT_BY_SCOPE_NAME     = NO
+ 
+ # The GENERATE_TODOLIST tag can be used to enable (YES) or 
+ # disable (NO) the todo list. This list is created by putting \todo 
+ # commands in the documentation.
+ 
+ GENERATE_TODOLIST      = YES
+ 
+ # The GENERATE_TESTLIST tag can be used to enable (YES) or 
+ # disable (NO) the test list. This list is created by putting \test 
+ # commands in the documentation.
+ 
+ GENERATE_TESTLIST      = YES
+ 
+ # The GENERATE_BUGLIST tag can be used to enable (YES) or 
+ # disable (NO) the bug list. This list is created by putting \bug 
+ # commands in the documentation.
+ 
+ GENERATE_BUGLIST       = YES
+ 
+ # The GENERATE_DEPRECATEDLIST tag can be used to enable (YES) or 
+ # disable (NO) the deprecated list. This list is created by putting 
+ # \deprecated commands in the documentation.
+ 
+ GENERATE_DEPRECATEDLIST= YES
+ 
+ # The ENABLED_SECTIONS tag can be used to enable conditional 
+ # documentation sections, marked by \if sectionname ... \endif.
+ 
+ ENABLED_SECTIONS       = 
+ 
+ # The MAX_INITIALIZER_LINES tag determines the maximum number of lines 
+ # the initial value of a variable or define consists of for it to appear in 
+ # the documentation. If the initializer consists of more lines than specified 
+ # here it will be hidden. Use a value of 0 to hide initializers completely. 
+ # The appearance of the initializer of individual variables and defines in the 
+ # documentation can be controlled using \showinitializer or \hideinitializer 
+ # command in the documentation regardless of this setting.
+ 
+ MAX_INITIALIZER_LINES  = 30
+ 
+ # Set the SHOW_USED_FILES tag to NO to disable the list of files generated 
+ # at the bottom of the documentation of classes and structs. If set to YES the 
+ # list will mention the files that were used to generate the documentation.
+ 
+ SHOW_USED_FILES        = YES
+ 
+ # If the sources in your project are distributed over multiple directories 
+ # then setting the SHOW_DIRECTORIES tag to YES will show the directory hierarchy 
+ # in the documentation. The default is YES.
+ 
+ SHOW_DIRECTORIES       = YES
+ 
+ # The FILE_VERSION_FILTER tag can be used to specify a program or script that 
+ # doxygen should invoke to get the current version for each file (typically from the 
+ # version control system). Doxygen will invoke the program by executing (via 
+ # popen()) the command <command> <input-file>, where <command> is the value of 
+ # the FILE_VERSION_FILTER tag, and <input-file> is the name of an input file 
+ # provided by doxygen. Whatever the program writes to standard output 
+ # is used as the file version. See the manual for examples.
+ 
+ FILE_VERSION_FILTER    = 
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to warning and progress messages
+ #---------------------------------------------------------------------------
+ 
+ # The QUIET tag can be used to turn on/off the messages that are generated 
+ # by doxygen. Possible values are YES and NO. If left blank NO is used.
+ 
+ QUIET                  = NO
+ 
+ # The WARNINGS tag can be used to turn on/off the warning messages that are 
+ # generated by doxygen. Possible values are YES and NO. If left blank 
+ # NO is used.
+ 
+ WARNINGS               = NO
+ 
+ # If WARN_IF_UNDOCUMENTED is set to YES, then doxygen will generate warnings 
+ # for undocumented members. If EXTRACT_ALL is set to YES then this flag will 
+ # automatically be disabled.
+ 
+ WARN_IF_UNDOCUMENTED   = NO
+ 
+ # If WARN_IF_DOC_ERROR is set to YES, doxygen will generate warnings for 
+ # potential errors in the documentation, such as not documenting some 
+ # parameters in a documented function, or documenting parameters that 
+ # don't exist or using markup commands wrongly.
+ 
+ WARN_IF_DOC_ERROR      = YES
+ 
+ # The WARN_NO_PARAMDOC option can be enabled to get warnings for 
+ # functions that are documented, but have no documentation for their parameters 
+ # or return value. If set to NO (the default) doxygen will only warn about 
+ # wrong or incomplete parameter documentation, but not about the absence of 
+ # documentation.
+ 
+ WARN_NO_PARAMDOC       = NO
+ 
+ # The WARN_FORMAT tag determines the format of the warning messages that 
+ # doxygen can produce. The string should contain the $file, $line, and $text 
+ # tags, which will be replaced by the file and line number from which the 
+ # warning originated and the warning text. Optionally the format may contain 
+ # $version, which will be replaced by the version of the file (if it could 
+ # be obtained via FILE_VERSION_FILTER)
+ 
+ WARN_FORMAT            = 
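+ 
+ # Example (illustrative; this is the format doxygen documents as its 
+ # default). It prints the file, the line number, and the warning text in 
+ # the compiler-like form that most editors can parse: 
+ #   WARN_FORMAT = "$file:$line: $text"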
+ 
+ # The WARN_LOGFILE tag can be used to specify a file to which warning 
+ # and error messages should be written. If left blank the output is written 
+ # to stderr.
+ 
+ WARN_LOGFILE           = 
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the input files
+ #---------------------------------------------------------------------------
+ 
+ # The INPUT tag can be used to specify the files and/or directories that contain 
+ # documented source files. You may enter file names like "myfile.cpp" or 
+ # directories like "/usr/src/myproject". Separate the files or directories 
+ # with spaces.
+ 
+ INPUT                  = @abs_top_srcdir@/include \
+                          @abs_top_srcdir@/lib \
+                          @abs_top_srcdir@/docs/doxygen.intro
+ 
+ # If the value of the INPUT tag contains directories, you can use the 
+ # FILE_PATTERNS tag to specify one or more wildcard patterns (like *.cpp 
+ # and *.h) to filter out the source-files in the directories. If left 
+ # blank the following patterns are tested: 
+ # *.c *.cc *.cxx *.cpp *.c++ *.java *.ii *.ixx *.ipp *.i++ *.inl *.h *.hh *.hxx 
+ # *.hpp *.h++ *.idl *.odl *.cs *.php *.php3 *.inc *.m *.mm
+ 
+ FILE_PATTERNS          = 
+ 
+ # The RECURSIVE tag can be used to specify whether or not subdirectories 
+ # should be searched for input files as well. Possible values are YES and NO. 
+ # If left blank NO is used.
+ 
+ RECURSIVE              = YES
+ 
+ # The EXCLUDE tag can be used to specify files and/or directories that should 
+ # be excluded from the INPUT source files. This way you can easily exclude a 
+ # subdirectory from a directory tree whose root is specified with the INPUT tag.
+ 
+ EXCLUDE                = 
+ 
+ # The EXCLUDE_SYMLINKS tag can be used to select whether or not files or 
+ # directories that are symbolic links (a Unix filesystem feature) are excluded 
+ # from the input.
+ 
+ EXCLUDE_SYMLINKS       = NO
+ 
+ # If the value of the INPUT tag contains directories, you can use the 
+ # EXCLUDE_PATTERNS tag to specify one or more wildcard patterns to exclude 
+ # certain files from those directories. Note that the wildcards are matched 
+ # against the file with absolute path, so to exclude all test directories 
+ # for example use the pattern */test/*
+ 
+ EXCLUDE_PATTERNS       = 
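+ 
+ # Example (a sketch; the directory names are hypothetical). Several 
+ # patterns are separated by spaces, and each is matched against the 
+ # absolute path of a file: 
+ #   EXCLUDE_PATTERNS = */test/* */CVS/*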
+ 
+ # The EXAMPLE_PATH tag can be used to specify one or more files or 
+ # directories that contain example code fragments that are included (see 
+ # the \include command).
+ 
+ EXAMPLE_PATH           = @abs_top_srcdir@/examples
+ 
+ # If the value of the EXAMPLE_PATH tag contains directories, you can use the 
+ # EXAMPLE_PATTERNS tag to specify one or more wildcard patterns (like *.cpp 
+ # and *.h) to filter out the source-files in the directories. If left 
+ # blank all files are included.
+ 
+ EXAMPLE_PATTERNS       = 
+ 
+ # If the EXAMPLE_RECURSIVE tag is set to YES then subdirectories will be 
+ # searched for input files to be used with the \include or \dontinclude 
+ # commands irrespective of the value of the RECURSIVE tag. 
+ # Possible values are YES and NO. If left blank NO is used.
+ 
+ EXAMPLE_RECURSIVE      = YES
+ 
+ # The IMAGE_PATH tag can be used to specify one or more files or 
+ # directories that contain images that are included in the documentation (see 
+ # the \image command).
+ 
+ IMAGE_PATH             = @abs_top_srcdir@/docs/img
+ 
+ # The INPUT_FILTER tag can be used to specify a program that doxygen should 
+ # invoke to filter for each input file. Doxygen will invoke the filter program 
+ # by executing (via popen()) the command <filter> <input-file>, where <filter> 
+ # is the value of the INPUT_FILTER tag, and <input-file> is the name of an 
+ # input file. Doxygen will then use the output that the filter program writes 
+ # to standard output.  If FILTER_PATTERNS is specified, this tag will be 
+ # ignored.
+ 
+ INPUT_FILTER           = 
+ 
+ # The FILTER_PATTERNS tag can be used to specify filters on a per file pattern 
+ # basis.  Doxygen will compare the file name with each pattern and apply the 
+ # filter if there is a match.  The filters are a list of the form: 
+ # pattern=filter (like *.cpp=my_cpp_filter). See INPUT_FILTER for further 
+ # info on how filters are used. If FILTER_PATTERNS is empty, INPUT_FILTER 
+ # is applied to all files.
+ 
+ FILTER_PATTERNS        = 
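+ 
+ # Example (a sketch; both filter program names are hypothetical). Each 
+ # entry pairs a wildcard pattern with the filter applied to matching files, 
+ # and entries are separated by spaces: 
+ #   FILTER_PATTERNS = *.cpp=my_cpp_filter *.h=my_header_filter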
+ 
+ # If the FILTER_SOURCE_FILES tag is set to YES, the input filter (if set using 
+ # INPUT_FILTER) will be used to filter the input files when producing source 
+ # files to browse (i.e. when SOURCE_BROWSER is set to YES).
+ 
+ FILTER_SOURCE_FILES    = NO
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to source browsing
+ #---------------------------------------------------------------------------
+ 
+ # If the SOURCE_BROWSER tag is set to YES then a list of source files will 
+ # be generated. Documented entities will be cross-referenced with these sources. 
+ # Note: To get rid of all source code in the generated output, make sure also 
+ # VERBATIM_HEADERS is set to NO.
+ 
+ SOURCE_BROWSER         = YES
+ 
+ # Setting the INLINE_SOURCES tag to YES will include the body 
+ # of functions and classes directly in the documentation.
+ 
+ INLINE_SOURCES         = NO
+ 
+ # Setting the STRIP_CODE_COMMENTS tag to YES (the default) will instruct 
+ # doxygen to hide any special comment blocks from generated source code 
+ # fragments. Normal C and C++ comments will always remain visible.
+ 
+ STRIP_CODE_COMMENTS    = NO
+ 
+ # If the REFERENCED_BY_RELATION tag is set to YES (the default) 
+ # then for each documented function all documented 
+ # functions referencing it will be listed.
+ 
+ REFERENCED_BY_RELATION = YES
+ 
+ # If the REFERENCES_RELATION tag is set to YES (the default) 
+ # then for each documented function all documented entities 
+ # called/used by that function will be listed.
+ 
+ REFERENCES_RELATION    = YES
+ 
+ # If the USE_HTAGS tag is set to YES then the references to source code 
+ # will point to the HTML generated by the htags(1) tool instead of doxygen's 
+ # built-in source browser. The htags tool is part of GNU's global source 
+ # tagging system (see http://www.gnu.org/software/global/global.html). You 
+ # will need version 4.8.6 or higher.
+ 
+ USE_HTAGS              = NO
+ 
+ # If the VERBATIM_HEADERS tag is set to YES (the default) then Doxygen 
+ # will generate a verbatim copy of the header file for each class for 
+ # which an include is specified. Set to NO to disable this.
+ 
+ VERBATIM_HEADERS       = YES
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the alphabetical class index
+ #---------------------------------------------------------------------------
+ 
+ # If the ALPHABETICAL_INDEX tag is set to YES, an alphabetical index 
+ # of all compounds will be generated. Enable this if the project 
+ # contains a lot of classes, structs, unions or interfaces.
+ 
+ ALPHABETICAL_INDEX     = YES
+ 
+ # If the alphabetical index is enabled (see ALPHABETICAL_INDEX) then 
+ # the COLS_IN_ALPHA_INDEX tag can be used to specify the number of columns 
+ # in which this list will be split (can be a number in the range [1..20])
+ 
+ COLS_IN_ALPHA_INDEX    = 4
+ 
+ # In case all classes in a project start with a common prefix, all 
+ # classes will be put under the same header in the alphabetical index. 
+ # The IGNORE_PREFIX tag can be used to specify one or more prefixes that 
+ # should be ignored while generating the index headers.
+ 
+ IGNORE_PREFIX          = llvm::
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the HTML output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_HTML tag is set to YES (the default) Doxygen will 
+ # generate HTML output.
+ 
+ GENERATE_HTML          = YES
+ 
+ # The HTML_OUTPUT tag is used to specify where the HTML docs will be put. 
+ # If a relative path is entered the value of OUTPUT_DIRECTORY will be 
+ # put in front of it. If left blank `html' will be used as the default path.
+ 
+ HTML_OUTPUT            = html
+ 
+ # The HTML_FILE_EXTENSION tag can be used to specify the file extension for 
+ # each generated HTML page (for example: .htm,.php,.asp). If it is left blank 
+ # doxygen will generate files with .html extension.
+ 
+ HTML_FILE_EXTENSION    = .html
+ 
+ # The HTML_HEADER tag can be used to specify a personal HTML header for 
+ # each generated HTML page. If it is left blank doxygen will generate a 
+ # standard header.
+ 
+ HTML_HEADER            = @abs_top_srcdir@/docs/doxygen.header
+ 
+ # The HTML_FOOTER tag can be used to specify a personal HTML footer for 
+ # each generated HTML page. If it is left blank doxygen will generate a 
+ # standard footer.
+ 
+ HTML_FOOTER            = @abs_top_srcdir@/docs/doxygen.footer
+ 
+ # The HTML_STYLESHEET tag can be used to specify a user-defined cascading 
+ # style sheet that is used by each HTML page. It can be used to 
+ # fine-tune the look of the HTML output. If the tag is left blank doxygen 
+ # will generate a default style sheet. Note that doxygen will try to copy 
+ # the style sheet file to the HTML output directory, so don't put your own 
+ # stylesheet in the HTML output directory as well, or it will be erased!
+ 
+ HTML_STYLESHEET        = @abs_top_srcdir@/docs/doxygen.css
+ 
+ # If the HTML_ALIGN_MEMBERS tag is set to YES, the members of classes, 
+ # files or namespaces will be aligned in HTML using tables. If set to 
+ # NO a bullet list will be used.
+ 
+ HTML_ALIGN_MEMBERS     = YES
+ 
+ # If the GENERATE_HTMLHELP tag is set to YES, additional index files 
+ # will be generated that can be used as input for tools like the 
+ # Microsoft HTML help workshop to generate a compressed HTML help file (.chm) 
+ # of the generated HTML documentation.
+ 
+ GENERATE_HTMLHELP      = NO
+ 
+ # If the GENERATE_HTMLHELP tag is set to YES, the CHM_FILE tag can 
+ # be used to specify the file name of the resulting .chm file. You 
+ # can add a path in front of the file if the result should not be 
+ # written to the html output directory.
+ 
+ CHM_FILE               = 
+ 
+ # If the GENERATE_HTMLHELP tag is set to YES, the HHC_LOCATION tag can 
+ # be used to specify the location (absolute path including file name) of 
+ # the HTML help compiler (hhc.exe). If non-empty doxygen will try to run 
+ # the HTML help compiler on the generated index.hhp.
+ 
+ HHC_LOCATION           = 
+ 
+ # If the GENERATE_HTMLHELP tag is set to YES, the GENERATE_CHI flag 
+ # controls whether a separate .chi index file is generated (YES) or whether 
+ # it is included in the master .chm file (NO).
+ 
+ GENERATE_CHI           = NO
+ 
+ # If the GENERATE_HTMLHELP tag is set to YES, the BINARY_TOC flag 
+ # controls whether a binary table of contents is generated (YES) or a 
+ # normal table of contents (NO) in the .chm file.
+ 
+ BINARY_TOC             = NO
+ 
+ # The TOC_EXPAND flag can be set to YES to add extra items for group members 
+ # to the contents of the HTML help documentation and to the tree view.
+ 
+ TOC_EXPAND             = NO
+ 
+ # The DISABLE_INDEX tag can be used to turn on/off the condensed index at 
+ # top of each HTML page. The value NO (the default) enables the index and 
+ # the value YES disables it.
+ 
+ DISABLE_INDEX          = NO
+ 
+ # This tag can be used to set the number of enum values (range [1..20]) 
+ # that doxygen will group on one line in the generated HTML documentation.
+ 
+ ENUM_VALUES_PER_LINE   = 4
+ 
+ # If the GENERATE_TREEVIEW tag is set to YES, a side panel will be
+ # generated containing a tree-like index structure (just like the one that 
+ # is generated for HTML Help). For this to work a browser that supports 
+ # JavaScript, DHTML, CSS and frames is required (for instance Mozilla 1.0+, 
+ # Netscape 6.0+, Internet Explorer 5.0+, or Konqueror). Windows users are 
+ # probably better off using the HTML help feature.
+ 
+ GENERATE_TREEVIEW      = NO
+ 
+ # If the treeview is enabled (see GENERATE_TREEVIEW) then this tag can be 
+ # used to set the initial width (in pixels) of the frame in which the tree 
+ # is shown.
+ 
+ TREEVIEW_WIDTH         = 250
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the LaTeX output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_LATEX tag is set to YES (the default) Doxygen will 
+ # generate LaTeX output.
+ 
+ GENERATE_LATEX         = NO
+ 
+ # The LATEX_OUTPUT tag is used to specify where the LaTeX docs will be put. 
+ # If a relative path is entered the value of OUTPUT_DIRECTORY will be 
+ # put in front of it. If left blank `latex' will be used as the default path.
+ 
+ LATEX_OUTPUT           = 
+ 
+ # The LATEX_CMD_NAME tag can be used to specify the LaTeX command name to be 
+ # invoked. If left blank `latex' will be used as the default command name.
+ 
+ LATEX_CMD_NAME         = latex
+ 
+ # The MAKEINDEX_CMD_NAME tag can be used to specify the command name to 
+ # generate the index for LaTeX. If left blank `makeindex' will be used as the 
+ # default command name.
+ 
+ MAKEINDEX_CMD_NAME     = makeindex
+ 
+ # If the COMPACT_LATEX tag is set to YES Doxygen generates more compact 
+ # LaTeX documents. This may be useful for small projects and may help to 
+ # save some trees in general.
+ 
+ COMPACT_LATEX          = NO
+ 
+ # The PAPER_TYPE tag can be used to set the paper type that is used 
+ # by the printer. Possible values are: a4, a4wide, letter, legal and 
+ # executive. If left blank a4wide will be used.
+ 
+ PAPER_TYPE             = letter
+ 
+ # The EXTRA_PACKAGES tag can be used to specify one or more names of LaTeX 
+ # packages that should be included in the LaTeX output.
+ 
+ EXTRA_PACKAGES         = 
+ 
+ # The LATEX_HEADER tag can be used to specify a personal LaTeX header for 
+ # the generated latex document. The header should contain everything until 
+ # the first chapter. If it is left blank doxygen will generate a 
+ # standard header. Notice: only use this tag if you know what you are doing!
+ 
+ LATEX_HEADER           = 
+ 
+ # If the PDF_HYPERLINKS tag is set to YES, the LaTeX that is generated 
+ # is prepared for conversion to pdf (using ps2pdf). The pdf file will 
+ # contain links (just like the HTML output) instead of page references. 
+ # This makes the output suitable for online browsing using a pdf viewer.
+ 
+ PDF_HYPERLINKS         = NO
+ 
+ # If the USE_PDFLATEX tag is set to YES, pdflatex will be used instead of 
+ # plain latex in the generated Makefile. Set this option to YES to get a 
+ # higher quality PDF documentation.
+ 
+ USE_PDFLATEX           = NO
+ 
+ # If the LATEX_BATCHMODE tag is set to YES, doxygen will add the \batchmode 
+ # command to the generated LaTeX files. This will instruct LaTeX to keep 
+ # running if errors occur, instead of asking the user for help. 
+ # This option is also used when generating formulas in HTML.
+ 
+ LATEX_BATCHMODE        = NO
+ 
+ # If LATEX_HIDE_INDICES is set to YES then doxygen will not 
+ # include the index chapters (such as File Index, Compound Index, etc.) 
+ # in the output.
+ 
+ LATEX_HIDE_INDICES     = NO
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the RTF output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_RTF tag is set to YES Doxygen will generate RTF output. 
+ # The RTF output is optimized for Word 97 and may not look very pretty with 
+ # other RTF readers or editors.
+ 
+ GENERATE_RTF           = NO
+ 
+ # The RTF_OUTPUT tag is used to specify where the RTF docs will be put. 
+ # If a relative path is entered the value of OUTPUT_DIRECTORY will be 
+ # put in front of it. If left blank `rtf' will be used as the default path.
+ 
+ RTF_OUTPUT             = 
+ 
+ # If the COMPACT_RTF tag is set to YES Doxygen generates more compact 
+ # RTF documents. This may be useful for small projects and may help to 
+ # save some trees in general.
+ 
+ COMPACT_RTF            = NO
+ 
+ # If the RTF_HYPERLINKS tag is set to YES, the RTF that is generated 
+ # will contain hyperlink fields. The RTF file will 
+ # contain links (just like the HTML output) instead of page references. 
+ # This makes the output suitable for online browsing using WORD or other 
+ # programs which support those fields. 
+ # Note: wordpad (write) and others do not support links.
+ 
+ RTF_HYPERLINKS         = NO
+ 
+ # Load stylesheet definitions from file. Syntax is similar to doxygen's 
+ # config file, i.e. a series of assignments. You only have to provide 
+ # replacements, missing definitions are set to their default value.
+ 
+ RTF_STYLESHEET_FILE    = 
+ 
+ # Set optional variables used in the generation of an rtf document. 
+ # Syntax is similar to doxygen's config file.
+ 
+ RTF_EXTENSIONS_FILE    = 
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the man page output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_MAN tag is set to YES (the default) Doxygen will 
+ # generate man pages.
+ 
+ GENERATE_MAN           = NO
+ 
+ # The MAN_OUTPUT tag is used to specify where the man pages will be put. 
+ # If a relative path is entered the value of OUTPUT_DIRECTORY will be 
+ # put in front of it. If left blank `man' will be used as the default path.
+ 
+ MAN_OUTPUT             = 
+ 
+ # The MAN_EXTENSION tag determines the extension that is added to 
+ # the generated man pages (default is the subroutine's section .3)
+ 
+ MAN_EXTENSION          = 
+ 
+ # If the MAN_LINKS tag is set to YES and Doxygen generates man output, 
+ # then it will generate one additional man file for each entity 
+ # documented in the real man page(s). These additional files 
+ # only source the real man page, but without them the man command 
+ # would be unable to find the correct page. The default is NO.
+ 
+ MAN_LINKS              = NO
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the XML output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_XML tag is set to YES Doxygen will 
+ # generate an XML file that captures the structure of 
+ # the code including all documentation.
+ 
+ GENERATE_XML           = NO
+ 
+ # The XML_OUTPUT tag is used to specify where the XML pages will be put. 
+ # If a relative path is entered the value of OUTPUT_DIRECTORY will be 
+ # put in front of it. If left blank `xml' will be used as the default path.
+ 
+ XML_OUTPUT             = xml
+ 
+ # The XML_SCHEMA tag can be used to specify an XML schema, 
+ # which can be used by a validating XML parser to check the 
+ # syntax of the XML files.
+ 
+ XML_SCHEMA             = 
+ 
+ # The XML_DTD tag can be used to specify an XML DTD, 
+ # which can be used by a validating XML parser to check the 
+ # syntax of the XML files.
+ 
+ XML_DTD                = 
+ 
+ # If the XML_PROGRAMLISTING tag is set to YES Doxygen will 
+ # dump the program listings (including syntax highlighting 
+ # and cross-referencing information) to the XML output. Note that 
+ # enabling this will significantly increase the size of the XML output.
+ 
+ XML_PROGRAMLISTING     = YES
+ 
+ #---------------------------------------------------------------------------
+ # configuration options for the AutoGen Definitions output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_AUTOGEN_DEF tag is set to YES Doxygen will 
+ # generate an AutoGen Definitions (see autogen.sf.net) file 
+ # that captures the structure of the code including all 
+ # documentation. Note that this feature is still experimental 
+ # and incomplete at the moment.
+ 
+ GENERATE_AUTOGEN_DEF   = NO
+ 
+ #---------------------------------------------------------------------------
+ # configuration options related to the Perl module output
+ #---------------------------------------------------------------------------
+ 
+ # If the GENERATE_PERLMOD tag is set to YES Doxygen will 
+ # generate a Perl module file that captures the structure of 
+ # the code including all documentation. Note that this 
+ # feature is still experimental and incomplete at the 
+ # moment.
+ 
+ GENERATE_PERLMOD       = NO
+ 
+ # If the PERLMOD_LATEX tag is set to YES Doxygen will generate 
+ # the necessary Makefile rules, Perl scripts and LaTeX code to be able 
+ # to generate PDF and DVI output from the Perl module output.
+ 
+ PERLMOD_LATEX          = NO
+ 
+ # If the PERLMOD_PRETTY tag is set to YES the Perl module output will be 
+ # nicely formatted so it can be parsed by a human reader.  This is useful 
+ # if you want to understand what is going on.  On the other hand, if this 
+ # tag is set to NO the size of the Perl module output will be much smaller 
+ # and Perl will parse it just the same.
+ 
+ PERLMOD_PRETTY         = YES
+ 
+ # The names of the make variables in the generated doxyrules.make file 
+ # are prefixed with the string contained in PERLMOD_MAKEVAR_PREFIX. 
+ # This is useful so different doxyrules.make files included by the same 
+ # Makefile don't overwrite each other's variables.
+ 
+ PERLMOD_MAKEVAR_PREFIX = 
+ 
+ #---------------------------------------------------------------------------
+ # Configuration options related to the preprocessor   
+ #---------------------------------------------------------------------------
+ 
+ # If the ENABLE_PREPROCESSING tag is set to YES (the default) Doxygen will 
+ # evaluate all C-preprocessor directives found in the sources and include 
+ # files.
+ 
+ ENABLE_PREPROCESSING   = YES
+ 
+ # If the MACRO_EXPANSION tag is set to YES Doxygen will expand all macro 
+ # names in the source code. If set to NO (the default) only conditional 
+ # compilation will be performed. Macro expansion can be done in a controlled 
+ # way by setting EXPAND_ONLY_PREDEF to YES.
+ 
+ MACRO_EXPANSION        = NO
+ 
+ # If the EXPAND_ONLY_PREDEF and MACRO_EXPANSION tags are both set to YES 
+ # then the macro expansion is limited to the macros specified with the 
+ # PREDEFINED and EXPAND_AS_DEFINED tags.
+ 
+ EXPAND_ONLY_PREDEF     = NO
+ 
+ # If the SEARCH_INCLUDES tag is set to YES (the default) the include files 
+ # in the INCLUDE_PATH (see below) will be searched if a #include is found.
+ 
+ SEARCH_INCLUDES        = YES
+ 
+ # The INCLUDE_PATH tag can be used to specify one or more directories that 
+ # contain include files that are not input files but should be processed by 
+ # the preprocessor.
+ 
+ INCLUDE_PATH           = ../include
+ 
+ # You can use the INCLUDE_FILE_PATTERNS tag to specify one or more wildcard 
+ # patterns (like *.h and *.hpp) to filter out the header-files in the 
+ # directories. If left blank, the patterns specified with FILE_PATTERNS will 
+ # be used.
+ 
+ INCLUDE_FILE_PATTERNS  = 
+ 
+ # The PREDEFINED tag can be used to specify one or more macro names that 
+ # are defined before the preprocessor is started (similar to the -D option of 
+ # gcc). The argument of the tag is a list of macros of the form: name 
+ # or name=definition (no spaces). If the definition and the = are 
+ # omitted =1 is assumed. To prevent a macro definition from being 
+ # undefined via #undef or recursively expanded use the := operator 
+ # instead of the = operator.
+ 
+ PREDEFINED             = 
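+ 
+ # Example (a sketch; all three macro names are hypothetical). The first 
+ # macro is defined as 1, the second as 2, and the third uses := so that 
+ # it cannot be undefined via #undef or recursively expanded: 
+ #   PREDEFINED = MY_FEATURE MY_LEVEL=2 MY_API:=extern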
+ 
+ # If the MACRO_EXPANSION and EXPAND_ONLY_PREDEF tags are set to YES then 
+ # this tag can be used to specify a list of macro names that should be expanded. 
+ # The macro definition that is found in the sources will be used. 
+ # Use the PREDEFINED tag if you want to use a different macro definition.
+ 
+ EXPAND_AS_DEFINED      = 
+ 
+ # If the SKIP_FUNCTION_MACROS tag is set to YES (the default) then 
+ # doxygen's preprocessor will remove all function-like macros that are alone 
+ # on a line, have an all uppercase name, and do not end with a semicolon. Such 
+ # function macros are typically used for boiler-plate code, and will confuse 
+ # the parser if not removed.
+ 
+ SKIP_FUNCTION_MACROS   = YES
+ 
+ #---------------------------------------------------------------------------
+ # Configuration options related to external references   
+ #---------------------------------------------------------------------------
+ 
+ # The TAGFILES option can be used to specify one or more tagfiles. 
+ # Optionally an initial location of the external documentation 
+ # can be added for each tagfile. The format of a tag file without 
+ # this location is as follows: 
+ #   TAGFILES = file1 file2 ... 
+ # Adding location for the tag files is done as follows: 
+ #   TAGFILES = file1=loc1 "file2 = loc2" ... 
+ # where "loc1" and "loc2" can be relative or absolute paths or 
+ # URLs. If a location is present for each tag, the installdox tool 
+ # does not have to be run to correct the links.
+ # Note that each tag file must have a unique name
+ # (where the name does NOT include the path)
+ # If a tag file is not located in the directory in which doxygen 
+ # is run, you must also specify the path to the tagfile here.
+ 
+ TAGFILES               = 
+ 
+ # When a file name is specified after GENERATE_TAGFILE, doxygen will create 
+ # a tag file that is based on the input files it reads.
+ 
+ GENERATE_TAGFILE       = 
+ 
+ # If the ALLEXTERNALS tag is set to YES all external classes will be listed 
+ # in the class index. If set to NO only the inherited external classes 
+ # will be listed.
+ 
+ ALLEXTERNALS           = YES
+ 
+ # If the EXTERNAL_GROUPS tag is set to YES all external groups will be listed 
+ # in the modules index. If set to NO, only the current project's groups will 
+ # be listed.
+ 
+ EXTERNAL_GROUPS        = YES
+ 
+ # The PERL_PATH should be the absolute path and name of the perl script 
+ # interpreter (i.e. the result of `which perl').
+ 
+ PERL_PATH              = 
+ 
+ #---------------------------------------------------------------------------
+ # Configuration options related to the dot tool   
+ #---------------------------------------------------------------------------
+ 
+ # If the CLASS_DIAGRAMS tag is set to YES (the default) Doxygen will 
+ # generate an inheritance diagram (in HTML, RTF and LaTeX) for classes with base 
+ # or super classes. Setting the tag to NO turns the diagrams off. Note that 
+ # this option is superseded by the HAVE_DOT option below. This is only a 
+ # fallback. It is recommended to install and use dot, since it yields more 
+ # powerful graphs.
+ 
+ CLASS_DIAGRAMS         = YES
+ 
+ # If set to YES, the inheritance and collaboration graphs will hide 
+ # inheritance and usage relations if the target is undocumented 
+ # or is not a class.
+ 
+ HIDE_UNDOC_RELATIONS   = NO
+ 
+ # If you set the HAVE_DOT tag to YES then doxygen will assume the dot tool is 
+ # available from the path. This tool is part of Graphviz, a graph visualization 
+ # toolkit from AT&T and Lucent Bell Labs. The other options in this section 
+ # have no effect if this option is set to NO (the default)
+ 
+ HAVE_DOT               = YES
+ 
+ # If the CLASS_GRAPH and HAVE_DOT tags are set to YES then doxygen 
+ # will generate a graph for each documented class showing the direct and 
+ # indirect inheritance relations. Setting this tag to YES will force the 
+ # CLASS_DIAGRAMS tag to NO.
+ 
+ CLASS_GRAPH            = YES
+ 
+ # If the COLLABORATION_GRAPH and HAVE_DOT tags are set to YES then doxygen 
+ # will generate a graph for each documented class showing the direct and 
+ # indirect implementation dependencies (inheritance, containment, and 
+ # class references variables) of the class with other documented classes.
+ 
+ COLLABORATION_GRAPH    = YES
+ 
+ # If the GROUP_GRAPHS and HAVE_DOT tags are set to YES then doxygen 
+ # will generate a graph for groups, showing the direct group dependencies.
+ 
+ GROUP_GRAPHS           = YES
+ 
+ # If the UML_LOOK tag is set to YES doxygen will generate inheritance and 
+ # collaboration diagrams in a style similar to the OMG's Unified Modeling 
+ # Language.
+ 
+ UML_LOOK               = NO
+ 
+ # If set to YES, the inheritance and collaboration graphs will show the 
+ # relations between templates and their instances.
+ 
+ TEMPLATE_RELATIONS     = YES
+ 
+ # If the ENABLE_PREPROCESSING, SEARCH_INCLUDES, INCLUDE_GRAPH, and HAVE_DOT 
+ # tags are set to YES then doxygen will generate a graph for each documented 
+ # file showing the direct and indirect include dependencies of the file with 
+ # other documented files.
+ 
+ INCLUDE_GRAPH          = YES
+ 
+ # If the ENABLE_PREPROCESSING, SEARCH_INCLUDES, INCLUDED_BY_GRAPH, and 
+ # HAVE_DOT tags are set to YES then doxygen will generate a graph for each 
+ # documented header file showing the documented files that directly or 
+ # indirectly include this file.
+ 
+ INCLUDED_BY_GRAPH      = YES
+ 
+ # If the CALL_GRAPH and HAVE_DOT tags are set to YES then doxygen will 
+ # generate a call dependency graph for every global function or class method. 
+ # Note that enabling this option will significantly increase the time of a run. 
+ # So in most cases it will be better to enable call graphs for selected 
+ # functions only using the \callgraph command.
+ 
+ CALL_GRAPH             = NO
+ 
+ # If the GRAPHICAL_HIERARCHY and HAVE_DOT tags are set to YES then doxygen 
+ # will generate a graphical hierarchy of all classes instead of a textual one.
+ 
+ GRAPHICAL_HIERARCHY    = YES
+ 
+ # If the DIRECTORY_GRAPH, SHOW_DIRECTORIES and HAVE_DOT tags are set to YES 
+ # then doxygen will show the dependencies a directory has on other directories 
+ # in a graphical way. The dependency relations are determined by the #include
+ # relations between the files in the directories.
+ 
+ DIRECTORY_GRAPH        = YES
+ 
+ # The DOT_IMAGE_FORMAT tag can be used to set the image format of the images 
+ # generated by dot. Possible values are png, jpg, or gif
+ # If left blank png will be used.
+ 
+ DOT_IMAGE_FORMAT       = png
+ 
+ # The tag DOT_PATH can be used to specify the path where the dot tool can be 
+ # found. If left blank, it is assumed the dot tool can be found in the path.
+ 
+ DOT_PATH               = @DOT@
+ 
+ # The DOTFILE_DIRS tag can be used to specify one or more directories that 
+ # contain dot files that are included in the documentation (see the 
+ # \dotfile command).
+ 
+ DOTFILE_DIRS           = 
+ 
+ # The MAX_DOT_GRAPH_WIDTH tag can be used to set the maximum allowed width 
+ # (in pixels) of the graphs generated by dot. If a graph becomes larger than 
+ # this value, doxygen will try to truncate the graph, so that it fits within 
+ # the specified constraint. Beware that most browsers cannot cope with very 
+ # large images.
+ 
+ MAX_DOT_GRAPH_WIDTH    = 1024
+ 
+ # The MAX_DOT_GRAPH_HEIGHT tag can be used to set the maximum allowed height 
+ # (in pixels) of the graphs generated by dot. If a graph becomes larger than 
+ # this value, doxygen will try to truncate the graph, so that it fits within 
+ # the specified constraint. Beware that most browsers cannot cope with very 
+ # large images.
+ 
+ MAX_DOT_GRAPH_HEIGHT   = 1024
+ 
+ # The MAX_DOT_GRAPH_DEPTH tag can be used to set the maximum depth of the 
+ # graphs generated by dot. A depth value of 3 means that only nodes reachable 
+ # from the root by following a path via at most 3 edges will be shown. Nodes 
+ # that lie further from the root node will be omitted. Note that setting this 
+ # option to 1 or 2 may greatly reduce the computation time needed for large 
+ # code bases. Also note that a graph may be further truncated if the graph's 
+ # image dimensions are not sufficient to fit the graph (see MAX_DOT_GRAPH_WIDTH 
+ # and MAX_DOT_GRAPH_HEIGHT). If 0 is used for the depth value (the default), 
+ # the graph is not depth-constrained.
+ 
+ MAX_DOT_GRAPH_DEPTH    = 0
+ 
+ # Set the DOT_TRANSPARENT tag to YES to generate images with a transparent 
+ # background. This is disabled by default, which results in a white background. 
+ # Warning: Depending on the platform used, enabling this option may lead to 
+ # badly anti-aliased labels on the edges of a graph (i.e. they become hard to 
+ # read).
+ 
+ DOT_TRANSPARENT        = NO
+ 
+ # Set the DOT_MULTI_TARGETS tag to YES to allow dot to generate multiple output 
+ # files in one run (i.e. multiple -o and -T options on the command line). This 
+ # makes dot run faster, but since only newer versions of dot (>1.8.10) 
+ # support this, this feature is disabled by default.
+ 
+ DOT_MULTI_TARGETS      = NO
+ 
+ # If the GENERATE_LEGEND tag is set to YES (the default) Doxygen will 
+ # generate a legend page explaining the meaning of the various boxes and 
+ # arrows in the dot generated graphs.
+ 
+ GENERATE_LEGEND        = YES
+ 
+ # If the DOT_CLEANUP tag is set to YES (the default) Doxygen will 
+ # remove the intermediate dot files that are used to generate 
+ # the various graphs.
+ 
+ DOT_CLEANUP            = YES
+ 
+ #---------------------------------------------------------------------------
+ # Configuration::additions related to the search engine   
+ #---------------------------------------------------------------------------
+ 
+ # The SEARCHENGINE tag specifies whether or not a search engine should be 
+ # used. If set to NO the values of all tags below this one will be ignored.
+ 
+ SEARCHENGINE           = NO
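
The MAX_DOT_GRAPH_DEPTH comment in the config above describes a depth limit: only nodes reachable from the root via at most N edges are shown, and 0 means no limit. A minimal Python sketch of that rule follows. It is an illustration only, not doxygen's implementation; the function name visible_nodes and the sample include chain are hypothetical.

```python
from collections import deque

def visible_nodes(graph, root, max_depth=0):
    """Return the nodes reachable from root via at most max_depth edges.

    max_depth == 0 means unconstrained, mirroring MAX_DOT_GRAPH_DEPTH = 0.
    graph maps a node name to a list of its direct successors.
    """
    depth_of = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        depth = depth_of[node]
        # Stop expanding once the depth limit is reached (0 disables the limit).
        if max_depth and depth >= max_depth:
            continue
        for nxt in graph.get(node, ()):
            if nxt not in depth_of:
                depth_of[nxt] = depth + 1
                queue.append(nxt)
    return set(depth_of)

# Hypothetical include chain: a.h -> b.h -> c.h -> d.h -> e.h
chain = {"a.h": ["b.h"], "b.h": ["c.h"], "c.h": ["d.h"], "d.h": ["e.h"]}
```

With a limit of 3, e.h is four edges away from a.h and would be dropped; with the default of 0, as set in this config, every node is kept.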

Index: llvm-www/releases/1.8/docs/doxygen.css
diff -c /dev/null llvm-www/releases/1.8/docs/doxygen.css:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/doxygen.css	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,378 ----
+ BODY,H1,H2,H3,H4,H5,H6,P,CENTER,TD,TH,UL,DL,DIV {
+ 	font-family: Verdana,Geneva,Arial,Helvetica,sans-serif;
+ }
+ BODY,TD {
+  font-size: 90%;
+ }
+ H1 {
+  text-align: center;
+  font-size: 140%;
+  font-weight: bold;
+ }
+ H2 {
+  font-size: 120%;
+  font-style: italic;
+ }
+ H3 {
+  font-size: 100%;
+ }
+ CAPTION { font-weight: bold }
+ DIV.qindex {
+ 	width: 100%;
+ 	background-color: #eeeeff;
+ 	border: 1px solid #b0b0b0;
+ 	text-align: center;
+ 	margin: 2px;
+ 	padding: 2px;
+ 	line-height: 140%;
+ }
+ DIV.nav {
+ 	width: 100%;
+ 	background-color: #eeeeff;
+ 	border: 1px solid #b0b0b0;
+ 	text-align: center;
+ 	margin: 2px;
+ 	padding: 2px;
+ 	line-height: 140%;
+ }
+ DIV.navtab {
+        background-color: #eeeeff;
+        border: 1px solid #b0b0b0;
+        text-align: center;
+        margin: 2px;
+        margin-right: 15px;
+        padding: 2px;
+ }
+ TD.navtab {
+        font-size: 70%;
+ }
+ A.qindex {
+        text-decoration: none;
+        font-weight: bold;
+        color: #1A419D;
+ }
+ A.qindex:visited {
+        text-decoration: none;
+        font-weight: bold;
+        color: #1A419D
+ }
+ A.qindex:hover {
+ 	text-decoration: none;
+ 	background-color: #ddddff;
+ }
+ A.qindexHL {
+ 	text-decoration: none;
+ 	font-weight: bold;
+ 	background-color: #6666cc;
+ 	color: #ffffff;
+ 	border: 1px double #9295C2;
+ }
+ A.qindexHL:hover {
+ 	text-decoration: none;
+ 	background-color: #6666cc;
+ 	color: #ffffff;
+ }
+ A.qindexHL:visited { 
+  text-decoration: none; background-color: #6666cc; color: #ffffff }
+ A.el { text-decoration: none; font-weight: bold }
+ A.elRef { font-weight: bold }
+ A.code:link { text-decoration: none; font-weight: normal; color: #0000FF}
+ A.code:visited { text-decoration: none; font-weight: normal; color: #0000FF}
+ A.codeRef:link { font-weight: normal; color: #0000FF}
+ A.codeRef:visited { font-weight: normal; color: #0000FF}
+ A:hover { text-decoration: none; background-color: #f2f2ff }
+ DL.el { margin-left: -1cm }
+ .fragment {
+        font-family: Fixed, monospace;
+        font-size: 95%;
+ }
+ PRE.fragment {
+ 	border: 1px solid #CCCCCC;
+ 	background-color: #f5f5f5;
+ 	margin-top: 4px;
+ 	margin-bottom: 4px;
+ 	margin-left: 2px;
+ 	margin-right: 8px;
+ 	padding-left: 6px;
+ 	padding-right: 6px;
+ 	padding-top: 4px;
+ 	padding-bottom: 4px;
+ }
+ DIV.ah { background-color: black; font-weight: bold; color: #ffffff; margin-bottom: 3px; margin-top: 3px }
+ TD.md { background-color: #F4F4FB; font-weight: bold; }
+ TD.mdPrefix {
+        background-color: #F4F4FB;
+        color: #606060;
+ 	font-size: 80%;
+ }
+ TD.mdname1 { background-color: #F4F4FB; font-weight: bold; color: #602020; }
+ TD.mdname { background-color: #F4F4FB; font-weight: bold; color: #602020; width: 600px; }
+ DIV.groupHeader {
+        margin-left: 16px;
+        margin-top: 12px;
+        margin-bottom: 6px;
+        font-weight: bold;
+ }
+ DIV.groupText { margin-left: 16px; font-style: italic; font-size: 90% }
+ BODY {
+ 	background: white;
+ 	color: black;
+ 	margin-right: 20px;
+ 	margin-left: 20px;
+ }
+ TD.indexkey {
+ 	background-color: #eeeeff;
+ 	font-weight: bold;
+ 	padding-right  : 10px;
+ 	padding-top    : 2px;
+ 	padding-left   : 10px;
+ 	padding-bottom : 2px;
+ 	margin-left    : 0px;
+ 	margin-right   : 0px;
+ 	margin-top     : 2px;
+ 	margin-bottom  : 2px;
+ 	border: 1px solid #CCCCCC;
+ }
+ TD.indexvalue {
+ 	background-color: #eeeeff;
+ 	font-style: italic;
+ 	padding-right  : 10px;
+ 	padding-top    : 2px;
+ 	padding-left   : 10px;
+ 	padding-bottom : 2px;
+ 	margin-left    : 0px;
+ 	margin-right   : 0px;
+ 	margin-top     : 2px;
+ 	margin-bottom  : 2px;
+ 	border: 1px solid #CCCCCC;
+ }
+ TR.memlist {
+    background-color: #f0f0f0; 
+ }
+ P.formulaDsp { text-align: center; }
+ IMG.formulaDsp { }
+ IMG.formulaInl { vertical-align: middle; }
+ SPAN.keyword       { color: #008000 }
+ SPAN.keywordtype   { color: #604020 }
+ SPAN.keywordflow   { color: #e08000 }
+ SPAN.comment       { color: #800000 }
+ SPAN.preprocessor  { color: #806020 }
+ SPAN.stringliteral { color: #002080 }
+ SPAN.charliteral   { color: #008080 }
+ .mdTable {
+ 	border: 1px solid #868686;
+ 	background-color: #F4F4FB;
+ }
+ .mdRow {
+ 	padding: 8px 10px;
+ }
+ .mdescLeft {
+        padding: 0px 8px 4px 8px;
+ 	font-size: 80%;
+ 	font-style: italic;
+ 	background-color: #FAFAFA;
+ 	border-top: 1px none #E0E0E0;
+ 	border-right: 1px none #E0E0E0;
+ 	border-bottom: 1px none #E0E0E0;
+ 	border-left: 1px none #E0E0E0;
+ 	margin: 0px;
+ }
+ .mdescRight {
+        padding: 0px 8px 4px 8px;
+ 	font-size: 80%;
+ 	font-style: italic;
+ 	background-color: #FAFAFA;
+ 	border-top: 1px none #E0E0E0;
+ 	border-right: 1px none #E0E0E0;
+ 	border-bottom: 1px none #E0E0E0;
+ 	border-left: 1px none #E0E0E0;
+ 	margin: 0px;
+ }
+ .memItemLeft {
+ 	padding: 1px 0px 0px 8px;
+ 	margin: 4px;
+ 	border-top-width: 1px;
+ 	border-right-width: 1px;
+ 	border-bottom-width: 1px;
+ 	border-left-width: 1px;
+ 	border-top-color: #E0E0E0;
+ 	border-right-color: #E0E0E0;
+ 	border-bottom-color: #E0E0E0;
+ 	border-left-color: #E0E0E0;
+ 	border-top-style: solid;
+ 	border-right-style: none;
+ 	border-bottom-style: none;
+ 	border-left-style: none;
+ 	background-color: #FAFAFA;
+ 	font-size: 80%;
+ }
+ .memItemRight {
+ 	padding: 1px 8px 0px 8px;
+ 	margin: 4px;
+ 	border-top-width: 1px;
+ 	border-right-width: 1px;
+ 	border-bottom-width: 1px;
+ 	border-left-width: 1px;
+ 	border-top-color: #E0E0E0;
+ 	border-right-color: #E0E0E0;
+ 	border-bottom-color: #E0E0E0;
+ 	border-left-color: #E0E0E0;
+ 	border-top-style: solid;
+ 	border-right-style: none;
+ 	border-bottom-style: none;
+ 	border-left-style: none;
+ 	background-color: #FAFAFA;
+ 	font-size: 80%;
+ }
+ .memTemplItemLeft {
+ 	padding: 1px 0px 0px 8px;
+ 	margin: 4px;
+ 	border-top-width: 1px;
+ 	border-right-width: 1px;
+ 	border-bottom-width: 1px;
+ 	border-left-width: 1px;
+ 	border-top-color: #E0E0E0;
+ 	border-right-color: #E0E0E0;
+ 	border-bottom-color: #E0E0E0;
+ 	border-left-color: #E0E0E0;
+ 	border-top-style: none;
+ 	border-right-style: none;
+ 	border-bottom-style: none;
+ 	border-left-style: none;
+ 	background-color: #FAFAFA;
+ 	font-size: 80%;
+ }
+ .memTemplItemRight {
+ 	padding: 1px 8px 0px 8px;
+ 	margin: 4px;
+ 	border-top-width: 1px;
+ 	border-right-width: 1px;
+ 	border-bottom-width: 1px;
+ 	border-left-width: 1px;
+ 	border-top-color: #E0E0E0;
+ 	border-right-color: #E0E0E0;
+ 	border-bottom-color: #E0E0E0;
+ 	border-left-color: #E0E0E0;
+ 	border-top-style: none;
+ 	border-right-style: none;
+ 	border-bottom-style: none;
+ 	border-left-style: none;
+ 	background-color: #FAFAFA;
+ 	font-size: 80%;
+ }
+ .memTemplParams {
+ 	padding: 1px 0px 0px 8px;
+ 	margin: 4px;
+ 	border-top-width: 1px;
+ 	border-right-width: 1px;
+ 	border-bottom-width: 1px;
+ 	border-left-width: 1px;
+ 	border-top-color: #E0E0E0;
+ 	border-right-color: #E0E0E0;
+ 	border-bottom-color: #E0E0E0;
+ 	border-left-color: #E0E0E0;
+ 	border-top-style: solid;
+ 	border-right-style: none;
+ 	border-bottom-style: none;
+ 	border-left-style: none;
+        color: #606060;
+ 	background-color: #FAFAFA;
+ 	font-size: 80%;
+ }
+ .search     { color: #003399;
+               font-weight: bold;
+ }
+ FORM.search {
+               margin-bottom: 0px;
+               margin-top: 0px;
+ }
+ INPUT.search { font-size: 75%;
+                color: #000080;
+                font-weight: normal;
+                background-color: #eeeeff;
+ }
+ TD.tiny      { font-size: 75%;
+ }
+ a {
+ 	color: #252E78;
+ }
+ a:visited {
+ 	color: #3D2185;
+ }
+ .dirtab { padding: 4px;
+           border-collapse: collapse;
+           border: 1px solid #b0b0b0;
+ }
+ TH.dirtab { background: #eeeeff;
+             font-weight: bold;
+ }
+ HR { height: 1px;
+      border: none;
+      border-top: 1px solid black;
+ }
+ 
+ /* 
+  * LLVM Modifications.
+  * Note: Everything above here is generated with "doxygen -w html" command. See
+  * "doxygen --help" for details. What follows are CSS overrides for LLVM 
+  * specific formatting. We want to keep the above so it can be replaced with
+  * subsequent doxygen upgrades.
+  */
+ 
+ .footer {
+         font-size: 80%;
+         font-weight: bold;
+         text-align: center;
+         vertical-align: middle;
+ }
+ .title {
+   font-size: 25pt; 
+   color: black; background: url("../img/lines.gif");
+   font-weight: bold;
+   border-width: 1px;
+   border-style: solid none solid none;
+   text-align: center;
+   vertical-align: middle;
+   padding-left: 8pt;
+   padding-top: 1px;
+   padding-bottom: 2px
+ }
+ A:link {
+         cursor: pointer;
+         text-decoration: none;
+         font-weight: bolder;
+ }
+ A:visited {
+         cursor: pointer;
+         text-decoration: underline;
+         font-weight: bolder;
+ }
+ A:hover {
+         cursor: pointer;
+         text-decoration: underline;
+         font-weight: bolder;
+ }
+ A:active {
+         cursor: pointer;
+         text-decoration: underline;
+         font-weight: bolder;
+         font-style: italic;
+ }
+ H1 {
+  text-align: center;
+  font-size: 140%;
+  font-weight: bold;
+ }
+ H2 {
+  font-size: 120%;
+  font-style: italic;
+ }
+ H3 {
+  font-size: 100%;
+ }
+ A.qindex {}
+ A.qindexRef {}
+ A.el { text-decoration: none; font-weight: bold }
+ A.elRef { font-weight: bold }
+ A.code { text-decoration: none; font-weight: normal; color: #4444ee }
+ A.codeRef { font-weight: normal; color: #4444ee }

Index: llvm-www/releases/1.8/docs/doxygen.footer
diff -c /dev/null llvm-www/releases/1.8/docs/doxygen.footer:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/doxygen.footer	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,9 ----
+ <hr>
+ <p class="footer">
+ Generated on $datetime for <a href="http://llvm.org">$projectname</a> by
+ <a href="http://www.doxygen.org"><img src="doxygen.png" alt="Doxygen"
+ align="middle" border="0"/>$doxygenversion</a><br/>
+ Copyright © 2003,2004,2005,2006 University of Illinois at Urbana-Champaign.
+ All Rights Reserved.</p>
+ </body>
+ </html>
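
The footer above (and the header that follows) contain $datetime, $projectname, $doxygenversion and $title placeholders, which doxygen fills in when it generates each page. The substitution can be pictured with Python's string.Template, which happens to use the same $name syntax. This is only an analogy under that assumption, with a simplified footer line and placeholder values that are purely illustrative.

```python
from string import Template

# Simplified version of the footer line; the real template has more markup.
footer = Template(
    'Generated on $datetime for <a href="http://llvm.org">$projectname</a> '
    "by Doxygen $doxygenversion"
)

def render(values):
    """Fill in every placeholder; raises KeyError if one is missing."""
    return footer.substitute(values)

# Illustrative values only; doxygen supplies the real ones at generation time.
sample = render({
    "datetime": "<generation date>",
    "projectname": "LLVM",
    "doxygenversion": "<doxygen version>",
})
```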

Index: llvm-www/releases/1.8/docs/doxygen.header
diff -c /dev/null llvm-www/releases/1.8/docs/doxygen.header:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/doxygen.header	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,9 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
+ <html><head>
+ <meta http-equiv="Content-Type" content="text/html;charset=iso-8859-1"/>
+ <meta name="keywords" content="LLVM,Low Level Virtual Machine,C++,doxygen,API,documentation"/>
+ <meta name="description" content="C++ source code API documentation for the Low Level Virtual Machine (LLVM)."/>
+ <title>LLVM: $title</title>
+ <link href="doxygen.css" rel="stylesheet" type="text/css"/>
+ </head><body>
+ <p class="title">LLVM API Documentation</p>

Index: llvm-www/releases/1.8/docs/doxygen.intro
diff -c /dev/null llvm-www/releases/1.8/docs/doxygen.intro:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/doxygen.intro	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,18 ----
+ /// @mainpage Low Level Virtual Machine
+ ///
+ /// @section main_intro Introduction
+ /// Welcome to the Low Level Virtual Machine (LLVM).
+ ///
+ /// This documentation describes the @b internal software that makes 
+ /// up LLVM, not the @b external use of  LLVM. There are no instructions
+ /// here on how to use LLVM, only the APIs that make up the software. For usage 
+ /// instructions, please see the programmer's guide or reference manual.
+ ///
+ /// @section main_caveat Caveat 
+ /// This documentation is generated directly from the source code with doxygen. 
+ /// Since LLVM is constantly under active development, what you're about to
+ /// read is out of date! However, it may still be useful since certain portions
+ /// of LLVM are very stable. 
+ ///
+ /// @section main_changelog Change Log
+ /// - Original content written 12/30/2003 by Reid Spencer

Index: llvm-www/releases/1.8/docs/index.html
diff -c /dev/null llvm-www/releases/1.8/docs/index.html:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/index.html	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,265 ----
+ <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"
+                       "http://www.w3.org/TR/html4/strict.dtd">
+ <html>
+ <head>
+   <title>Documentation for the LLVM System</title>
+   <link rel="stylesheet" href="llvm.css" type="text/css">
+ </head>
+ <body>
+ 
+ <div class="doc_title">Documentation for the LLVM System</div>
+ 
+ <div class="doc_text">
+ <table class="layout" width="95%"><tr class="layout"><td class="left">
+ <ul>
+   <li><a href="#llvmdesign">LLVM Design</a></li>
+   <li><a href="#llvmpubs">LLVM Publications</a></li>
+   <li><a href="#userguide">LLVM User Guides</a></li>
+   <li><a href="#llvmprog">General LLVM Programming Documentation</a></li>
+   <li><a href="#subsystems">LLVM Subsystem Documentation</a></li>
+   <li><a href="#maillist">LLVM Mailing Lists</a></li>
+ </ul>
+ </td><td class="right">
+   <form action="http://www.google.com/search" method=get>
+       <input type="hidden" name="sitesearch" value="llvm.org/docs">
+       <input type=text name=q size=25><br>
+       <input type=submit value="Search the LLVM Docs" name="submit">
+   </form>
+ </td></tr></table>
+ </div>
+ 
+ <div class="doc_author">    
+   <p>Written by <a href="http://llvm.org">The LLVM Team</a></p>
+ </div>
+ 
+ <!--=======================================================================-->
+ <div class="doc_section"><a name="llvmdesign">LLVM Design & Overview</a></div>
+ <!--=======================================================================-->
+ 
+ <ul>
+ <li><a href="LangRef.html">LLVM Language Reference Manual</a> - Defines the LLVM
+ intermediate representation.</li>
+ 
+ <li><a href="http://llvm.org/pubs/2006-04-25-GelatoLLVMIntro.html">Introduction to the LLVM Compiler Infrastructure</a> - Presentation describing LLVM.</li>
+ <li><a href="http://llvm.org/pubs/2004-09-22-LCPCLLVMTutorial.html">The LLVM Compiler Framework and
+ Infrastructure Tutorial</a> - Tutorial for writing passes, exploring the system.</li>
+ <li><a href="http://llvm.org/pubs/2004-01-30-CGO-LLVM.html">LLVM: A Compilation Framework for
+ Lifelong Program Analysis & Transformation</a> - Design overview.</li>
+ <li><a href="http://llvm.org/pubs/2002-12-LattnerMSThesis.html">LLVM: An Infrastructure for
+ Multi-Stage Optimization</a> - More details (somewhat old now).</li>
+ 
+ </ul>
+ 
+ <!--=======================================================================-->
+ <div class="doc_section"><a name="userguide">LLVM User Guides</a></div>
+ <!--=======================================================================-->
+ 
+ <ul>
+ <li><a href="GettingStarted.html">The LLVM Getting Started Guide</a> -
+ Discusses how to get up and running quickly with the LLVM infrastructure.
+ Everything from unpacking and compilation of the distribution to execution of
+ some tools.</li>
+ 
+ <li><a href="GettingStartedVS.html">Getting Started with the LLVM System using
+ Microsoft Visual Studio</a> - An addendum to the main Getting Started guide for
+ those using Visual Studio on Windows.</li>
+ 
+ <li><a href="CommandGuide/index.html">LLVM Command Guide</a> - A reference
+ manual for the LLVM command line utilities ("man" pages for LLVM tools).<br/>
+ Current tools:
+  <a href="CommandGuide/html/llvm-ar.html">llvm-ar</a>,
+  <a href="CommandGuide/html/llvm-ranlib.html">llvm-ranlib</a>,
+  <a href="CommandGuide/html/llvm-as.html">llvm-as</a>,
+  <a href="CommandGuide/html/llvm-dis.html">llvm-dis</a>,
+  <a href="CommandGuide/html/opt.html">opt</a>,
+  <a href="CommandGuide/html/llc.html">llc</a>,
+  <a href="CommandGuide/html/lli.html">lli</a>,
+  <a href="CommandGuide/html/llvm-link.html">llvm-link</a>,
+  <a href="CommandGuide/html/analyze.html">analyze</a>,
+  <a href="CommandGuide/html/llvm-nm.html">llvm-nm</a>,
+  <a href="CommandGuide/html/llvm-prof.html">llvm-prof</a>,
+  <a href="CommandGuide/html/llvmgcc.html">llvmgcc</a>,
+  <a href="CommandGuide/html/llvmgxx.html">llvmgxx</a>,
+  <a href="CommandGuide/html/gccas.html">gccas</a>,
+  <a href="CommandGuide/html/gccld.html">gccld</a>,
+  <a href="CommandGuide/html/stkrc.html">stkrc</a>,
+  <a href="CommandGuide/html/bugpoint.html">bugpoint</a>,
+  <a href="CommandGuide/html/llvm-extract.html">llvm-extract</a>,
+  <a href="CommandGuide/html/llvm-bcanalyzer.html">llvm-bcanalyzer</a>,
+  <a href="CommandGuide/html/llvmc.html">llvmc</a>
+ </li>
+ 
+ <li><a href="FAQ.html">Frequently Asked Questions</a> - A list of common
+ questions and problems and their solutions.</li>
+ 
+ <li><a href="ReleaseNotes.html">Release notes for the current release</a> 
+ - This describes new features, known bugs, and other limitations.</li>
+ 
+ <li><a href="HowToSubmitABug.html">How to Submit A Bug Report</a> -
+ Instructions for properly submitting information about any bugs you run into in
+ the LLVM system.</li>
+ 
+ <li><a href="TestingGuide.html">LLVM Test Suite Guide</a> - A reference
+ manual for using the LLVM test suite.</li>
+ 
+ <li><a href="CFEBuildInstrs.html">How to build the C/C++ front-end</a> -
+ Instructions for building the front-end from source.</li>
+ 
+ <li><a href="Lexicon.html">The LLVM Lexicon</a> - Definition of acronyms, terms
+ and concepts used in LLVM.</li>
+ 
+ <li><a name="irc">You can probably find help on the unofficial LLVM IRC 
+ channel</a>.  We are often on irc.oftc.net in the #llvm channel.  If you are 
+ using the Mozilla browser and have ChatZilla installed, you can <a 
+ href="irc://irc.oftc.net/llvm">join #llvm on irc.oftc.net</a> directly.</li>
+ 
+ </ul>
+ 
+ 
+ <!--=======================================================================-->
+ <div class="doc_section"><a name="llvmprog">General LLVM Programming Documentation</a></div>
+ <!--=======================================================================-->
+ 
+ <ul>
+ <li><a href="LangRef.html">LLVM Language Reference Manual</a> - Defines the LLVM
+ intermediate representation and the assembly form of the different nodes.</li>
+ 
+ <li><a href="ProgrammersManual.html">The LLVM Programmers Manual</a> -
+ Introduction to the general layout of the LLVM sourcebase, important classes
+ and APIs, and some tips & tricks.</li>
+ 
+ <li><a href="Projects.html">LLVM Project Guide</a> - How-to guide and
+ templates for new projects that <em>use</em> the LLVM infrastructure.  The
+ templates (directory organization, Makefiles, and test tree) allow the project
+ code to be located outside (or inside) the <tt>llvm/</tt> tree, while using LLVM
+ header files and libraries.</li>
+ 
+ <li><a href="MakefileGuide.html">LLVM Makefile Guide</a> - Describes how the
+ LLVM makefiles work and how to use them.</li>
+ 
+ <li><a href="CommandLine.html">CommandLine library Reference Manual</a> -
+ Provides information on using the command line parsing library.</li>
+ 
+ <li><a href="CodingStandards.html">LLVM Coding standards</a> -
+ Details the LLVM coding standards and provides useful information on writing
+ efficient C++ code.</li>
+ 
+ <li><a href="ExtendingLLVM.html">Extending LLVM</a> - Look here to see how 
+ to add instructions and intrinsics to LLVM.</li>
+ 
+ <li><a href="UsingLibraries.html">Using LLVM Libraries</a> - Look here to
+ understand how to use the libraries produced when LLVM is compiled.</li>
+ 
+ <li><a href="HowToReleaseLLVM.html">How To Release LLVM To The Public</a> - This
+ is a guide to preparing LLVM releases. Most developers can ignore it.</li>
+ 
+ <li><a href="http://llvm.org/doxygen/">Doxygen generated 
+ documentation</a> (<a
+ href="http://llvm.org/doxygen/inherits.html">classes</a>)
+ 
+ (<a href="http://llvm.org/doxygen/doxygen.tar.gz">tarball</a>)
+ </li>
+ 
+ <li><a href="http://llvm.org/cvsweb/cvsweb.cgi/llvm">CVSWeb CVS Tree 
+ Browser</a></li>
+ 
+ </ul>
+ 
+ <!--=======================================================================-->
+ <div class="doc_section"><a name="subsystems">LLVM Subsystem Documentation</a></div>
+ <!--=======================================================================-->
+ 
+ <ul>
+ 
+ <li><a href="WritingAnLLVMPass.html">Writing an LLVM Pass</a> - Information
+ on how to write LLVM transformations and analyses.</li>
+ 
+ <li><a href="WritingAnLLVMBackend.html">Writing an LLVM Backend</a> - Information
+ on how to write LLVM backends for machine targets.</li>
+ 
+ <li><a href="CodeGenerator.html">The LLVM Target-Independent Code
+ Generator</a> - The design and implementation of the LLVM code generator.
+ Useful if you are working on retargeting LLVM to a new architecture, designing
+ a new codegen pass, or enhancing existing components.</li>
+ 
+ <li><a href="TableGenFundamentals.html">TableGen Fundamentals</a> -
+ Describes the TableGen tool, which is used heavily by the LLVM code
+ generator.</li>
+ 
+ <li><a href="AliasAnalysis.html">Alias Analysis in LLVM</a> - Information
+ on how to write a new alias analysis implementation or how to use existing
+ analyses.</li>
+ 
+ <li><a href="Stacker.html">The Stacker Chronicles</a> - This document
+ describes the Stacker language and its LLVM frontend, as well as some details
+ about LLVM useful for those writing front-ends.</li>
+ 
+ <li><a href="GarbageCollection.html">Accurate Garbage Collection with
+ LLVM</a> - The interfaces source-language compilers should use for compiling
+ GC'd programs.</li>
+ 
+ <li><a href="SourceLevelDebugging.html">Source Level Debugging with
+ LLVM</a> - This document describes the design and philosophy behind the LLVM
+ source-level debugger.</li>
+ 
+ <li><a href="Bugpoint.html">Bugpoint</a> - Description and usage information
+ for the automatic bug finder and test-case reducer.</li>
+ 
+ <li><a href="CompilerDriver.html">Compiler Driver (llvmc)</a> - This document
+ describes the design and configuration of the LLVM compiler driver tool,
+ <tt>llvmc</tt>.</li>
+ 
+ <li><a href="BytecodeFormat.html">LLVM Bytecode File Format</a></li>
+ 
+ <li><a href="SystemLibrary.html">System Library</a> - This document describes
+ the LLVM System Library (<tt>lib/System</tt>) and how to keep LLVM source code
+ portable.</li>
+ 
+ </ul>
+ 
+ 
+ <!--=======================================================================-->
+ <div class="doc_section"><a name="maillist">LLVM Mailing Lists</a></div>
+ <!--=======================================================================-->
+ 
+ <ul>
+ <li>The <a href="http://mail.cs.uiuc.edu/mailman/listinfo/llvm-announce">
+ LLVM Announcements List</a>: This is a low volume list that provides important 
+ announcements regarding LLVM.  It gets email about once a month.</li>
+ 
+ <li>The <a href="http://mail.cs.uiuc.edu/mailman/listinfo/llvmdev">Developer's
+ List</a>: This list is for people who want to be included in technical 
+ discussions of LLVM. People post to this list when they have questions about 
+ writing code for or using the LLVM tools. It is relatively low volume.</li>
+ 
+ <li>The <a href="http://mail.cs.uiuc.edu/pipermail/llvmbugs/">Bugs &
+ Patches Archive</a>: This list gets emailed every time a bug is opened or
+ closed, and when people submit patches to be included in LLVM.  It is higher 
+ volume than the LLVMdev list.</li>
+ 
+ <li>The <a href="http://mail.cs.uiuc.edu/pipermail/llvm-commits/">CVS Commits
+ Archive</a>: This list contains all commit messages that are made when LLVM 
+ developers commit code changes to the CVS archive. It is useful for those who 
+ want to stay on the bleeding edge of LLVM development. This list is very high
+ volume.</li>
+ 
+ <li>The <a href="http://mail.cs.uiuc.edu/pipermail/llvm-testresults/">
+ Test Results Archive</a>: A message is automatically sent to this list by every
+ active nightly tester when it completes.  As such, this list gets email several
+ times each day, making it a high volume list.</li>
+ 
+ </ul>
+ 
+ <!-- *********************************************************************** -->
+ 
+ <hr>
+ <address>
+   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img
+   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a>
+   <a href="http://validator.w3.org/check/referer"><img
+   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a>
+ 
+   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br>
+   Last modified: $Date: 2006/08/09 05:56:40 $
+ </address>
+ 
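
The search form near the top of index.html submits its fields to Google with method=get, so the fields end up in the query string. A small Python sketch of the URL such a submission would produce follows, assuming standard form encoding. The function name search_url and the sample query are hypothetical.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

def search_url(query):
    """Build the GET URL the form would submit, fields in document order."""
    params = [
        ("sitesearch", "llvm.org/docs"),     # hidden field restricting the site
        ("q", query),                        # the text box
        ("submit", "Search the LLVM Docs"),  # the named submit button
    ]
    return "http://www.google.com/search?" + urlencode(params)

url = search_url("alias analysis")
fields = parse_qs(urlsplit(url).query)
```

Decoding the query string again recovers the three fields, which is a quick way to confirm that the hidden sitesearch value travels with every query.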

Index: llvm-www/releases/1.8/docs/llvm.css
diff -c /dev/null llvm-www/releases/1.8/docs/llvm.css:1.1
*** /dev/null	Wed Aug  9 00:56:53 2006
--- llvm-www/releases/1.8/docs/llvm.css	Wed Aug  9 00:56:40 2006
***************
*** 0 ****
--- 1,84 ----
+ /*
+  * LLVM documentation style sheet
+  */
+ 
+ /* Common styles */
+ .body { color: black; background: white; margin: 0 0 0 0 }
+ 
+ /* No borders on image links */
+ a:link img, a:visited img {border-style: none}
+ 
+ address img { float: right; width: 88px; height: 31px; }
+ address     { clear: right; }
+ 
+ TR, TD      { border: 2px solid gray; padding: 4pt 4pt 2pt 2pt; }
+ TH          { border: 2px solid gray; font-weight: bold; font-size: 105%; 
+               color: black; background: url("img/lines.gif");
+               font-family: Georgia,Palatino,Times,Roman,sans-serif; text-align:center;
+               vertical-align: middle; }
+ TABLE       { text-align: center; border: 2px solid black; 
+               border-collapse: collapse; margin-top: 1em; margin-left: 1em; 
+               margin-right: 1em; margin-bottom: 1em; }
+ /* 
+  * Documentation 
+  */
+ /* Common for title and header */
+ .doc_title, .doc_section, .doc_subsection { 
+   color: black; background: url("img/lines.gif");
+   font-family: Georgia,Palatino,Times,Roman,sans-serif; font-weight: bold;
+   border-width: 1px;
+   border-style: solid none solid none;
+   text-align: center;
+   vertical-align: middle;
+   padding-left: 8pt;
+   padding-top: 1px;
+   padding-bottom: 2px
+ }
+ 
+ .doc_title      { text-align: left;   font-size: 25pt }
+ .doc_section    { text-align: center; font-size: 22pt;
+                   margin: 20pt 0pt 5pt 0pt; }
+ .doc_subsection { width: 75%;
+                   text-align: left;  font-size: 12pt; padding: 4pt 4pt 4pt 4pt;
+                   margin: 1.5em 0.5em 0.5em 0.5em }
+ 
+ .doc_subsubsection { margin: 2.0em 0.5em 0.5em 0.5em;
+                      font-weight: bold; font-style: oblique;
+                      border-bottom: 1px solid #999999; font-size: 12pt;
+                      width: 75%; }
+ .doc_author     { text-align: left; font-weight: bold; padding-left: 20pt }
+ .doc_text       { text-align: left; padding-left: 20pt; padding-right: 10pt }
+ 
+ .doc_footer     { text-align: left; padding: 0 0 0 0 }
+ 
+ .doc_hilite     { color: blue; font-weight: bold; }
+ 
+ .doc_table      { text-align: center; width: 90%; 
+                   padding: 1px 1px 1px 1px; border: 1px; }
+ 
+ .doc_table_nw   { text-align: center; border: 1px; 
+     		  padding: 1px 1px 1px 1px; }
+ 
+ .doc_warning    { color: red; font-weight: bold }
+ 
+ .doc_code       { border: solid 1px gray; background: #eeeeee;
+                   margin: 0 1em 0 1em; 
+                   padding: 0 1em 0 1em;
+                   display:table;
+                 }
+ .doc_notes      { background: #fafafa; border: 1px solid #cecece; padding: 0.1em }
+ 
+ TABLE.layout    { text-align: left; border: none; border-collapse: collapse;
+                   padding: 4px 4px 4px 4px; }
+ TR.layout       { border: none; padding: 4pt 4pt 2pt 2pt; }
+ TD.layout       { border: none; padding: 4pt 4pt 2pt 2pt; 
+                   vertical-align: top;}
+ TD.left         { border: none; padding: 4pt 4pt 2pt 2pt; text-align: left; 
+                   vertical-align: top;}
+ TD.right        { border: none; padding: 4pt 4pt 2pt 2pt; text-align: right; 
+                   vertical-align: top;}
+ TH.layout       { border: none; font-weight: bold; font-size: 105%; 
+                   text-align:center; vertical-align: middle; }
+ 
+ /* Left align table cell */
+ .td_left        { border: 2px solid gray; text-align: left; }