[llvm-announce] LLVM 1.2 Release & Status update

Fri Mar 19 21:00:21 PST 2004

News flash: LLVM 1.2 is now available!
--------------------------------------

LLVM 1.2 is the result of ~3 months of hard work by many people in the
LLVM community.  It contains a bunch of new features, produces
substantially better code, and has many bug fixes over the 1.1 release.
A detailed list of new and improved features are included in the 1.2
release notes:
http://llvm.cs.uiuc.edu/releases/1.2/docs/ReleaseNotes.html#whatsnew

If you are using LLVM 1.0 or 1.1, we *highly* recommend upgrading to LLVM
1.2.  You can download LLVM from the releases page:
http://llvm.cs.uiuc.edu/releases/

March Status Update
-------------------

Since the February status update, LLVM received substantial code quality
improvements, compiler performance improvements, and even has some new
features.  All of these changes are in the new 1.2 release:

High-level changes:

1. The LLVM C-backend has been moved from the llvm-dis tool to the llc
   tool.  Instead of using 'llvm-dis -c x.bc -o x.cbe.c', now you should
   use 'llc -march=c x.bc -o x.cbe.c'.  This allows the C backend to share
   a bunch of code with the other code generators.
2. Brian fixed our autoconf configuration to not allow the definitions
   from our config.h file to leak into external client's code (PR180).
3. Brian wrote 'make install' and simple RPM generation targets.
4. 81 PRs have been fixed with a 1.2 milestone (these include code
   quality, code cleanup, quality of implementation improvements)

VMCore changes:

5. LLVM now handles large zero-initialized global variables much more
   efficiently than before (used by SPECINT95 among other programs).
6. LLVM has initial support for a new "select" instruction, which is
   useful for flattening control structures and recognizing idioms like
   min/max and saturated arithmetic.  Currently only the C backend
   supports it, though full support will be available in LLVM 1.3.  Here
   are the docs: http://llvm.cs.uiuc.edu/docs/LangRef.html#i_select
7. Misha added support to the pass manager for adding 'transitive'
   relationships between passes.
8. llvm.memset, llvm.memcpy, and llvm.memmove intrinsics were added to
   LLVM.  These allow code generators to produce fast inline code for
   these functions and for the optimizer to understand them.
9. LLVM now includes llvm.returnaddress & llvm.frameaddress intrinsics.
   These intrinsics are supported by the X86 and C backends and are
   produced by the C front-end (from the __builtin_return/frame_address()
   GCC extensions).

The new intrinsics are documented here:
  http://llvm.cs.uiuc.edu/docs/LangRef.html#intrinsics

Optimizer changes:

10. The LLVM profiler can now gather edge profiles and provide them to
    profile-guided optimization passes.
11. LLVM now includes a simple profile-guided basic block layout pass.
12. The LLVM optimizer can now transform code like "if (X == 0 || X ==
    12 || X == .." into a big switch statement.
13. The LLVM optimizer can now mark global variables 'constant' if they
    are obviously never written to, enabling many subsequent
    optimizations.
14. LLVM has a new aggressive interprocedural optimization (-argpromotion)
    that turns small "by reference" arguments into by-value arguments.
    This is particularly useful for C++ programs, but does speed up some C
    programs a lot too.
15. LLVM can now hoist loop-invariant calls out of loops and eliminate
    redundant expressions involving calls (if they have no side effects).
    This is particularly useful for functions like sin/cos/strlen/...
16. The "basicaa" alias analysis pass is more aggressive when considering
    what local stack objects can be modified by a function call.

Code generator changes:

17. The LLVM native code generators now support exception handling
    constructs correctly (which include setjmp/longjmp).
18. The linear scan register allocator that Alkis wrote works much better
    than it did in 1.1.  There is one outstanding known problem (PR264),
    but it otherwise seems to work well, and generate much faster code
    than the local allocator (2x speedups are not uncommon, and it beats
    the C backend+GCC in some cases).  We would really appreciate testing
    of this allocator (use -regalloc=linearscan on the lli or llc command
    lines), as we hope to make it the default for 1.3.
19. Alkis and Brian have been working hard on the code generator, cleaning
    it up and simplifying it.
20. The target-independent code generator is now substantially faster than
    it used to be and uses less memory.
21. Alkis added support in the target-independent code generator to fold
    spill code into instructions on targets that support it (e.g. X86).
22. The X86 code generator now generates substantially better code than it
    used to and can take advantage of its powerful addressing modes.
23. Alkis wrote a generic "spiller" interface that can be used by
    arbitrary global register allocators.
24. The X86 code generator compiles memcpy and memset into rep movs/stos
    instructions.
25. Brian made the -print-machineinstrs debugging option common across all
    of the code generators.
26. Alkis converted the MachineBasicBlock class from "vector of pointers
    to instructions" to "doubly-linked ilist of instructions", making the
    code generator faster and easier to program.

C/C++ front-end changes:

27. The C front-end generates much more efficient code when expanding
    large aggregate value initializations and copies.
28. The C front-end now implements the GCC "labels as values" extension,
    which is often used to implement "threaded interpreters".
29. Several miscellaneous bugfixes and code quality improvements

Miscellaneous changes:

30. Reid Spencer contributed a patch that allowed us to remove our one
    annoying dependency on the boost library.
31. John added several new programs to the test suite.
32. Misha wrote a new generic 'code extractor' which can be used to rip a
    region of code out of one function into another, without breaking the
    program.
33. Bugpoint has several important improvements:
   1. It now has a code generator crash debugging mode.
   2. It is now faster, especially when debugging crashes.
   3. Bugpoint can now try to narrow down miscompilations to the loop nest
      being miscompiled, instead of just down to a function.  This is made
      possible by Misha's code extractor.
34. Alkis tweaked the problem sizes for many of the benchmarks in the
    testsuite, allowing us to get meaningful performance numbers from
    them.
35. The .ll file parser is substantially faster in some nasty cases.
36. Many minor bugs are dead and miscellaneous improvements have been
    made.

Also, as with the 1.0 and 1.1 releases, John Criswell was the one
primarily responsible for pulling the release together and ensuring that
it worked well.  Many, many, thanks go out to him for all of his hard
work!

Here's a link to the February status update, which also includes a bunch
of new 1.2 additions/improvements/features:
http://mail.cs.uiuc.edu/pipermail/llvm-announce/2004-February/000005.html

If you have any questions or comments about LLVM or any of the features
in this status update, please feel free to contact us on the llvmdev list!

-Chris

-- 
http://llvm.cs.uiuc.edu/
http://www.nondot.org/~sabre/Projects/