[all-commits] [llvm/llvm-project] c932ce: Update Unicode to 15.0

cor3ntin via All-commits all-commits at lists.llvm.org
Wed Sep 21 20:03:19 PDT 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: c932cef32aa4a87259e5e204a579355cce949e20
      https://github.com/llvm/llvm-project/commit/c932cef32aa4a87259e5e204a579355cce949e20
  Author: Corentin Jabot <corentinjabot at gmail.com>
  Date:   2022-09-22 (Thu, 22 Sep 2022)

  Changed paths:
    M clang/docs/ReleaseNotes.rst
    M clang/lib/Lex/UnicodeCharSets.h
    M clang/test/Lexer/unicode.c
    M llvm/lib/Support/Unicode.cpp
    M llvm/lib/Support/UnicodeCaseFold.cpp
    M llvm/lib/Support/UnicodeNameToCodepoint.cpp
    M llvm/lib/Support/UnicodeNameToCodepointGenerated.cpp
    M llvm/unittests/Support/UnicodeTest.cpp
    M llvm/utils/UnicodeData/UnicodeNameMappingGenerator.cpp

  Log Message:
  -----------
  Update Unicode to 15.0

Unicode 15.0 adds 4,489 characters, for a total of 149,186 characters.
These additions include 2 new scripts along with 20 new emoji characters,
and 4,193 CJK ideographs.

This change modifies most existing tables, including:
 - XID_Start/XID_Continue in Clang
 - The character name database (used by \N{} in Clang)
 - The list of formattable/printable codepoints
 - The case folding algorithm (which we had not updated since Unicode 9)
 - The list of nonspacing/enclosing marks used by the column width
   computation algorithm. The rest of the column width algorithm
   is not updated.
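
As a rough illustration of how a table of nonspacing/enclosing marks can
feed a column width computation, here is a minimal sketch. It is not the
code in llvm/lib/Support/Unicode.cpp: the names CodepointRange,
ZeroWidthMarks, inRangeTable and columnWidthSketch are hypothetical, the
table is a two-entry excerpt rather than the generated data, and wide
(East Asian) and non-printable characters are ignored entirely.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <iterator>

// Hypothetical inclusive range of code points (not LLVM's actual type).
struct CodepointRange {
  uint32_t Lower;
  uint32_t Upper;
};

// Illustrative excerpt only: sorted, non-overlapping ranges of nonspacing
// and enclosing marks. A real table is generated from the Unicode
// Character Database and is far larger.
static const CodepointRange ZeroWidthMarks[] = {
    {0x0300, 0x036F}, // Combining Diacritical Marks
    {0x20D0, 0x20F0}, // Combining Diacritical Marks for Symbols
};

// Binary search over a sorted range table: find the first range whose
// upper bound is not below CP, then check that CP is inside it.
bool inRangeTable(uint32_t CP, const CodepointRange *Begin,
                  const CodepointRange *End) {
  const CodepointRange *It = std::lower_bound(
      Begin, End, CP,
      [](const CodepointRange &R, uint32_t V) { return R.Upper < V; });
  return It != End && It->Lower <= CP;
}

// Simplified width: marks in the table take zero columns, every other
// code point takes one. This is an assumption made for the sketch.
int columnWidthSketch(const char32_t *Text, std::size_t Len) {
  int Width = 0;
  for (std::size_t I = 0; I < Len; ++I) {
    uint32_t CP = static_cast<uint32_t>(Text[I]);
    if (!inRangeTable(CP, std::begin(ZeroWidthMarks),
                      std::end(ZeroWidthMarks)))
      ++Width;
  }
  return Width;
}
```

With this sketch, the sequence U+0065 U+0301 ("e" followed by a combining
acute accent) occupies a single column, because U+0301 is found in the
table. Updating such a table to a newer Unicode version changes which
code points count as zero-width without touching the lookup itself.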

Reviewed By: tahonermann

Differential Revision: https://reviews.llvm.org/D133807



