[all-commits] [llvm/llvm-project] 68410f: Fix handling of medial hyphens in Unicode Names.

cor3ntin via All-commits all-commits at lists.llvm.org
Fri Jul 28 06:09:36 PDT 2023


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 68410fbed769485f2386a64f745b34f6d3f3295e
      https://github.com/llvm/llvm-project/commit/68410fbed769485f2386a64f745b34f6d3f3295e
  Author: Corentin Jabot <corentinjabot at gmail.com>
  Date:   2023-07-28 (Fri, 28 Jul 2023)

  Changed paths:
    M clang/docs/ReleaseNotes.rst
    M clang/lib/Lex/LiteralSupport.cpp
    M clang/test/Preprocessor/ucn-pp-identifier.c
    M llvm/lib/Support/UnicodeNameToCodepoint.cpp
    M llvm/lib/Support/UnicodeNameToCodepointGenerated.cpp
    M llvm/unittests/Support/UnicodeTest.cpp
    M llvm/utils/UnicodeData/UnicodeNameMappingGenerator.cpp

  Log Message:
  -----------
  Fix handling of medial hyphens in Unicode Names.

In a Unicode name was stored in a way that caused
a medial hyphen to be at the end of a a chunk, it would not
be properly ignored by the loose matching algorithm.

For example if `LEFT-TO-RIGHT OVERRIDE` was stored as
`LEFT-` [...], the `-` would not be ignored.

The generators now ensures nodes are not cut accross
medial hyphen boundaries.

Fixes #64161

Differential Revision: https://reviews.llvm.org/D156518




More information about the All-commits mailing list