[libcxx-commits] [PATCH] D144346: [libc++][format] Improves Unicode decoders.

Tom Honermann via Phabricator via libcxx-commits libcxx-commits at lists.llvm.org
Fri Feb 24 09:29:36 PST 2023


tahonermann added inline comments.


================
Comment at: libcxx/include/__format/unicode.h:139-147
+  // U+0000..U+007F     00..7F                                         U+0000..U+007F 1 code unit range
+  // U+0080..U+07FF     *C2*..DF   80..BF                              U+0080..U+07FF 2 code unit range *
+  // U+0800..U+0FFF     E0         *A0*..BF    80..BF                  U+0800..U+FFFF 3 code unit range
+  // U+1000..U+CFFF     E1..EC     80..BF      80..BF
+  // U+D000..U+D7FF     ED         80..*9F*    80..BF                  U+D800..U+DFFF surrogate range
+  // U+E000..U+FFFF     EE..EF     80..BF      80..BF
+  // U+10000..U+3FFFF   F0         *90*..BF    80..BF     80..BF       U+10000..U+10FFFF 4 code unit range
----------------
Here is another presentation option that avoids the need for those footnotes. If you like this better, great. If not, no problem. The current presentation has the benefit of matching the bold highlighting in the table from the Unicode Standard, but I think the suggested presentation better explains the reason those invalid ranges exist.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D144346/new/

https://reviews.llvm.org/D144346



More information about the libcxx-commits mailing list