[cfe-dev] [REVIEW] UTF-8 in identifiers proof of concept

Sean Hunt scshunt at csclub.uwaterloo.ca
Fri Mar 30 08:06:15 PDT 2012


On Sat, Dec 31, 2011 at 22:11, Sean Hunt <scshunt at csclub.uwaterloo.ca>wrote:

> Hey folks,
>
> Attached is a proof of concept for the handling of UTF-8 in
> identifiers. Aside from the terrible isIdentifierBody function, which
> should be optimized where possible (possibly into a lookup table for
> the BMP, since that would be 8kb, and using the simple bitwise
> operation in there for other planes), I think the approach is the
> correct one. Given that this is sensitive code, however, I would like
> to ensure no one has any issues with this approach before I convert
> more of the lexer code over.
>
> Sean
>

This patch still applies reasonably cleanly; any feedback?

Sean
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20120330/bdc067e4/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: utf8-proof-of-concept.patch
Type: text/x-patch
Size: 3570 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20120330/bdc067e4/attachment.bin>


More information about the cfe-dev mailing list