[cfe-dev] C99/C++ UCN (Universal Character Name) Support

Neil Booth neil at daikokuya.co.uk
Fri Mar 27 19:44:45 PDT 2009


steve naroff wrote:-

> Folks,
>
> I'm in the process of implementing UCN support in LiteralSupport.cpp.
>
> Part of implementing this is converting UTF-16 (\u) and UTF-32 (\U) to  
> UTF-8 (for insertion into a C-string, say).
>
> Unfortunately, Unix doesn't appear to have any standard support for this 
> type of conversion (which surprised me).
>
> Does anyone have any experience with this type of conversion?
>
> Thanks for any pointers!

Are you working on accepting them in identifiers too?

It's nice, both in literals and identifiers, to permit multibyte
characters from the current locale too, interchangably with UCNs.

Does clang implement execution charsets, or is it mandating it be
UTF-8?

Neil.



More information about the cfe-dev mailing list