[cfe-dev] UCNs/extended characters revisitted
public at alisdairm.net
Sat Jul 11 14:25:47 PDT 2009
> -----Original Message-----
> From: Ned Holbrook [mailto:ned at panic.com]
> Sent: 11 July 2009 22:22
> To: Eli Friedman
> Cc: AlisdairM(public); clang-dev Developers
> Subject: Re: [cfe-dev] UCNs/extended characters revisitted
> On Jul 11, 2009, at 1:58 PM, Eli Friedman wrote:
> > On Sat, Jul 11, 2009 at 1:40 PM, Ned Holbrook<ned at panic.com> wrote:
> >> As for (ii), I assume you simply want to disallow characters having
> >> general category P* (ie: Pd, Ps, etc.), no?
> > Not exactly; see Annex D of the C99 standard.
> I guess that means I'm not sure what Alisdair meant, then: is there
> special additional processing needed for punctuation, or is the goal
> simply an efficient representation of the table in Annex D?
Yes, that is what I mean.
I interpret the excluded character ranges as essentially containing punctuation and similar glyphs from assorted alphabets, which clearly should not be part of an identifier. They are generally 'non-alphabetic' characters anyway, and I probably expressed myself badly by generalising.
More information about the cfe-dev