Hi! by the way lua also uses the simple approach to allow every character with the msb set as identifier: http://lua-users.org/wiki/UnicodeIdentifers so I would say at least for llvm this is simple and efficient, for c++ it's dependent on the standard. -Jochen