David Nicol wrote: >for that matter, how about defining all strings as utf8 for semantic >reference For semantic reference, all strings are sequences of codepoint values (in some not-entirely-decided range that is a superset of Unicode codepoints). Latin-1 representation is a space and speed optimisation with restricted applicability; UTF-8 representation is a space optimisation but speed pessimisation. -zeframThread Previous | Thread Next