> However, I don't think this actually affects your comments, except that > I'd guess that the half digits mentioned by Hong don't have the same > term "case" used with them that the letters of various alphabets do. I am not sure if we mean the same thing. The regular ascii "0123456789" are called "half-width-digit" in china, because they take about half of the width of any chinese character to display on the screen or paper. There are another set of "012..." in chinese encoding that denotes digits look the same width as chinese characters, full-width. The full width characters mainly used for formatting. It has nothing to do width the lowercase/uppercase in roman language. I believe Unicode has many "font" characters. > Is this Uppercase? > Is this Lowercase? I believe the Unicode already defines character categories, such as L, Lu, Ll, Lo. I prefer we just use unicode term instead of extending "ctype.h". The Perl 5 regex already support them. HongThread Previous | Thread Next