On Thu, 29 Oct 2009 20:35:11 +0000 John <john.imrie@vodafoneemail.co.uk> wrote: > Tom Christiansen wrote: > > > > Ethiopic DIGITs are *all* \D, and read left to right: > > > > \D L U+01369 ETHIOPIC DIGIT ONE > > > > Whereas all Kharoshthi DIGITs are also \D, but read right to left: > > > > > Tom, > > I suspect directionality may be a red herring here. IIRC the > directionality of the text only affects the *display* of characters not > the order the encoded codepoints appear in a data stream.Thats why > directionality is recorded in the UCD. But Unicode doesn't even encode whether the language's numeric system is "big-endian" or "little-endian". The quantity "ten" is usually written "10"; that is, substr($ten,0,1) == 1 and substr($ten,1,1) == 0; thus we arrive at 10 == (1*10) + 0; I see no guarantee in Unicode, nor any information to tell us, whether there isn't such a language which would represent the quantity ten as a string satisfying substr($x,0,1) == 0 and substr($x,1,1) == 1. This is irrespective of the order the characters are rendered to pixels on the screen. -- Paul "LeoNerd" Evans leonerd@leonerd.org.uk ICQ# 4135350 | Registered Linux# 179460 http://www.leonerd.org.uk/Thread Previous | Thread Next