develooper Front page | perl.perl5.porters | Postings from October 2009

Re: Rule 1 has been invoked [Re: What should \s \w \d match in5.12?]

Thread Previous | Thread Next
From:
Paul LeoNerd Evans
Date:
October 29, 2009 13:56
Subject:
Re: Rule 1 has been invoked [Re: What should \s \w \d match in5.12?]
Message ID:
20091029205636.05f1c47c@nim.leo
On Thu, 29 Oct 2009 20:35:11 +0000
John <john.imrie@vodafoneemail.co.uk> wrote:

> Tom Christiansen wrote:
> >
> > Ethiopic DIGITs are *all* \D, and read left to right:
> >
> >     \D L   U+01369 ETHIOPIC DIGIT ONE
> >
> > Whereas all Kharoshthi DIGITs are also \D, but read right to left:
> >
> >   
> Tom,
> 
> I suspect directionality may be a red herring here. IIRC the 
> directionality of the text only affects the *display* of characters not 
> the order the encoded codepoints appear in a data stream.Thats why 
> directionality is recorded in the UCD.

But Unicode doesn't even encode whether the language's numeric system is
"big-endian" or "little-endian".

The quantity "ten" is usually written "10"; that is, substr($ten,0,1) ==
1 and substr($ten,1,1) == 0; thus we arrive at 10 == (1*10) + 0;

I see no guarantee in Unicode, nor any information to tell us, whether
there isn't such a language which would represent the quantity ten as a
string satisfying substr($x,0,1) == 0 and substr($x,1,1) == 1. This is
irrespective of the order the characters are rendered to pixels on the
screen.

-- 
Paul "LeoNerd" Evans

leonerd@leonerd.org.uk
ICQ# 4135350       |  Registered Linux# 179460
http://www.leonerd.org.uk/


Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About