develooper Front page | perl.perl5.porters | Postings from January 2011

Re: refined :utf8 I/O layers proposal

Thread Previous
From:
Zefram
Date:
January 4, 2011 05:51
Subject:
Re: refined :utf8 I/O layers proposal
Message ID:
20110104135127.GP25547@lake.fysh.org
karl williamson wrote:
>I think we are using the term "standard UTF-8" differently.

I'm using it to refer to UTF-8 as originally specified.  It corresponds to
the older concept of ISO-10646 as a 31-bit charset, back when Unicode was
only 16-bit.  Later, when Unicode realised that 16 bits wasn't enough,
they invented UTF-16 and compromised on the 20.09 bits that UTF-16
reaches.  So nowadays some statements of the UTF-8 encoding describe
only how to apply it to the 20.09-bit range, which is what you're
picking up on.  I view the 20.09-bit limit as a feature of Unicode,
not of the encoding.

-zefram

Thread Previous


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About