
Re: [ID 20001114.001] use utf8;use charnames; is incorrect for \x{80}-\x{FF}

From: Simon Cozens
Date: November 14, 2000 09:20
Subject: Re: [ID 20001114.001] use utf8;use charnames; is incorrect for \x{80}-\x{FF}
Message ID: 20001114172019.B26682@pembro4.pmb.ox.ac.uk

On Wed, Nov 15, 2000 at 03:06:19AM +1300, Andrew McNaughton wrote:
> On Tue, 14 Nov 2000, Nick Ing-Simmons wrote:
> > Andrew McNaughton <andrew@tki.org.nz> writes:
> > >use utf8;
> > >use charnames ':full';
> > >$text .= "\N{LATIN CAPITAL LETTER A WITH DIAERESIS}";
> > >
> > >
> > >This fails because of the final line of &charnames::charnames.  It returns an
> > >8 bit value.
> > 
> > It is an 8-bit value - that is the UNICODE codepoint is < 256.
> 
> The unicode codepoint may be less than 256, but in utf8 2 byte characters
> start from codepoint 128, not 256.

Why do you think that Perl should encode this in UTF8?
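
For readers following the disagreement, the two facts being argued over can be checked side by side. The sketch below is only an illustration, not code from the thread: it assumes a Perl recent enough to ship the core Encode module (which postdates this posting), and the variable names are made up. It shows that U+00C4 is a code point below 256, yet its UTF-8 encoding is two bytes, while a Latin-1 encoding of the same character is one byte.

```perl
use strict;
use warnings;
use Encode qw(encode);   # assumption: core Encode module, not available in 2000

# U+00C4 LATIN CAPITAL LETTER A WITH DIAERESIS, the character in the report.
my $char = chr(0xC4);

# As a code point it fits in 8 bits (it is less than 256).
my $codepoint = ord($char);

# Encoded as UTF-8 it needs two bytes, because UTF-8 uses multi-byte
# sequences for every code point from 128 upward.
my $utf8_bytes = encode('UTF-8', $char);

# Encoded as ISO-8859-1 (Latin-1) the same character is a single byte.
my $latin1_bytes = encode('ISO-8859-1', $char);

printf "code point:    U+%04X (%d)\n", $codepoint, $codepoint;
printf "UTF-8 bytes:   %s (%d bytes)\n",
    unpack('H*', $utf8_bytes), length $utf8_bytes;
printf "Latin-1 bytes: %s (%d byte)\n",
    unpack('H*', $latin1_bytes), length $latin1_bytes;
```

Whether the string returned by \N{...} should be stored internally in the two-byte form is exactly the question posed above; the sketch only shows that both descriptions of the character are accurate.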

-- 
	"The elder gods went to Suggoth and all I got was this lousy T-shirt."
