develooper Front page | perl.perl5.porters | Postings from February 2001

Re: The State of The Unicode

Thread Previous | Thread Next
From:
Simon Cozens
Date:
February 19, 2001 19:43
Subject:
Re: The State of The Unicode
Message ID:
20010220034314.A2327@pembro26.pmb.ox.ac.uk
On Mon, Feb 19, 2001 at 10:25:03PM -0500, Andrew Pimlott wrote:
> I'm not familiar with the complete semantics of the one we've got.
> I really need them.

Yeah, I suppose that if you're going to diss the current implementation, you
really ought to know what it is. Ill-informed debate helps nobody.

> > I wonder if this would work:
> > 
> >     sub to_utf8 {
> >         use bytes
> >         return $_[0]
> >     }
> 
> I saw Jarkko's reply, but regardless of what he said, can you
> explain what you want that to do if the internal representation is
> not (at the moment) UTF-8?

OK, the subroutine I gave wasn't correct (I only said I wondered if it would
work) but the principle is there: if you feed it UTF8 encoded data, you get a
byte string back. If you feed it non-UTF8-encoded data, you get a byte string
back. Which is What You Meant. How nice.

> that you haven't thought through.

This isn't going to endear you, you realise? I've thought this through
enough to produce the code to get the current Unicode model working, and
working very well. How much have *you* thought it through?

-- 
Morton's Law:
	If rats are experimented upon, they will develop cancer.

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About