On Mon, Feb 19, 2001 at 10:25:03PM -0500, Andrew Pimlott wrote: > I'm not familiar with the complete semantics of the one we've got. > I really need them. Yeah, I suppose that if you're going to diss the current implementation, you really ought to know what it is. Ill-informed debate helps nobody. > > I wonder if this would work: > > > > sub to_utf8 { > > use bytes > > return $_[0] > > } > > I saw Jarkko's reply, but regardless of what he said, can you > explain what you want that to do if the internal representation is > not (at the moment) UTF-8? OK, the subroutine I gave wasn't correct (I only said I wondered if it would work) but the principle is there: if you feed it UTF8 encoded data, you get a byte string back. If you feed it non-UTF8-encoded data, you get a byte string back. Which is What You Meant. How nice. > that you haven't thought through. This isn't going to endear you, you realise? I've thought this through enough to produce the code to get the current Unicode model working, and working very well. How much have *you* thought it through? -- Morton's Law: If rats are experimented upon, they will develop cancer.Thread Previous | Thread Next