develooper Front page | perl.perl6.internals.unicode | Postings from February 2001

Re: string encoding

Thread Previous | Thread Next
Simon Cozens
February 16, 2001 16:35
Re: string encoding
Message ID:
On Fri, Feb 16, 2001 at 02:25:59PM -0800, Hong Zhang wrote:
> I think you already mixed the codepoint vc character. What you will get is
> 10th codepoint, not 10th character.

I think you're confused. Codepoints *are* characters. Combining characters are
taken care of as per the RFC.

> The UTF-32 has its problems too, such as cache locality, memory footprint,
> encoding conversion.

I'm talking about UTF16. You're talking about UTF32.
Try talking about what I'm talking about.

> I said it is not common case

And I am saying that it is.

> You need to exmine the first two bytes for UTF-16 too, right?


> > UTF16 : s += 2;            : O(1) : Good
> > UTF8  : s += UTF8WIDTH(*s) : O(n) : Bad
> What I don't understand where you really use random access of string?

I have been through this many, many times. I am not going through it

You are mistaken, and explaining our point is not helping.
Let us agree to differ.

"$UNIVERSAL::AUTOLOAD "}; hacker perl another just;

Thread Previous | Thread Next Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at | Group listing | About