
RE: Transcoding patch

From: Dan Sugalski
Date: October 9, 2001 07:41
Subject: RE: Transcoding patch
Message ID: 5.1.0.14.2.20011009103404.02dfaf90@pop.sidhe.org

At 07:03 PM 10/8/2001 -0500, Gibbs Tanton - tgibbs wrote:
>This looks good.
>
>Also, WRT the utf8_t, utf16_t, and utf32_t can we not just use utf32_t and
>then mask off the lower 8 or 16 bits?  We can still have utf8_t be defined
>as char to allow sizeof to work right and we can do sizeof(utf8_t)*2 to get
>the utf16_t's size.

UTF-8 and UTF-16 are both variable-length encodings, chosen for space
reasons. There's not much reason to space-compact something and then expand
the heck out of it. On the other hand, I'd really, *really* rather not have
Unicode constants in anything other than UTF-32, so I'd just as soon we
chopped the UTF-8 and UTF-16 constant support out of this.
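
To make the variable-length point concrete, here is a minimal sketch in C of
encoding a single code point as UTF-8. The helper name utf8_encode is
hypothetical and is not taken from the patch or from the Parrot source; it
only illustrates that the UTF-8 form of a character is one to four bytes and
is generally not the low 8 bits of its UTF-32 value.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (an assumption for illustration, not from the patch):
 * encode one Unicode code point as UTF-8 into out[0..3].
 * Returns the number of bytes written (1 to 4), or 0 if cp is not a
 * valid Unicode scalar value. */
size_t utf8_encode(uint32_t cp, unsigned char out[4])
{
    assert(out != NULL);

    if (cp < 0x80) {                       /* 7 bits: one byte, same as ASCII */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {                      /* 11 bits: two bytes */
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {                    /* 16 bits: three bytes */
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return 0;                      /* surrogates are not scalar values */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {                  /* 21 bits: four bytes */
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0;                              /* beyond the Unicode range */
}
```

For example, U+00E9 has the UTF-32 value 0x000000E9, but its UTF-8 form is
the two bytes 0xC3 0xA9, so masking off the low 8 bits does not produce it.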

A should be the prefix for US-ASCII characters.
U should be the prefix for Unicode characters.
N should be the prefix for the native character set (and the default).

Beyond that I'm not sure what, if anything, we should accommodate in the 
assembler.

					Dan

--------------------------------------"it's like this"-------------------
Dan Sugalski                          even samurai
dan@sidhe.org                         have teddy bears and even
                                      teddy bears get drunk

