develooper Front page | perl.perl5.porters | Postings from April 2011

Re: pity about /ii

Thread Previous | Thread Next
From:
David Cantrell
Date:
April 4, 2011 03:26
Subject:
Re: pity about /ii
Message ID:
20110404102629.GA388@bytemark.barnyard.co.uk
On Sat, Apr 02, 2011 at 12:02:38AM -0600, Tom Christiansen wrote:

> I'm looking for a more user-friendly form of case-insensitivity, one
> that relates to UCA collation strengths...
>      
> Things like these, for example, are all D:
>      
>     D   00044 GC=Lu LATIN CAPITAL LETTER D
>     ???  0FF24 GC=Lu FULLWIDTH LATIN CAPITAL LETTER D
>     ???   0216E GC=Nl ROMAN NUMERAL FIVE HUNDRED

I'm dubious about sorting 500 the same as D.  After all, if you do
that, you have to sort 50 the same as L, and so either sort 500 before
50, or L before D.

>     ???   024B9 GC=So CIRCLED LATIN CAPITAL LETTER D

It is, of course, silly to treat this slightly different representation
of the letter D as being a different character.  This is like treating
D in different fonts as being different characters.

UBEEFCAFE UNICODE CONSORTIUM LEAPING WITH GAY ABANDON OVER A SHARK

-- 
David Cantrell | Bourgeois reactionary pig

In this episode, R2 and Luke weld the doors shut on their X-Wing,
and Chewbacca discovers that his Ewok girlfriend is really just a
Womble with its nose chopped off.

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About