develooper Front page | perl.perl5.porters | Postings from April 2011

Re: Unicode regex negated case-insensitivity in 5.14.0-RC1

Thread Previous | Thread Next
From:
Tom Christiansen
Date:
April 28, 2011 16:40
Subject:
Re: Unicode regex negated case-insensitivity in 5.14.0-RC1
Message ID:
3354.1304033992@chthon
Karl Williamson <public@khwilliamson.com> wrote
   on Thu, 28 Apr 2011 17:32:32 MDT: 

> And fixing the .pod's

:)

I have to think about this a bit.  
Complementing multichar folds is a weird area.

I think Perl doing full case-mapping is a feature.  And
sure it will still do it with functions.

I know that Java only does full CM on its equiv of lc/uc/etc.
It does only simple CM with its regexes.  

If Perl doesn't do full case mapping, does that mean we won't
be able to match "\xDF" =~ /ss/i and vice versa anymore?

I notice the pattern was forbidding the 128-255 code points,
which also pulled ANGSTROM SIGN and such.  But that one is 
ok since we have no builtin NFD matching to wrack our brains over.

I think I agree this should be a release blocker until we've
thought it through.

--tom

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About