develooper Front page | perl.perl5.porters | Postings from December 2009

Re: PATCH: partial [perl #58182]: regex case-sensitive matching nowutf8ness independent

Thread Previous | Thread Next
December 10, 2009 10:57
Re: PATCH: partial [perl #58182]: regex case-sensitive matching nowutf8ness independent
Message ID:
Juerd Waalboer wrote:
> karl williamson skribis 2009-12-09 12:11 (-0700):
>> Since Yves is incommunicado, I took what he had done before Larry's veto  
>> and extended and modified it, adding an intermediate way.  What that  
>> means is that anything that looks like[[:xxx:]] will match only in the  
>> ASCII range, or in the current locale, if set.  I never heard any  
>> controversy about that part of the proposal, and it makes sense to me  
>> that a Posix construct should act like the Posix definition says to.
> These "posix" constructs have for a long time been documented as
> *equivalent* to \d, \s and \w, with two remarks: [[:space:]] also
> includes \cK and [[:word:]] doesn't even exist in POSIX.
> Changing them is as bad as changing the metacharacters. Changing them to
> break the equivalency might even be worse.
> Also, note that perlre calls this "POSIX character class **syntax**"
> (emphasis mine).
> An even stronger argument is that perlre defines equivalence with
> \p{...}, and explicitly mentions that these are Unicode constructs.
Could we then make [:Unicode property name:] map to \p{Unicode property 
name} and not just limit ourselves to the names list?

This email has been scanned by Netintelligence

Thread Previous | Thread Next Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at | Group listing | About