develooper Front page | perl.perl5.porters | Postings from June 2010

Re: PATCH: [perl #58182] partial, "The Unicode Bug". Add unicodesemantics for \s, \w

Thread Previous | Thread Next
From:
karl williamson
Date:
June 22, 2010 19:13
Subject:
Re: PATCH: [perl #58182] partial, "The Unicode Bug". Add unicodesemantics for \s, \w
Message ID:
4C216D9C.4070403@khwilliamson.com
Paul LeoNerd Evans wrote:
> On Thu, May 20, 2010 at 01:03:36PM +0100, Ben Morrow wrote:
>> Since there aren't any upper-case modifiers yet, it would be possible to
>> introduce the rule 'Upper-case modifiers take a single-character
>> argument' at this point. This would give a syntax like
>>
>>     m/unicode    /Uugx;
>>     m/locale     /Ulgx;
>>     m/traditional/Utgx;
>>
>> which seems at least as clear as three or four random letters that
>> happen to be mutually-exclusive.
> 
> Oh, now I do like that. It has all the neatness of my suggestion, but
> with the added bonus that now we know that capitals take a sub-flag,
> which we should look at the next letter afterwards.
> 
> Perhaps we could establish this as a general precident? I have no idea
> what this means
> 
>  m/foo/AzTlxMvg
> 
> but at least I can parse it as A=z, T=l, x=true, M=v, g=true
> 

I think we should go with Ben's idea.  I had suggested using M for mode, 
other suggestions were L apparently for locale, and I for 
internationalization.  I'm now thinking C for character set.
Cu for Unicode
Cl for current locale
Ct for traditional, or Cm for mixed, which I think is clearer.

Later there could be Ca for ASCII or Cp for Posix.

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About