develooper Front page | perl.perl5.porters | Postings from June 2010

Re: PATCH: [perl #58182] partial, "The Unicode Bug". Add unicodesemantics for \s, \w

Thread Previous | Thread Next
karl williamson
June 22, 2010 19:13
Re: PATCH: [perl #58182] partial, "The Unicode Bug". Add unicodesemantics for \s, \w
Message ID:
Paul LeoNerd Evans wrote:
> On Thu, May 20, 2010 at 01:03:36PM +0100, Ben Morrow wrote:
>> Since there aren't any upper-case modifiers yet, it would be possible to
>> introduce the rule 'Upper-case modifiers take a single-character
>> argument' at this point. This would give a syntax like
>>     m/unicode    /Uugx;
>>     m/locale     /Ulgx;
>>     m/traditional/Utgx;
>> which seems at least as clear as three or four random letters that
>> happen to be mutually-exclusive.
> Oh, now I do like that. It has all the neatness of my suggestion, but
> with the added bonus that now we know that capitals take a sub-flag,
> which we should look at the next letter afterwards.
> Perhaps we could establish this as a general precident? I have no idea
> what this means
>  m/foo/AzTlxMvg
> but at least I can parse it as A=z, T=l, x=true, M=v, g=true

I think we should go with Ben's idea.  I had suggested using M for mode, 
other suggestions were L apparently for locale, and I for 
internationalization.  I'm now thinking C for character set.
Cu for Unicode
Cl for current locale
Ct for traditional, or Cm for mixed, which I think is clearer.

Later there could be Ca for ASCII or Cp for Posix.

Thread Previous | Thread Next Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at | Group listing | About