develooper Front page | perl.perl5.porters | Postings from October 2009

Re: The special charclasses \s \w and \d.

Thread Previous
From:
Glenn Linderman
Date:
October 19, 2009 15:42
Subject:
Re: The special charclasses \s \w and \d.
Message ID:
4ADCEB35.7040504@nevcal.com
On approximately 10/18/2009 12:53 AM, came the following characters from 
the keyboard of demerphq:
> 2009/10/18 karl williamson <public@khwilliamson.com>:
>   
>> demerphq wrote:
>>     
>>> I have basically fixed the failing new tests.
>>>
>>> I have not uploaded the fix because these fixes break other tests, and
>>> so far I havent been able to review them all for correctness and fix
>>> them or their underlying breakage.
>>>
>>> Also, I start to doubt the tenability of this path. Making \d mean
>>> [0-9] seems to me to be clear (please speak up if anyone disagree).
>>> Making \w have the strict [A-Za-z0-9_] behavior by default is looking
>>> less sensible than it seemed at first.
>>>       
>> Please elaborate, cause it still seems very sensible to me.
>>     
>
> The branch i pushed fully implements the agreed apon behaviour.
>
> Look at the number of failing tests, and tests I had to patch.
>
> That gives an idea of the impact on our community, in particular the
> ones who don't use English as a native language. I think i
> unrealistically underestimated how much impact this would have.

Having a working Unicode semantics, and an understandable ASCII 
semantics, and not cluttering the code to support 4 different cases, 
still makes it sound worthwhile to me.

But I speak English, although I do write programs that process 
non-English text.

Thread Previous


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About