
Re: RFC: regex /i folding always use utf8?

From: karl williamson
Date: December 7, 2009 08:17
Subject: Re: RFC: regex /i folding always use utf8?
Message ID: 4B1D2A8A.409@khwilliamson.com

Rafael Garcia-Suarez wrote:
> 2009/12/7 karl williamson <public@khwilliamson.com>:
>> I have been trying to resolve the discrepancies in semantics that arise
>> depending on whether or not a scalar is stored in utf8.
>>
>> To review, there are 3 major and 1 very minor known areas where this occurs.
>>  Blead already contains a fix for one of the major areas: case changing via
>> uc() and its cousins.
>>
>> I am about to submit a patch that solves it for another of the major areas:
>> regex matching (non-folded).  And I'm close to having a patch for the minor
>> area.
>>
>> If those patches are accepted, only one area will remain, and that is
>> qr/.../i.  I think it would be a very good thing if the whole problem could
>> be solved for 5.12.
> 
> I concur; that would improve release consistency.
> 
>> I want to throw out for comment the possibility that this could be solved
>> trivially by always using utf8 for case insensitive matching.
> 
> Unless "use legacy" is activated?

You're right.  I hadn't thought of that.

>[snip]

