perl.perl5.porters | Postings from December 2009

Re: RFC: regex /i folding always use utf8?

karl williamson
December 7, 2009 08:17
Rafael Garcia-Suarez wrote:
> 2009/12/7 karl williamson <>:
>> I have been trying to resolve the discrepancies in semantics that arise
>> depending on whether or not a scalar is stored in utf8.
>> To review, there are 3 major and 1 very minor known areas where this occurs.
>> Blead already contains a fix for one of the major areas: case changing via
>> uc() and its cousins.
>> I am about to submit a patch that solves it for another of the major areas:
>> regex matching (non-folded).  And I'm close to having a patch for the minor
>> area.
>> If those patches are accepted, just one area will remain, and that is
>> qr/.../i.  I think it would be a very good thing if the whole problem could
>> be solved for 5.12.
> I concur; that would improve release consistency.
>> I want to throw out for comment the possibility that this could be solved
>> trivially by always using utf8 for case insensitive matching.
> Unless "use legacy" is activated?

You're right.  I hadn't thought of that.
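
For readers unfamiliar with the discrepancy under discussion, here is a minimal sketch. It is an illustration, not code from the thread: the helper name matches_folded is hypothetical, and it assumes a perl in which the default (non-Unicode) rules apply to strings stored as native 8-bit bytes. The same character is tested against /\xC0/i twice, once stored as a single native byte and once after utf8::upgrade().

```perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): case-insensitive match
# against U+00C0, LATIN CAPITAL LETTER A WITH GRAVE.
sub matches_folded {
    my ($str) = @_;
    return $str =~ /\xC0/i ? 1 : 0;
}

# U+00E0, LATIN SMALL LETTER A WITH GRAVE, stored as one native byte.
my $native = "\xE0";

# The same character, but with its internal storage switched to utf8.
my $upgraded = "\xE0";
utf8::upgrade($upgraded);

# The two scalars hold the same string as far as eq is concerned.
printf "strings equal:     %s\n", ($native eq $upgraded ? "yes" : "no");

# Only the internal representation differs between these two calls.
printf "native   /i match: %d\n", matches_folded($native);
printf "upgraded /i match: %d\n", matches_folded($upgraded);
```

On a perl that behaves as described in this thread, the two match results can differ even though the strings compare equal. That is the inconsistency that always folding under utf8 rules would remove.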

