develooper Front page | perl.perl5.porters | Postings from November 2010

[perl #71734] regex match of Above Latin1 character with ASCII or Latin1 fold doesnt work if the fold is in a character class

From:
Father Chrysostomos via RT
Date:
November 22, 2010 13:34
Subject:
[perl #71734] regex match of Above Latin1 character with ASCII or Latin1 fold doesnt work if the fold is in a character class
Message ID:
rt-3.6.HEAD-13564-1290461652-1961.71734-15-0@perl.org
On Mon Nov 15 13:15:59 2010, public@khwilliamson.com wrote:
> The attached lengthy series of patches fix this bug among other things. 
>   It is a general cleanup of the bracketed character class 
> implementation.  I have more things I'm doing with that now, relatively 
> minor, but this is a good patch point.
> 
> The patches are also available at:
https://github.com/khwilliamson/perl.git
> branch folding
> 
> These patches also solve the Unicode bug for character classes, except 
> for the multi-char folding German sharp s, and all such multi-char folds 
> have a number of bugs.
> 
> The series also includes a fix for converting a character from utf8 to 
> its ordinal.  These would show up as bugs only on an EBCDIC machine.
> 
> The patches require [perl #79088] PATCH: free up bit in regex ANYOF node 
> for time/space tradeoff, which are included again here.  This may be 
> able to be reverted eventually.

I have just applied these, except for the first hunk in the perldelta
patch, which needs to be rewritten.




nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About