develooper Front page | perl.perl5.porters | Postings from November 2012

Re: Comment period extended for Unicode's changing some common charactersfrom Punctuation to Symbol

Thread Previous | Thread Next
From:
Karl Williamson
Date:
November 29, 2012 22:24
Subject:
Re: Comment period extended for Unicode's changing some common charactersfrom Punctuation to Symbol
Message ID:
50B7E095.4000000@khwilliamson.com
On 11/29/2012 10:58 AM, Leon Timmermans wrote:
> On Thu, Nov 29, 2012 at 6:51 PM, David Golden <xdg@xdg.me> wrote:
>> * How can we better document (if we're not) the forward compatibility
>> risks inherent in using Unicode character classes?
>
> It probably should be documented, though I suspect it would easily get
> buried in the pile of Unicode documentation,.
>
>> * How can we let programs introspect the version of Unicode that Perl provides?
>
> That information is currently already available in Module::Corelist.
>
>> * Is it possible to make any of this pluggable, so a program could
>> specify which version of Unicode classes they want to use?
>
> How do you scope that? Specially with things like qr//.
>
> Leon
>

It is not possible to make it pluggable.  However, it is possible to 
recompile Perl with any Unicode version so far.  Thus you can download 
the Unicode character database for any Unicode version, follow the 
instructions in README.perl in lib/unicore, and get your perl to work 
with it.  If there are some decisions you don't like, you could write a 
script that changes the db for those decisions, and apply it to each new 
standard that comes out.

BTW, these particular changes were in blead for a while last summer as 
an experiment, and exactly one module broke.  [[:punct:]] matches both 
the Unicode ASCII-range symbols plus punctuation, so most Perl programs 
would see no difference from these particular changes.

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About