
qr/\b{wb}/ is now tailored for perl expectations

From: Karl Williamson
Date: January 8, 2016 21:22
Subject: qr/\b{wb}/ is now tailored for perl expectations
Message ID: 5690288C.5040701@khwilliamson.com

  commit f1f6961f5a6fd77a3e3c36f242f1b72ce5dfe205
  Author: Karl Williamson <khw@cpan.org>
  Date:   Tue Jan 5 16:12:55 2016 -0700

      Tailor \b{wb} for Perl

      The Unicode \b{wb} matches the boundary between space characters in a
      span of them.  This is opposite of what \b does, and is counterintuitive
      to Perl expectations.  This commit tailors \b{wb} to not split up spans
      of white space.

      I have submitted a request to Unicode to re-examine their algorithm, and
      this has been assigned to a subcommittee to look at, but the result
      won't be available until after 5.24 is done.  In any event, Unicode
      encourages tailoring for local conditions.


However, a trailing horizontal white-space character in a span that is
immediately followed by something like a combining mark is attached to
that mark, and split off from the rest of the span.  This makes the mark
modify that single space instead of the whole span.
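
A small sketch of how the two behaviors described above might be observed, assuming a perl from the 5.24 series or later. The input strings and variable names are made up for illustration; they are not taken from the commit or its tests.

```perl
use v5.24;      # assumption: the tailoring is present from the 5.24 series on
use warnings;

# Illustrative input: three spaces between two words.
my $text   = "foo   bar";
my @pieces = split /\b{wb}/, $text;

# With the tailoring, the run of spaces is expected to stay in one piece,
# i.e. this should print: [foo]|[   ]|[bar]
say join '|', map { "[$_]" } @pieces;

# Illustrative input: two spaces, then U+0301 COMBINING ACUTE ACCENT.
my $with_mark   = "a  \x{301}b";
my @mark_pieces = split /\b{wb}/, $with_mark;

# Print each piece as code points so the result does not depend on how a
# terminal renders a combining mark.  Per the description above, the space
# directly before the mark should be grouped with the mark rather than
# with the preceding space.
for my $piece (@mark_pieces) {
    say join ' ', map { sprintf('U+%04X', ord $_) } split //, $piece;
}
```

Since \b{wb} is zero-width, joining the pieces back together always gives the original string, which makes split a convenient way to see where the boundaries fall.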


