develooper Front page | perl.perl5.porters | Postings from January 2010

Re: Use ICU for Unicode in Perl?

Thread Previous | Thread Next
From:
Tom Christiansen
Date:
January 28, 2010 21:32
Subject:
Re: Use ICU for Unicode in Perl?
Message ID:
18331.1264743079@chthon
> I found only a couple of references to the ICU Unicode C language 
> library in the p5p archives.  Is there a reason not to use some of this 
> open source code when it suits our purposes?

Seems like it's safe:

  * http://userguide.icu-project.org/icufaq#TOC-How-is-the-ICU-licensed-

    How is the ICU licensed?

    The ICU projects since ICU 1.8.1 and ICU4J 1.3.1 are covered
    by the ICU license , a simple, permissive non-copyleft free
    software license, compatible with the GNU GPL.   The ICU
    license is identical to the version of the X license that was
    formerly available at http://www.x.org/Downloads_terms.html .
    (This site no longer exists, but can still be retrieved
    through internet archive services)

    The ICU license is intended to allow ICU to be included
    both in free software projects and in proprietary or
    commercial products.

They use Perl string escapes and regex notation:

  * http://userguide.icu-project.org/strings/unicodeset#TOC-UnicodeSet-Patterns
  * http://userguide.icu-project.org/strings/regexp

Plus, they've added a /w flag to control \b behaviour, 
a frequent and subtle pain-in-the-posterior.

ICU seems worth looking over at least, even if it's just to learn
things, not to use things.  These also caught my interest:

  * http://userguide.icu-project.org/boundaryanalysis
  * http://userguide.icu-project.org/transforms/general
  * http://userguide.icu-project.org/collation/concepts
  * http://userguide.icu-project.org/collation/customization

Thanks, Karl.

--tom

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About