Re: [perl #69414] Case-insensitive utf8 matching problem

From: Rafael Garcia-Suarez
Date: October 2, 2009 17:14
Subject: Re: [perl #69414] Case-insensitive utf8 matching problem
Message ID: b77c1dce0910020921s5e926080g49440430dd5a1644@mail.gmail.com

2009/10/2 karl williamson <public@khwilliamson.com>:
> FFFE is a 'noncharacter'.  It isn't an 'illegal character'.  It is perfectly
> legal in Unicode to use this code point.  It is, however, illegal to use it
> in "interchange", and hence would never be confused with a BOM from an input
> stream.  I believe the message should be changed to be something more like
> Unicode noncharacter 0xfffe is illegal for interchange
>
> with a fuller explanation in perldiag.pod, and the ability to turn off just
> this warning.

Like this?

=item Unicode non-character %s is illegal for interchange

(W utf8) Certain codepoints, such as U+FFFE and U+FFFF, are defined by the
Unicode standard to be non-characters. Those are legal codepoints, but are
reserved for internal use, so applications shouldn't attempt to exchange
them. If you know what you are doing, you can turn off this warning with
C<no warnings 'utf8';>.
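
For illustration, here is a minimal sketch of how the lexical C<no warnings 'utf8';> switch could be used around code that deliberately writes a non-character. The helper names (emit_noisy, emit_quiet) are hypothetical, and this rests on an assumption: exactly when the warning fires (at chr() time or on output to a :utf8 handle) and its precise wording depend on the perl version, so the sketch only counts warnings rather than matching the message text.

```perl
use strict;
use warnings;

# Collect warnings instead of printing them, so the effect is visible.
my @seen;
local $SIG{__WARN__} = sub { push @seen, $_[0] };

# Hypothetical helper: write one code point to an in-memory UTF-8 handle
# with the utf8 warnings left enabled.
sub emit_noisy {
    my ($cp) = @_;
    open my $fh, '>:utf8', \my $buf or die "open: $!";
    print {$fh} chr($cp);
    close $fh;
    return $buf;
}

# Same operation, but the utf8 warning category is switched off
# lexically, for this sub body only.
sub emit_quiet {
    my ($cp) = @_;
    no warnings 'utf8';
    open my $fh, '>:utf8', \my $buf or die "open: $!";
    print {$fh} chr($cp);
    close $fh;
    return $buf;
}

emit_noisy(0xFFFE);
my $after_noisy = scalar @seen;
emit_quiet(0xFFFE);
my $after_quiet = scalar @seen;

printf "warnings after noisy call: %d, after quiet call: %d\n",
    $after_noisy, $after_quiet;
```

Because the pragma is lexical, only the code inside emit_quiet is silenced; the rest of the program keeps its utf8 warnings. Note that C<no warnings 'utf8';> turns off the whole utf8 category, not just this one message.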
