develooper Front page | perl.perl5.porters | Postings from June 2013

Re: [perl #113824] Regexp error messages are not UTF8-clean

Thread Previous | Thread Next
From:
Eirik Berg Hanssen
Date:
June 18, 2013 10:36
Subject:
Re: [perl #113824] Regexp error messages are not UTF8-clean
Message ID:
CAHAeAG7KhkoC14Pyv5sWgOKJB45HW+n7JivwR1_Q0BuMMjngvQ@mail.gmail.com
On Tue, Jun 18, 2013 at 11:53 AM, demerphq <demerphq@gmail.com> wrote:

>
> The interface for Perl_croak() and friends do not support UTF8 at all.
> They accept only a char* pointer, and have no facility for a UTF8
> flag.


  Then it can be safely used as neither UTF-8 nor any 8-bit encoding, right?

  Shouldn't they then, to preserve sanity, treat the char* data as binary
and convert all high-bit-set bytes to some ASCII representation (octal
escape sequences or whatever)?

  That would still leak internal representation, but at least it would be
an actual internal representation ...


Eirik

Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About