On Thu, 9 Dec 2010 14:04:04 -0800 Jonathan Pool <pool@utilika.org> wrote: > > use encoding 'UTF-8'; > > $x = "\xC2\x80"; > > > > (Except perhaps in how the UTF8 flag is set, but that's not suppose to make a difference.) > > > > - Eric > > Could the latter representation (\xc2\x80) appear in a regular-expression character class, too? Could with perl 5.8.0, 5.8.1, 5.8.3, 5.8.8. Cannot with perl 5.8.9, 5.10.0, 5.10.1. (I didn't run with other versions.) #!perl use strict; use warnings; use charnames ':full'; use encoding 'UTF-8'; print "perl $]\n"; print "a\N{NO-BREAK SPACE}z" =~ /a\xC2\xA0z/ ? "ok\n" : "not ok\n"; __END__ perl 5.008 ok perl 5.008001 ok perl 5.008003 ok perl 5.008008 ok perl 5.008009 not ok perl 5.010000 not ok perl 5.010001 not okThread Previous | Thread Next