[ID 20000316.001] UTF8-ness seems strange

Graham Barr
March 16, 2000 02:53
This is a bug report for perl from gbarr@chipper.localdomain,
generated with the help of perlbug 1.27 running under perl v5.6.0.


The setting of the UTF8 bit still seems strange.

Surely if a regexp is executed in a utf8 block then $1 etc.
should be tagged as UTF8. Likewise, if it is executed in a block
which explicitly has use bytes, they should not be.

Also, if split is executed in a utf8 scope, should its results
not be utf8 strings?


use Devel::Peek;

$\ = "\n";
$x = v1.999;
print length $x;
{
 use bytes;
 print length $x;
 ($y) = ($x =~ /(.*)/s);
}

print unpack("H*",$x);

print "Should be UTF8 - OK";
print Dump($x);

print "Should not be UTF8 - OK";
print Dump($y);

{
 use utf8;
 print "Should be UTF8 - BAD";
 print Dump(($x =~ /(.)/gs)[1]);
}
{
 use utf8;
 print "Should be UTF8 - BAD";
 print Dump( (split(//,$y))[1]);
}
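
For comparison, here is a rough sketch of the same checks that prints the state of the UTF8 flag directly instead of relying on reading Devel::Peek dumps. It is not part of the original report. The helper flag_of is a hypothetical name, and utf8::is_utf8 only exists in perl 5.8.1 and later, so this will not run on the 5.6.0 the report was filed against. The bare blocks around each pragma are an assumption about the intended scoping. No expected output is given, since the result is exactly what is in question and may differ between perl versions.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the report): name the state of the
# UTF8 flag on a scalar. Assumes perl 5.8.1+, where utf8::is_utf8 exists.
sub flag_of {
    my ($sv) = @_;
    return utf8::is_utf8($sv) ? "UTF8" : "bytes";
}

my $x = v1.999;    # same v-string as in the report
my $y;
{
    use bytes;     # assumed scope: capture while byte semantics are on
    ($y) = ($x =~ /(.*)/s);
}

my %got = (
    'source $x'              => flag_of($x),
    'capture under bytes $y' => flag_of($y),
);
{
    use utf8;      # assumed scope: the two cases the report marks BAD
    $got{'capture under utf8'} = flag_of(($x =~ /(.)/gs)[1]);
    $got{'split under utf8'}   = flag_of((split //, $y)[1]);
}

# One line per case, e.g. "source $x: UTF8"
print "$_: $got{$_}\n" for sort keys %got;
```

Each line names a case from the script above and whether the resulting scalar carries the flag, which makes runs on different perl versions easy to diff.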

