develooper Front page | perl.perl5.porters | Postings from December 2013

[perl #120790] Unicode::UCD::charscript fails to identify Han ideograph

Thread Previous
From:
Mark-Jason Dominus
Date:
December 14, 2013 15:50
Subject:
[perl #120790] Unicode::UCD::charscript fails to identify Han ideograph
Message ID:
rt-4.0.18-23414-1387036194-1123.120790-75-0@perl.org
# New Ticket Created by  Mark-Jason Dominus 
# Please include the string:  [perl #120790]
# in the subject line of all future correspondence about this issue. 
# <URL: https://rt.perl.org/Ticket/Display.html?id=120790 >



This is a bug report for perl from mjd@plover.com,
generated with the help of perlbug 1.39 running under perl 5.14.2.


-----------------------------------------------------------------
[Please describe your issue here]

This program:

     perl -MUnicode::UCD=charscript -wle 'print charscript(chr(0x6237)) // "undef"'

should print "Han", but instead it prints "undef".  The same behavior
occurs on two different machines, with 5.18.1 and 5.14.2.

The applicable line of the Unicode data file
http://www.unicode.org/Public/UCD/latest/ucd/Scripts.txt is:

      4E00..9FCC    ; Han # Lo [20941] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FCC

[Please do not change anything below this line]
-----------------------------------------------------------------
---
Flags:
    category=library
    severity=medium
    module=Unicode::UCD
---
Site configuration information for perl 5.18.1:

Configured by mjd at Tue Oct  8 12:58:09 EDT 2013.

Summary of my perl5 (revision 5 version 18 subversion 1) configuration:
   
  Platform:
    osname=linux, osvers=3.2.0-54-generic, archname=x86_64-linux
    uname='linux ortolan 3.2.0-54-generic #82-ubuntu smp tue sep 10 20:08:42 utc 2013 x86_64 x86_64 x86_64 gnulinux '
    config_args='-des -Dinc_version_list=none'
    hint=recommended, useposix=true, d_sigaction=define
    useithreads=undef, usemultiplicity=undef
    useperlio=define, d_sfio=undef, uselargefiles=define, usesocks=undef
    use64bitint=define, use64bitall=define, uselongdouble=undef
    usemymalloc=n, bincompat5005=undef
  Compiler:
    cc='cc', ccflags ='-fno-strict-aliasing -pipe -fstack-protector -I/usr/local/include -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64',
    optimize='-O2',
    cppflags='-fno-strict-aliasing -pipe -fstack-protector -I/usr/local/include'
    ccversion='', gccversion='4.6.3', gccosandvers=''
    intsize=4, longsize=8, ptrsize=8, doublesize=8, byteorder=12345678
    d_longlong=define, longlongsize=8, d_longdbl=define, longdblsize=16
    ivtype='long', ivsize=8, nvtype='double', nvsize=8, Off_t='off_t', lseeksize=8
    alignbytes=8, prototype=define
  Linker and Libraries:
    ld='cc', ldflags =' -fstack-protector -L/usr/local/lib'
    libpth=/usr/local/lib /lib/x86_64-linux-gnu /lib/../lib /usr/lib/x86_64-linux-gnu /usr/lib/../lib /lib /usr/lib
    libs=-lnsl -ldl -lm -lcrypt -lutil -lc
    perllibs=-lnsl -ldl -lm -lcrypt -lutil -lc
    libc=, so=so, useshrplib=false, libperl=libperl.a
    gnulibc_version='2.15'
  Dynamic Linking:
    dlsrc=dl_dlopen.xs, dlext=so, d_dlsymun=undef, ccdlflags='-Wl,-E'
    cccdlflags='-fPIC', lddlflags='-shared -O2 -L/usr/local/lib -fstack-protector'

Locally applied patches:
    

---
@INC for perl 5.18.1:
    /usr/local/lib/perl5/site_perl/5.18.1/x86_64-linux
    /usr/local/lib/perl5/site_perl/5.18.1
    /usr/local/lib/perl5/5.18.1/x86_64-linux
    /usr/local/lib/perl5/5.18.1
    .

---
Environment for perl 5.18.1:
    HOME=/home/mjd
    LANG=en_US.UTF-8
    LANGUAGE=
    LD_LIBRARY_PATH (unset)
    LOGDIR (unset)
    PATH=/home/mjd/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games
    PERL_BADLANG (unset)
    SHELL=/bin/bash


Thread Previous


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About