develooper Front page | perl.perl4lib | Postings from January 2009

Re: How to convert from ANSEL/MARC-8 to UTF-8?

Thread Previous | Thread Next
Galen Charlton
January 7, 2009 08:47
Re: How to convert from ANSEL/MARC-8 to UTF-8?
Message ID:

On Wed, Jan 7, 2009 at 11:42 AM, Michael Lackhoff
<> wrote:
> diakritics + base char to the combined character. So I still have two
> characters for e.g. the
> German umlauts. This might be correct UTF-8 but is not useable to
> present in (X)HTML.
> Is there any other option short of  doing it by hand with lots of s///
> for at least the most common
> combinations?

You can use NFC() from Unicode::Normalize to do this (after using
MARC::Charset to do the conversion to UTF-8).


Galen Charlton
VP, Research & Development, LibLime
p: 1-888-564-2457 x709
skype: gmcharlt

Thread Previous | Thread Next Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at | Group listing | About