develooper Front page | perl.libwww | Postings from January 2001

HTML::Parser removing &nbsp

Thread Next
From:
John Aughey
Date:
January 11, 2001 10:12
Subject:
HTML::Parser removing &nbsp
Message ID:
Pine.SOL.3.96.1010111114352.22749A-100000@ritz.cec.wustl.edu
I'm using HTML::Parser to process an HTML file to re-write URL's and such.
I've discovered that it seems to be changing &nbsp to a space character
instead of passing the actual "&nbsp" text.  It also appears to be
escaping non-printable characters too.

Can I turn this feature off?  And if I cannot, what would be the best way
to parse the HTML so I can re-write selected tags.

Thank you
John Aughey



Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About