develooper Front page | perl.libwww | Postings from July 2001

HTML::Parser - Extracting out the text from <body>

Thread Next
Bill Moseley
July 2, 2001 11:17
HTML::Parser - Extracting out the text from <body>
Message ID:

I need to extract text out of html docs to do search word highlighting in
context.  (You know, like google's output.)

So, is there a "fastest" method to do this -- better than just using
HTML::Parser, setting a flag when I catch <body> and then storing the text?
(short of pre-processing the html documents?)


Bill Moseley

Thread Next Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at | Group listing | About