
[DOCPATCH 5.005_62 perlfaq9.pod] Mention HTML::FormatText

Michael G Schwern
November 10, 1999 14:21
It might not be a bad idea to mention HTML::FormatText as part of
perlfaq9's "How do I remove HTML from a string?"

--- pod/perlfaq9.pod        1999/11/10 20:07:42
+++ pod/perlfaq9.pod        1999/11/10 21:32:50
@@ -77,7 +77,9 @@
 =head2 How do I remove HTML from a string?

 The most correct way (albeit not the fastest) is to use HTML::Parser
-from CPAN (part of the HTML-Tree package on CPAN).
+from CPAN (part of the HTML-Tree package on CPAN).  Another correct
+way is to use HTML::FormatText which not only removes HTML but also
+attempts to do a little simple formatting of the resulting plain text.

 Many folks attempt a simple-minded regular expression approach, like
 C<s/E<lt>.*?E<gt>//g>, but that fails in many cases because the tags
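For anyone who wants to see the difference the patch is talking about, here is a rough sketch rather than anything taken from the FAQ itself. It assumes HTML::TreeBuilder and HTML::FormatText are installed from CPAN. The sample string and the margin values are made up for illustration.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::TreeBuilder;   # CPAN; builds the parse tree the formatter walks
use HTML::FormatText;    # CPAN; renders that tree as plain text

# Hypothetical input: the alt attribute contains a literal '>'.
my $html = '<p>Hello <img src="x.png" alt="a > b"> <b>world</b></p>';

# Simple-minded approach: the match ends at the first '>' it finds,
# which here is the one inside the quoted attribute, so a piece of
# the tag is left behind in the output.
(my $naive = $html) =~ s/<.*?>//g;
print "regex:     $naive\n";

# Parser-based approach: parse into a tree, then format the tree.
my $tree = HTML::TreeBuilder->new;
$tree->parse($html);
$tree->eof;

# Margin values are arbitrary choices for this sketch.
my $formatter = HTML::FormatText->new(leftmargin => 0, rightmargin => 50);
my $text = $formatter->format($tree);
$tree->delete;           # free the tree explicitly when done

print "formatter: $text";
```

The regex result still carries the tail end of the img tag, while the formatter output contains no markup. How the formatter renders the image itself (alt text or a placeholder) depends on the module version, so don't rely on that part.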