It might not be a bad idea to metion HTML::FormatText as part of perlfaq9's "How do i remove HTML from a string?" --- pod/perlfaq9.pod 1999/11/10 20:07:42 +++ pod/perlfaq9.pod 1999/11/10 21:32:50 @@ -77,7 +77,9 @@ =head2 How do I remove HTML from a string? The most correct way (albeit not the fastest) is to use HTML::Parser -from CPAN (part of the HTML-Tree package on CPAN). +from CPAN (part of the HTML-Tree package on CPAN). Another correct +way is to use HTML::FormatText which not only removes HTML but also +attempts to do a little simple formatting of the resulting plain text. Many folks attempt a simple-minded regular expression approach, like C<s/E<lt>.*?E<gt>//g>, but that fails in many cases because the tags