On 2012-04-14 09:26, Somu wrote:

> I was trying to strip off all html tags and the special characters from a
> html file using regex.

Alternative:

    lynx -stdin -dump < in.html > out.txt

-- 
Ruud
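
If lynx is not installed, the same idea (let an HTML parser do the work instead of a regex) can be sketched with Python's standard library. This is not what lynx does internally and its output will not match `lynx -dump` formatting; it is only a rough illustration. The names `TextExtractor` and `html_to_text` are made up for this example.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text nodes and ignore the contents of <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; into "&"
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Join chunks with a space, then collapse all runs of whitespace.
        # Caveat: this also puts a space where a tag splits a word.
        return " ".join(" ".join(self._chunks).split())


def html_to_text(html):
    """Return the visible text of an HTML string (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

Calling `html_to_text(open("in.html").read())` would give the plain text as a single whitespace-normalized string. Entities are decoded rather than removed, so any further filtering of "special characters" would still have to be done on the result.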