See `perldoc -q remove.HTML`:

<quote>
Here's one "simple-minded" approach, that works for most files:

    #!/usr/bin/perl -p0777
    s/<(?:[^>'"]*|(['"]).*?\1)*>//gs
</quote>

That regex has a ** structure (the starred [^>'"]* sits inside a group
that is itself starred), which can make it very slow, for instance on an
"incomplete" string like

    qq{> <META http-equiv=3DContent-Type content=3D"text/html; =\n>}

where the double quote is never closed.

So maybe we should put a "safer" variant in the documentation?

Yves suggested improving it to something like:

    s/<
        (?:
            (?> [^>'"]+ )
          | (?> " (?> [^"]* ) " )
          | (?> ' (?> [^']* ) ' )
        )*
      >
    //xgs

I am currently using something more like:

    1 while s~
        <
        [/!]? \w+
        (?: \s+ (?: \w+ = )?
            (?: " [^"]* " | ' [^']* ' | \w+ )
        )*
        >
    ~~xs;

-- 
Affijn, Ruud

"Gewoon is een tijger."
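
P.S. For anyone who wants to try it, here is a minimal sketch that applies
the atomic-group variant to a well-formed fragment and to the "incomplete"
string above. The helper name strip_tags() and the sample <p> fragment are
made up for illustration, and this has not been checked against real-world
HTML. The atomic groups (?> ... ) are what should keep the failing case
cheap: once a run of non-quote characters or a quoted value has matched,
the engine may not give part of it back and retry.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# strip_tags() is a hypothetical helper name; the regex is the
# atomic-group variant quoted above, unchanged.
sub strip_tags {
    my ($text) = @_;
    $text =~ s/< (?: (?> [^>'"]+ ) | (?> " (?> [^"]* ) " ) | (?> ' (?> [^']* ) ' ) )* > //xgs;
    return $text;
}

# A quoted attribute value may contain '>' without ending the tag.
print strip_tags(qq{<p title="a>b">Hello, <b>world</b>!</p>\n});   # prints "Hello, world!"

# The "incomplete" string: the double quote is never closed, so there
# is no complete tag to remove and the string comes back unchanged.
my $broken = qq{> <META http-equiv=3DContent-Type content=3D"text/html; =\n>};
print strip_tags($broken) eq $broken ? "unchanged\n" : "changed\n";   # prints "unchanged"
```

Note that an unterminated tag is simply left in place; whether that is the
right behaviour for the documentation example is a separate question.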