develooper Front page | perl.libwww | Postings from April 2001

Does anyone know

Thread Previous | Thread Next
From:
ben goswami
Date:
April 24, 2001 07:34
Subject:
Does anyone know
Message ID:
005601c0cccb$b4b28680$2b0b0c18@smateo1.sfba.home.com
Hi,
Does anyone have any script to go to a url identify all the links for the headlines (by a paatern matching), go and download content from each of those links and write to a single file 
here is the scenario:
1. go to a urL:(ex: www.epaynews.com)
2. get the links for head lines (they start with pattern "<A HREF="/index.cgi?survey=&ref=browse&f=view&id=")
3. go to each of those links and download content and parse the content from <head line title> to <Related links> and write to a file. 
4. do it for all the headline links and write the contents to a single file.
I tried wget and webchatpp with not much success. wget does not have windows version and webchat 
does not have much docs
Thx
ben


Thread Previous | Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About