Re: Spidering

From: shawn wilson
Date: August 2, 2011 17:08
Subject: Re: Spidering
Message ID: CAH_OBicgszFtuOaWV+NcJAV60m=_O7hw2xKcCy9gZdDqJMadpA@mail.gmail.com

On Tue, Aug 2, 2011 at 19:47, Mike McClain <mike.junk@cox.net> wrote:
> On Mon, Aug 01, 2011 at 06:51:37PM +0100, Rob Dixon wrote:
>> On 01/08/2011 11:03, VinoRex.E wrote:
>> >
>> >Hi everyone, I am a beginner at Perl. Can you give me pseudocode and
>> >sample code for a spider program? It will be helpful in understanding web
>> >interfaces. Thank you.
>>
>> Since the purpose of your efforts is to learn Perl, I think a module
>> like WWW::Mechanize is the wrong choice. To write a program that
>> accesses the internet, you should install and study the LWP library.
>
> For my first ever web app I started with Mechanize because I'd seen it
> recommended here so many times. I don't believe it possible to use
> Mechanize without having to become quite familiar with most of the
> LWP library, particularly LWP::UserAgent, HTML::TreeBuilder,
> HTML::Form.
> JMHO,

Yeah, that's why I like Web::Scraper. Now that I know it (even though
it's been three months since I last needed it), I can still scrape a
site in 15 minutes. For more intense stuff I can understand using
Mechanize, but most sites aren't that complex.
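
For anyone who hasn't seen Web::Scraper, here is a minimal sketch of the
kind of quick scrape described above. The sample HTML, the variable names,
and the 'title' and 'links' keys are made up for illustration, and the page
is inlined so nothing is fetched over the network:

```perl
use strict;
use warnings;
use Web::Scraper;

# Hypothetical sample page, inlined so the sketch needs no network access.
my $html = <<'HTML';
<html><body>
  <h1>Sample page</h1>
  <a href="/one">One</a>
  <a href="/two">Two</a>
</body></html>
HTML

# Declare what to extract: the text of the <h1> and the href of every link.
my $page = scraper {
    process 'h1', 'title'   => 'TEXT';
    process 'a',  'links[]' => '@href';
};

# scrape() accepts an HTML string; it also accepts a URI object,
# in which case the page is fetched first.
my $result = $page->scrape($html);

print "Title: $result->{title}\n";
print "Link:  $_\n" for @{ $result->{links} || [] };
```

To turn this into a spider, you would pass URI->new($url) to scrape()
instead of a string, then queue the returned links and repeat, keeping
track of which URLs you have already visited.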
