
![]() |
Fatty
Webbots, Spiders, and Screen Scrapers (http://www.schrenk.com/nostarch/webbots/index.
php ![]() It's a good introduction to cURL and scraping. If you're a PHPguy and are completely new to scraping this would be a good book, it has a set of libraries that make things easier, also helps youlearnhow to extract information out of a web page and do stuff with it. The chapters take you through the commands, putting them together into the author's library, and then some minor projects to make use of them. It also has some chapters on integrating with POP/NNTP/SMTP which may or may not be interesting to you.If you're a Perlguy and have an understanding of WWW::Mechanize or LWP you might find this a bit primitive, especially if you've done work with HTML::TreeBuilder and HTML::TokeParser. In that case, O'Reilly's "LWP &Perl" would be a better bet tolearnthe advanced ways of scraping with TreeBuilder and TokeParser. "Spidering Hacks" is another good one, it's a lot lighter on the technical stuff but has a lot of good information and examples to work from, especially using WWW::Mechanize.Fatty nop_90
cool
![]() piratescurvy
I bought it, I'm a sucker.
![]() perkiset
Review please!
|

Thread Categories

![]() |
![]() |
Best of The Cache Home |
![]() |
![]() |
Search The Cache |
- Ajax
- Apache & mod_rewrite
- BlackHat SEO & Web Stuff
- C/++/#, Pascal etc.
- Database Stuff
- General & Non-Technical Discussion
- General programming, learning to code
- Javascript Discussions & Code
- Linux Related
- Mac, iPhone & OS-X Stuff
- Miscellaneous
- MS Windows Related
- PERL & Python Related
- PHP: Questions & Discussion
- PHP: Techniques, Classes & Examples
- Regular Expressions
- Uncategorized Threads