Post by Peter Cooper on April 6th, 2007
Scraping IMDB with Ruby and Hpricot

- How To Scrape Google With Ruby In 0 Seconds
- scRUBYt - Hot, New Ruby Web-Scraping Toolkit Released
- The Top 10 Ruby / Rails Blogs (and Why)



Tim from We Heart Code has written an easy-to-follow, detailed tutorial about scraping data from the Internet Movie Database using Ruby and Hpricot. As I would have suspected, Peter Szinek, developer of ScRUBYt! presents an even simpler solution in the comments.

Click here to add on del.icio.us









April 6th, 2007 at 8:07 pm
Does IMDb not have a backend interface of any sort?
April 7th, 2007 at 12:18 am
You can download plain text files of IMDB, however I believe the only web API is available on a pay-per-play basis.
April 7th, 2007 at 6:31 am
They have a whole database you can download, I believe. I think IMDB was just chosen as an 'example' in this case. It's really a demonstration of how to scrape anything structured.
April 7th, 2007 at 6:39 am
(As Tim said.. didn't see till moderation :))
April 20th, 2007 at 3:35 pm
why not use omdb.org ? its restfull, its implemented in rails and it's free :)