Post by Peter Cooper on April 6th, 2007

Scraping IMDB with Ruby and Hpricot


Tim from We Heart Code has written an easy-to-follow, detailed tutorial about scraping data from the Internet Movie Database using Ruby and Hpricot. As I would have suspected, Peter Szinek, developer of ScRUBYt!, presents an even simpler solution in the comments.

5 Responses to “Scraping IMDB with Ruby and Hpricot”

  1. #1
    Daniel Berger Says:

    Does IMDb not have a backend interface of any sort?

  2. #2
    Tim Says:

    You can download plain text files of IMDB; however, I believe the only web API is available on a pay-per-play basis.

  3. #3
    Peter Cooper Says:

    They have a whole database you can download, I believe. I think IMDB was just chosen as an 'example' in this case. It's really a demonstration of how to scrape anything structured.

  4. #4
    Peter Cooper Says:

    (As Tim said.. didn't see till moderation :))

  5. #5
    Ben Says:

    Why not use omdb.org? It's RESTful, it's implemented in Rails, and it's free :)