Flickr Scraper

This scraper scrapes photo information from http://flickr.com. Run it on any page containing image links to photo details page, such as tag search page, group page or users' sets.

Contents

Currently it scrapes:

  • dc:title (from photo title)
  • dc:creator (from photo author)
  • dc:date (from uploaded date, not taken date)
  • dc:rights ("All rights reserved" or link to creative commons license)
  • dc:source (flickr photo page URL)
  • tags (from photo tags)
  • coordinates (from <meta name="geo.position"...> - new way flickr geotags photos)

Wish list

  • all EXIF meta-data
  • scraping from non-flickr pages that reference flickr images (blogs for example) - currently impossible due to cross scripting security policy

Warning

This scraper is a multi page scraper. Scraping a page with a lot of flickr photos might take a few minutes.


Formal Semantic Description

Facts about FlickrPhotoScraper — Click + to find similar pages.RDF feed
Attribute values
Creator Václav Synáček  +
Pb:url pattern ^http://.*flickr\.com/.*  +
Pb:javascript URL http://simile.mit.edu/wiki/Flickr_Photo_Scraper_Script?action=raw  +
Pb:generates http://flickr.com#FlickrPhoto  +
Pb:directionsLink http://simile.mit.edu/wiki/FlickrPhotoScraper  +
Pb:directions Visit any page at http://flickr.com, click the coin and get info of all images linked from the page.  +
Pb:disclaimer This software is provided 'as-is' with no expressed or implied warranty of any kind. Users of this software take all responsibility for the accuracy (or lack thereof) of any results obtained through its use.  +