Generic Web Page Scraper
This script pulls data from any HTML web page, capturing information such as title, location, and tries to transform any existing <meta> element into something that meaningful in RDF.
[edit]
Formal Semantic Markup
- creator: Stefano Mazzocchi
- match: "^.*$"
- javascript: http://simile.mit.edu/repository/piggy-bank/trunk/src/extension/chrome/content/scrapers/generic-page-scraper.js
- generates: http://simile.mit.edu/2005/04/ontologies/web#Page
- directions: Run on any page
- disclaimer: This software is provided 'as-is' with no expressed or implied warranty of any kind. Users of this software take all responsibility for the accuracy (or lack thereof) of any results obtained through its use.
Facts about Generic Web Page Scraper — Click + to find similar pages.RDF feed
| Attribute values | |
|---|---|
| Creator | Stefano Mazzocchi + |
| Pb:url pattern | ^.*$ + |
| Pb:javascript URL | http://simile.mit.edu/repository/piggy-bank/trunk/src/extension/chrome/content/scrapers/generic-page-scraper.js + |
| Pb:generates | http://simile.mit.edu/2005/04/ontologies/web#Page + |
| Pb:directions | Run on any page + |
| Pb:disclaimer | This software is provided 'as-is' with no expressed or implied warranty of any kind. Users of this software take all responsibility for the accuracy (or lack thereof) of any results obtained through its use. + |

