Re: structured bibliographic info for BioMed Central articles now available as RDF

From: Stefano Mazzocchi <stefanom_at_mit.edu>
Date: Wed, 14 Sep 2005 14:46:34 -0400

Matthew Cockerill wrote:

> embedding the RDF as a comment is already in very common use
> for CC metadata - so shouldn't Piggy Bank really support the
> identification of islands of RDF in comments like this?

The problem is performance: since Piggy Bank does not do the HTML
parsing itself, and the HTML parser is not aware of RDF/XML in comments,
we pretty much need to walk the DOM looking for comments on *every*
page you visit.
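
To make the cost concrete, here is a minimal sketch of that kind of
scan, assuming Python's standard html.parser rather than anything
Piggy Bank actually uses. The names RdfCommentFinder and
find_rdf_in_comments are made up for illustration, and the check for
"<rdf:RDF" is only a rough heuristic. The point is that the whole page
has to be parsed before you know whether it holds any RDF at all.

```python
from html.parser import HTMLParser


class RdfCommentFinder(HTMLParser):
    """Collects HTML comments that appear to contain an RDF/XML island."""

    def __init__(self):
        super().__init__()
        self.rdf_islands = []

    def handle_comment(self, data):
        # Called once per <!-- ... --> comment; 'data' is the comment body.
        # Heuristic (an assumption): treat any comment that mentions the
        # rdf:RDF root element as an embedded RDF/XML island.
        if "<rdf:RDF" in data:
            self.rdf_islands.append(data.strip())


def find_rdf_in_comments(html):
    """Return the text of every comment in 'html' that looks like RDF/XML."""
    finder = RdfCommentFinder()
    finder.feed(html)  # the entire document is parsed, RDF or no RDF
    finder.close()
    return finder.rdf_islands
```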

We'll need to do it for Creative Commons anyway.

But it really doesn't hurt to put a <link> to the RDF data in the page
as well (Google can just ignore it, while we and other crawlers might
prefer that to scraping the whole page looking for RDF/XML embedded
in comments).
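
For comparison, a sketch of the <link> approach, again assuming
Python's standard html.parser. The name find_rdf_links is hypothetical,
and matching on type="application/rdf+xml" is an assumption about how
the link would be marked up, not something agreed in this thread. A
crawler only has to look at <link> elements and can then fetch the RDF
separately.

```python
from html.parser import HTMLParser


class RdfLinkFinder(HTMLParser):
    """Collects href values of <link> elements that advertise RDF/XML."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # Also invoked for self-closing tags such as <link ... />.
        if tag != "link":
            return
        attr = dict(attrs)
        # Assumption: the RDF link is identified by its MIME type.
        mime = (attr.get("type") or "").lower()
        if mime == "application/rdf+xml" and attr.get("href"):
            self.hrefs.append(attr["href"])


def find_rdf_links(html):
    """Return the href of every <link> in 'html' typed as RDF/XML."""
    finder = RdfLinkFinder()
    finder.feed(html)
    finder.close()
    return finder.hrefs
```

Everything else on the page, comments included, can be skipped.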

-- 
Stefano Mazzocchi
Research Scientist                 Digital Libraries Research Group
Massachusetts Institute of Technology            location: E25-131C
77 Massachusetts Ave                   telephone: +1 (617) 253-1096
Cambridge, MA  02139-4307              email: stefanom at mit . edu
-------------------------------------------------------------------
Received on Wed Sep 14 2005 - 18:42:11 EDT
