On Thu, 03 Feb 2005 20:22:20 -0500, Stefano Mazzocchi <stefanom_at_mit.edu> wrote:
> Dominique Hazaël-Massieux wrote:
> > Hi,
> >
> > With the arrival of piggy-bank making the Semantic Web part of the
> > browsing experience, there is one thing that I would love to see happen:
> > get piggy-bank understand GRDDL to harvest data on the Web.
>
> Bummer, you ruined my surprise :-)
++
> I can hardly agree more. An XSLT stylesheet is the ideal implementation
> of an XML2RDF bridge.... my concern is with non-well-formed HTML.. but
> yeah, we could ship JTidy along with piggy-bank.... hmmm...
John Cowan's TagSoup [1] is a lighter-weight alternative to JTidy, and
if you want really light weight and more general (but less smart) then
I've got a class for the purpose [2] (actually I think I may have a
slightly tidier version, functionally equiv, must check).
[1]
http://mercury.ccil.org/~cowan/XML/tagsoup/
[2]
http://dannyayers.com/archives/2004/09/08/jsoup/
--
http://dannyayers.com
Received on Fri Feb 04 2005 - 10:37:36 EST