Re: [RT] Learning from Greasemonkey + Platypus

From: Stefano Mazzocchi <stefanom_at_mit.edu>
Date: Wed, 03 Aug 2005 18:27:30 -0700

Bruce D'Arcus wrote:
>
> On Aug 3, 2005, at 8:12 PM, Eric Miller wrote:
>
>> ...
>> - http://simile.mit.edu/piggy-bank/screen-scrapers-howto.html
>>
>> for additional details.
>
>
> Still, given that most web pages are not valid XML, if there was some
> way to pipe pages through Tidy first, that might open up more options?

We *are* using JTidy.

-- 
Stefano Mazzocchi
Research Scientist                 Digital Libraries Research Group
Massachusetts Institute of Technology            location: E25-131C
77 Massachusetts Ave                   telephone: +1 (617) 253-1096
Cambridge, MA  02139-4307              email: stefanom at mit . edu
-------------------------------------------------------------------
Received on Thu Aug 04 2005 - 01:23:56 EDT

This archive was generated by hypermail 2.3.0 : Thu Aug 09 2012 - 16:39:18 EDT