[status] weekly report

From: Stefano Mazzocchi <stefanom_at_mit.edu>
Date: Mon, 13 Feb 2006 09:26:50 -0500

last week:

  - continued the work on gadget 2.0 (reached two milestones) and
getting close to have something to show to the public (and found out
that Velocity is really not designed for recursive macro calls, grrrr)

  - continued the work on OCW RDFization (but work on Gadget 2.0, which
I need to do it well, slowed me down)

  - cleaned up the simile web site (and fixed a few html annoyances) and
added the Mellon proposal to the site (as suggested by MacKenzie) as an
indication of the project roadmap

  - cleaned up and updated the RDFizer list

  - added a README.txt and build.xml for those RDFizers that didn't have
one (gotta love clean up dishes for students ;-)

  - read a bunch of papers on discovering patterns in graph and trees,
found a few very promising ones from data-mining literature for XML
datasets (for the future work on Gadget and potentially for the
scalability of faceted browsing)

  - thought about ways to scale the O(n^2) problem of finding clusters
of similar strings in a list by string distance analysis, came out with
an interesting (and apparently innovative since I couldn't find anything
like that in the literature) user-interactive approach.

this week:

  - finish the RDFization of OCW

  - mine the overlap between OCW, DSpace and Barton, seeding with OCW
instructors and create the demo dataset.

  - help Ryan with longwell, if he needs it.

  - implement string clustering in Gadget to test the
speed/effectiveness of various distances.

  - if bored/snowed-under, implement sparklines in Gadget

  - force myself to start sending stuff in the mail to LA or I'll never
finish moving :-(

Stefano Mazzocchi
Research Scientist                 Digital Libraries Research Group
Massachusetts Institute of Technology            location: E25-131C
77 Massachusetts Ave                   telephone: +1 (617) 253-1096
Cambridge, MA  02139-4307              email: stefanom at mit . edu
Received on Mon Feb 13 2006 - 14:25:46 EST

This archive was generated by hypermail 2.3.0 : Thu Aug 09 2012 - 16:39:18 EDT