Re: no coordinates in my starbucks piggy bank results

From: David Huynh <dfhuynh_at_csail.mit.edu>
Date: Fri, 20 Jan 2006 14:38:05 -0500

I just tried out the craigslist scraper and it did not stall for me. I'm
on Windows XP, Firefox 1.5.

Could you create a totally new Firefox profile through the Profile
Manager and try again?... Any proxy issue?...

David


Susan Hardin wrote:

> if I go to http://www.craigslist.org/apa/ and try to run the scraper
> (using 0, 1, or any number for the # of pages to load)
>
> Javascript console results: absolutely nothing
>
> Java Console results:
>
> ===== Fri Jan 20 2006 ===== 14:13:15 America/Detroit =====
> 14:13:19.536 [...RDFUtilities] Query took 2ms:
> SELECT DISTINCT s FROM {s}
> <http://simile.mit.edu/2005/04/piggy-bank#originURL>
> {"http://www.craigslist.org/apa/"} (7437ms)
> 14:13:19.543 [...ileUtilities] application/rss+xml -> RSS (7ms)
> 14:13:19.544 [...ileUtilities] http://www.craigslist.org/apa/index.rss
> -> RSS (1ms)
> 14:13:19.544 [...ileUtilities] text/html -> null (0ms)
> 14:13:19.544 [...ileUtilities] http://www.craigslist.org/apa/ -> null
> (0ms)
> 14:13:27.529 [...RDFUtilities] Query took 2ms:
> SELECT DISTINCT o FROM
> {<http://people.csail.mit.edu/people/dfhuynh/research/downloads/screen-scrapers/screen-scraper#CraigslistApartmentListingScraper>}
> <http://simile.mit.edu/2005/04/piggy-bank#code> {o} (7985ms)
> 14:13:27.534 [...RDFUtilities] Query took 2ms:
> SELECT DISTINCT o FROM
> {<http://people.csail.mit.edu/people/dfhuynh/research/downloads/screen-scrapers/screen-scraper#CraigslistApartmentListingScraper>}
> <http://simile.mit.edu/2005/04/piggy-bank#code> {o} (5ms)
>
> =======================================
>
>
> As an experiment, I clicked on "Next 100 Results" then clicked on the
> coin. The expected RSS data came up quickly in Piggy Bank, but of
> course, had no coordinates data associated with it since I wasn't
> running the scraper.
>
> Is this related to the utilities.processDocuments() bug?
>
>
> Susan
>
>
>
>
> I On Jan 19, 2006, at 2:17 AM, David Huynh wrote:
>
> I don't know why it gets stuck. Is there any error in the
> JavaScript console or the Java console?
>
> David
>
>
> Prokopp, Christian wrote:
>
> No idea..it might take me 5 minutes to get through a
> craigslist page but eventually it works. Maybe David or
> Stefano have an idea?!
> Cheers,
> Christian
>
> ------------------------------------------------------------------------
>
> *From:* Susan Hardin [mailto:shardin_at_umich.edu]
> *Sent:* Saturday, 14 January 2006 1:47 AM
> *To:* general_at_simile.mit.edu
> *Subject:* Re: no coordinates in my starbucks piggy bank results
>
> I understand how this should work, but for some reason I never
> get past the point of being told that additional pages are
> being loaded. I choose 0 or 1 for additional pages, to
> minimize the amount of time it would take to get results, and
> still my browser just churns and churns. That window never
> goes away, and no results appear. Since it never gets past
> this point, it never steps into the "waiting for Google
> coordinates" popup window. This occurs on all machines I've
> tested. Seems that it is in a loop or something, or not
> finding the additional pages.
>
> Thinking that my home networks (ISDN and Satellite) were too
> slow, I tried it several times on the university's high speed
> network this morning. Same results.
>
> I experimented with creating a couple of scrapers using
> solvent last night for single pages and things went very well.
> However, any attempts at working with multiple pages seemed to
> stall out or hang. I would have to restart Firefox to be able
> stop the current process so I could retry running the scraper
> (using solvent).
>
> My primary machine is a Mac, running 10.3.9, the other Mac is
> running 10.4.x(?) and both machines are running Firefox 1.5.
> The other machine is a PC, running XP and Firefox 1.5. All are
> running Piggy Bank 2.1.2 (Jan 11th build).
>
> Any ideas on what's happening?
>
> Susan
> Susan Hardin
> Webmaster - sakaiproject.org
> shardin_at_umich.edu
>
>
>
>
>
> Susan Hardin
> Webmaster - sakaiproject.org
> shardin_at_umich.edu
>
>
Received on Fri Jan 20 2006 - 19:37:42 EST

This archive was generated by hypermail 2.3.0 : Thu Aug 09 2012 - 16:39:18 EDT