Project scraping archiving data Tracking Amtrak 188

Tracking Amtrak 188

How curiosity and tinkering let Al Jazeera America publish historical data for a derailed train’s route without Amtrak’s cooperation.

Project data scraping elections Scraping Nevada

Scraping Nevada

Derek Willis breaks down the three stages of scraping (denial, annoyance, and acceptance) while confronting the election-results form from hell.

Project Python scraping Twitter data To Scrape, Perchance to Tweet

To Scrape, Perchance to Tweet

At the Chicago Tribune, we had a simple goal: to automatically tweet contributions to Illinois politicians of $1,000 or more, which campaigns are required to report within five business days. To see, in something approximating real time, which campaigns are bringing in the big bucks and who those big-buck-bearers are. The Illinois State Board of Elections (ISBE) has helpfully published exactly this data for years online, in a format that appears to have changed very little since at least the mid-2000s. There’s no API for this data, but the stability of the format is encouraging. A scraper is hardly an ideal tool for anything intended to last for a while and produce public-facing data, but if we can count on the format of the page not to change much over at least the next several months, it’s probably worth it.

Roundup events Event Roundup, Apr 22

Event Roundup, Apr 22

Journalists gather in Italy this week, while Hacks/Hackers chapters hold meetups on balloon mapping and HTML 5, plus a cryptoparty.