Articles

Projects walkthroughs, tool teardowns, interviews, and more.

Articles tagged: data

  1. How We Made the SOTU Twitter Visualization

    By Nicolas Belmonte and Simon Rogers

    Posted on

    People tweet what they think, when they think it—and, crucially, we wanted to provide a visualization for the State of the Union speech which reflected that. This wouldn’t be a (shudder) word cloud based on frequencies but a way to track the conversation on Twitter as it was directly influenced by the President’s speech.

  2. You Got the Documents. Now What?

    By Jonathan Stray

    Posted on

    Jonathan Stray’s guide to turning documents into data you can run with.

  3. Human-Assisted Reporting Gets the Story

    By Tyler Dukes

    Posted on

    Tyler Dukes on combining the power of data-sorting tools with old-fashioned digging.

  4. How to Make a News App in Two Days

    By Al Shaw

    Posted on

    As part of the orientation week for the 2014 class of Knight-Mozilla OpenNews Fellows, fellow nerd-cuber Mike Tigas and I led a hackathon at Mozilla’s headquarters in San Francisco…

  5. To Scrape, Perchance to Tweet

    By Abe Epton

    Posted on

    At the Chicago Tribune, we had a simple goal: to automatically tweet contributions to Illinois politicians of $1,000 or more, which campaigns are required to report within five business days. To see, in something approximating real time, which campaigns are bringing in the big bucks and who those big-buck-bearers are. The Illinois State Board of Elections (ISBE) has helpfully published exactly this data for years online, in a format that appears to have changed very little since at least the mid-2000s. There’s no API for this data, but the stability of the format is encouraging. A scraper is hardly an ideal tool for anything intended to last for a while and produce public-facing data, but if we can count on the format of the page not to change much over at least the next several months, it’s probably worth it.

  6. Introducing Treasury.IO

    By Michael Keller and Cezary Podkul

    Posted on

    The U.S. Treasury’s Daily Treasury Statement lists actual cash spending down to the million on everything the government spent money on each day, as well as how it funded the spending. But, the Treasury only releases these files in PDF or fixed-width text files like this one, making any analysis very difficult. To liberate the data and make it easy to analyze federal money flows across time, we created Treasury.IO. The system we built downloads and parses the fixed-width files into a standard schema, creating a SQLite database that can be directly queried via a URL endpoint.

  7. Open Your Data

    By Waldo Jaquith

    Posted on

    Waldo Jaquith on the whys and wherefores of making it open

  8. And Remember, this Is for Posterity

    By Jacob Harris

    Posted on

    Jacob Harris on the hows and whys of designing interactives to survive the future.

  9. Building News Apps on a Shoestring

    By Alan Palazzolo

    Posted on

    Alan Palazzolo on how the MinnPost team rocks it without a big budget

  10. Well Hello, Census

    By Joe Germuska

    Posted on

    Joe Germuska on the iterative, human-centered process that’s made the new Census Reporter project especially awesome.

  11. Let research apps, MVC JavaScript, and APIs work for you

    By Agustin Armendariz, Michael Corey, and Aaron Williams

    Posted on

    The Center for Investigative Reporting continues their work visualizing Department of Veterans Affairs’ data. Here, they discuss their development process.

  12. Using Big Data to Ask Big Questions

    By Chase Davis

    Posted on

    Chase Davis lays down some data science upon us to change how you think about the questions you’re asking of your data

  13. US Shutdown Scuttles Data

    By Dan Sinker

    Posted on

    As the government shutdown grinds into its third day, many news developers, civic data hackers, and open gov activists are starting to feel the hurt due to the suspension of most government data feeds, APIs, and websites. How they’re adapting and collaborating to fill the gaps of the shutdown.

  14. The Times Regrets the Programmer Error

    By Jacob Harris

    Posted on

    Jake Harris opens a serious barrel of monkeys about when and how to issue corrections for data journalism.

  15. Kill All Your Darlings

    By Matt Waite

    Posted on

    Matt Waite on what to do when things don’t work out like you planned.

  16. All About Reporter

    By Erin Kissane and Jeremy Singer-Vine

    Posted on

    The Wall Street Journal’s Jeremy Singer-Vine recently released Reporter, an open source tool that makes it easy to hide and reveal the code behind common forms of data visualization presented on the web. We spoke with him about the tool’s makeup, design goals, and future development plan.

  17. People Power Prevails!

    By John Keefe

    Posted on

    John Keefe on tracking the cicada pestilence with open source sensor journalism and crowdsourced data collection

  18. Telling Your Boss “No”

    By Matt Waite

    Posted on

    Matt Waite says just because you can make it doesn’t mean you should

  19. Sane Data Updates Are Harder than You Think: Part 3

    By Adrian Holovaty

    Posted on

    Third in a three-part series by Adrian Holovaty about hairy data-parsing problems from a journalist’s perspective

  20. They Are Tweet Zombies!! They Are Followers!!

    By Jacob Harris

    Posted on

    Jake Harris on how dead accounts and spambots can mess with your Twitter data mojo

Current page