Ted Han was the lead technologist behind DocumentCloud from 2011 to 2018, a successful project and hosting service used widely for publication of newsworthy documents and for document analysis. Ted has been involved in open source software for 15 years, was lead developer at Investigative Reporters and Editors and taught at Missouri University School of Journalism.
Ted’s work on Source
Articles by Ted
Our Search for the Best OCR Tool, and What We Found
A side-by-side comparison of seven OCR tools using multiple kinds of documents, from FactfulPosted on
We couldn’t find single side by side comparison of the most accessible OCR options, so we ran a handful of documents through seven different tools, and compared the results.
Protecting Your Sources When Releasing Sensitive Documents
Scrub metadata, redact information properly, search for microdots & morePosted on
Critical advice for protecting sources when releasing sensitive documents.
What AMP (Maybe) Means for News Developers
A Source roundtable on the implications of Google’s Accelerated Mobile PagesPosted on
When Google’s Accelerated Mobile Pages (AMP, naturally) arrived late last week, the journalism internet produced a rainbow of responses. We invited a few news developers to comment at greater length, and they dug into the issues with gusto and rigor.