We make tools and insights using
open data, open content and open code
Join in »

The Data Wrangling Blog

  • Extracting data from PDFs remains, unfortunately, a common data wrangling task. This post reviews various tools and services for doing this with a focus on free (and preferably) open source options. The tools we can consider fall into three categories: Extracting text from PDF Extracting...
  • 25 March 2016 Alexandre Bonnasseau

    Tools for Data Packages: Make vs. Tuttle

    When crafting data from some other data, like packaging public data, using the good tools can really ease development process and reliability of the data. The venerable make which have already been used for decades to build software, is a very good option as advocated...
  • 11 March 2016 Dan Fowler

    Frictionless Data Transport in Python

    Tool and platform integrations for “Data Packages” are key elements of our Frictionless Data Initiative at Open Knowledge International. We recently posted on the main blog about some integration work funded by our friends at Google. We’ve built useful Python libraries for working with Tabular...
  • 18 February 2016 Josh Wieder

    Submit your Newsletter ideas today!

    The first quarter of 2016 is almost through, which means that the OKFN Labs Newsletter is on its way! But we have a problem. We know that you have spent the last 3 months writing awesome code, founding disruptive new projects and basically changing the...
  • 05 December 2015 Josh Wieder

    Labs newsletter: Q4 2015

    Hey there hackers & hackettes! Welcome to the 4th quarter 2015 Open Knowledge Labs Newsletter: A Very Special Holiday Edition of the Open Knowledge Labs Newsletter. We hope that all of our readers, volunteers, team members & contributors have a great holiday season. Labs is...
  • All blog posts…