We make tools and insights using
open data, open content and open code
Join in »

The Data Wrangling Blog

  • 04 August 2016 Dan Fowler

    Embulk at csv,conf,v2

    Having co-organized csv,conf,v2 this past May, a few of us from Open Knowledge International had the awesome opportunity to travel to Berlin and sit in on a range of fascinating talks on the current state-of-the-art on wrangling messy data. Previously, I posted about Comma Chameleon...
  • 01 August 2016 Dan Fowler

    Using Data Packages with Pandas

    Frictionless Data is about making it effortless to transport high quality data among different tools and platforms for further analysis. We obviously ♥ data science, and pandas is one of the most popular Python libraries for advanced data analysis and modeling. This post highlights our...
  • 25 July 2016 Dan Fowler

    Publish Data Packages to DataHub (CKAN)

    Back in March, I wrote about a CKAN extension for publishing and exporting Data Packages1. This extension, datapackager, has been updated and is now live on our very own CKAN instance, DataHub. DataHub users can now import and export Data Packages via the CKAN UI...
  • 18 July 2016 Dan Fowler

    Comma Chameleon at csv,conf,v2

    Having co-organized csv,conf,v2 this past May, a few of us from Open Knowledge International had the awesome opportunity to travel to Berlin and sit in on a range of fascinating talks on the current state-of-the-art on wrangling messy data. One such talk was given by...
  • 14 July 2016 Dan Fowler

    Using Data Packages with R

    R is a popular open-source programming language and platform for data analysis. Frictionless Data is an Open Knowledge International project aimed at making it easy to publish and load high-quality data into tools like R through the creation of a standard wrapper format called the...
  • All blog posts…