We make tools and insights using
open data, open content and open code
Join in »

The Data Wrangling Blog

  • 06 March 2015 Paul Walsh

    The Good Tables web service

    Introducing the Good Tables web service The Good Tables web service is an API and UI for processing tabular data, being an HTTP wrapper around Good Tables, which was previously announced on OKFN Labs. It is built by Open Knowledge, with funding from the Open...
  • Getting text out of documents Last year I was working on beta.offenedaten.de, a catalog of data catalogs in Germany using the CKAN platform as the basis. Although the topic of how to enable full-text search of documents in CKAN data catalogs is somewhat open, I...
  • 20 February 2015 Paul Walsh

    Introducing Good Tables

    What is it? Good Tables is a Python package for validating tabular data through a processing pipeline. It is built by Open Knowledge, with funding from the Open Data User Group. Good Tables is currently an alpha release. Applications range from simple validation checks on...
  • Wanted: volunteers to join a team of “Data Curators” maintaining “core” datasets (like GDP or ISO-codes) in high-quality, easy-to-use and open form. What is the project about: Collecting and maintaining important and commonly-used (“core”) datasets in high-quality, standardized and easy-to-use form - in particular, as...
  • dpm the command-line ‘data package manager’ now supports pushing (Tabular) Data Packages straight into a CKAN instance (including pushing all the data into the CKAN DataStore): dpm ckan {ckan-instance-url} This allows you, in seconds, to get a fully-featured web data API – including JSON and...
  • All blog posts…