Patterns
This section contains small snippets that will help you in the process of data wrangling. They might be small useful tips of full blown tutorials on tools or topics.
Note on the term ‘pattern’
The term pattern has developed a very specific meaning in software engineering. While we use the term in this sense, the tricks presented are not defined as a pattern using any of the formal templates that have developed for software design patterns.
- A short introduction to HTML
- Archiving Twitter
- Cleaning Data with Refine
- Cleaning spending data with Open Refine
- Cleaning up Data Scraped from the Web
- Creating Line Charts
- Creating a Choropleth map
- Creating an Interactive Bubble Chart
- Extracting Data from PDFs using Tabula
- Filtering Data
- Freedom of information
- Geocoding / Georeferencing Data
- Geocoding Data in a Google Docs Spreadsheet
- Getting data from the World Bank
- How to find data
- Keeping the data around
- Liberating Data from Microsoft Access Databases
- Liberating HTML Data Tables
- Outputting CSV from Postgres
- Publishing a Dataset on the DataHub
- Publishing our results
- Scraping - Beyond the Basics
- Scraping multiple Pages using the Scraper Extension and Refine
- Scraping websites using the Scraper extension for Chrome
- Sorting Data with Spreadsheets
- Spreadsheet Formulae
- Using a spreadsheet to clean up a dataset
- Walkthrough: Scatterplot
- What is an API and how to use one
- Improve this page Edit on Github Help and instructions
-
Donate
If you have found this useful and would like to support our work please consider making a small donation.