DataPatterns.org: let’s collect some tricks for data wrangling!
Connotea Imports 2012-07-31
Summary:
"How do you scrape a massive online archive? How do you fix a broken CSV file? How do you normalize entity names in a large collection of records?
There is a lot of practical skill in handling newly opened data, and the implicit promise of the open data movement is that we will help more people to access and re-use data. And while it would be desirable to be able to offer simple web-based tools for data wrangling, the truth is that what’s required is often a wild mix of web tools, desktop and command-line tools and programming skills.
So what we need is the other half of the Open Data Manual.
datapatterns.org will be a collaborative attempt to collect specific tips on how to code, wrangle and hack your way through messy data...."