DataPatterns.org: let’s collect some tricks for data wrangling!

Connotea Imports 2012-07-31

Summary:

"How do you scrape a massive online archive? How do you fix a broken CSV file? How do you normalize entity names in a large collection of records? There is a lot of practical skill in handling newly opened data, and the implicit promise of the open data movement is that we will help more people to access and re-use data. And while it would be desirable to be able to offer simple web-based tools for data wrangling, the truth is that what’s required is often a wild mix of web tools, desktop and command-line tools and programming skills. So what we need is the other half of the Open Data Manual. datapatterns.org will be a collaborative attempt to collect specific tips on how to code, wrangle and hack your way through messy data...."

Link:

http://blog.okfn.org/2011/08/04/datapatterns-org-lets-collect-some-tricks-for-data-wrangling/

From feeds:

Open Access Tracking Project (OATP) » Connotea Imports

Tags:

ru.no oa.new oa.data oa.comment oa.tools

Authors:

petersuber

Date tagged:

07/31/2012, 12:55

Date published:

08/05/2011, 23:12