Packaging Data: The core problem in general data sharing?

Amyluv's bookmarks 2017-11-06

Summary:

"In this final post about the IDRC data sharing pilot project I want to close the story that started with an epic rant a few months ago. To recap, I had data from the project that I wanted to deposit in Zenodo. Ideally I would have found an example of doing this well, organised my data files in a similar way, zipped up a set of directories with a structured manifest or catalogue in a recognised format and job done. It turned out to not be so easy. In particular there were two key gaps. The first was my inability to find specific guidance on how to organise the kinds of data that I had. These were, amongst other things, interview recordings and transcripts. You’d think that it would be common enough, but I could not find any guidance on standard metadata schema or the best way to organise the data. That is not the say that it isn’t out there. There is likely some very good guidance, and certainly expertise, but I struggled to find it. It was not very discoverable, and as I said at the time, if I can’t find it then its highly unlikely that the ‘average’ non-computational researcher generating small scale data sets will be able to. In the end the best advice I got on organizing the package was ‘organise it in the way that makes most sense for the kinds of re-use you’d expect’, so I did."

Link:

http://cameronneylon.net/blog/packaging-data-the-core-problem-in-general-data-sharing/

From feeds:

Open Access Tracking Project (OATP) » Amyluv's bookmarks

Tags:

oa.new oa.data oa.obstacles oa.platforms oa.metadata oa.rdm

Authors:

Cameron Neylon

Date tagged:

11/06/2017, 10:58

Date published:

11/06/2017, 10:35