Open Data and Preservation | The Signal: Digital Preservation

abernard102@gmail.com 2013-05-11

Summary:

Yesterday, May 9, 2013, the U.S. government issued an executive order and an open data policy mandating that federal agencies collect and publish new datasets in open, machine-readable, and, whenever possible, non-proprietary formats.  The new policy gives agencies six months to create an inventory of all the government-produced datasets they collect and maintain; a list of datasets that are publicly accessible; and an online system to collect feedback from the public as to how they would like to use the data.  The goals are twofold — greater access to government data for the public, and the availability of data in forms that businesses and researchers can better use.  This builds on the earlier White House Memorandum on Transparency and Open Government.  These documents were accompanied by a link to something that actually caught my fancy even more – a greatly expanded Project Open Data Github repository for guidelines, use cases and tools.  This, alongside the ever-growing (and soon to be extensively updated) data.gov, are evidence of real efforts to release more data and make it truly useful and usable.

The documents provide guidance on open licensing, metadata, and standards, as well as lifecycle-based information stewardship. But what I personally keep struggling with are two questions: What IS open data? And how is is being preserved?  The project has some defining principles for open data that I think can inform any dataset preservation project ... I am thrilled to see guidance about active management of datasets and supporting users in their work with the data.  But what could be available for this and all open dataset projects is more attention on dataset preservation.  These are a few of some great resources on this topic: [1] The Library of Congress Sustainability of Digital Formats site on datasets [2] A Report on the Preservation of Public Sector Datasets from Archives New Zealand [3] Open Data and Archiving Datasets from the National Archives UK [4] DataONE [5] Data-PASS [6] DataConservancy [7] Life of a Dataset from ICPSR [8] Best Practices for Archival Processing for Geospatial Datasets from the GeoMAPP Project [9] Datasets, Issues, Contexts and Solutions from the Open Planets Foundation..."

Link:

http://blogs.loc.gov/digitalpreservation/2013/05/open-data-preservation/

From feeds:

Open Access Tracking Project (OATP) » abernard102@gmail.com

Tags:

oa.new oa.psi oa.policies oa.licensing oa.comment oa.government oa.usa oa.green oa.copyright oa.best_practices oa.metadata oa.preservation oa.tools oa.github oa.data.gov oa.obama_directive oa.repositories oa.libre oa.data oa.standards

Date tagged:

05/11/2013, 16:32

Date published:

05/11/2013, 12:32