Input data for Open Citations – the PMC Open Access Subset

Connotea Imports 2012-07-31

Summary:

"The Open Citations Project has to date worked exclusively with the Open Access subset (OASS) of PMC. As of 24 January 2011, there were 204,637 OASS articles, including a few published before 1980. In almost all of these OASS articles, the reference lists were nicely marked up in NLM-DTD XML, making the task of identifying individual references straightforward. In a few cases, the articles were present as scanned page images, lacking any internal markup – those we were unable to process. From the XML reference lists of these papers, we were able to identify and extract 6,325,178 individual references, which, together with the bibliographic information we had on the OASS articles themselves gave us 6,529,815 independent bibliographic records of both citing and cited entities...."

Link:

http://opencitations.wordpress.com/2011/07/01/input-data-for-open-citations-the-pmc-open-access-subset/

From feeds:

Open Access Tracking Project (OATP) » Connotea Imports

Tags:

oa.new oa.data ru.ps oa.metadata oa.citations oa.pmc

Authors:

petersuber

Date tagged:

07/31/2012, 13:22

Date published:

07/02/2011, 23:29