Wikipedia citations in Wikidata – Diff
peter.suber's bookmarks 2021-07-16
From Google's English: "The Wikipedia Citations dataset currently includes approximately 30 million citations from Wikipedia pages to a variety of sources, including 4 million scientific publications. Increasing the connection with external data services and providing structured data for one of the key elements of Wikipedia articles has two significant advantages: first, better identification of relevant encyclopedic articles related to academic studies; second, the strengthening of Wikipedia as a social authority and political hub, which would allow policy makers to gauge the importance of an article, a person, a research group, or an institution by looking at how many Wikipedia articles cite them.
These are the motivations behind the “Wikipedia Citations in Wikidata” project, supported by a grant from the WikiCite Initiative. From January 2021 until the end of April, the team of Silvio Peroni (co-founder and director of OpenCitations), Giovanni Colavizza, Marilena Daquino, Gabriele Pisciotta and Simone Persiani of the University of Bologna (Department of Classical and Italian Philology) worked on the development of a codebase to enrich Wikidata with citations to academic publications that are currently referenced in English Wikipedia. This codebase is divided into four Python software modules and integrates new components (a classifier to distinguish citations based on the cited source and a search module to equip citations with identifiers from Crossref or other APIs). In doing so, Wikipedia Citations extends previous work that focused only on citations that already have identifiers...."
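To make the identifier search step concrete, here is a minimal sketch of how a free-text citation might be matched to a DOI through the public Crossref REST API. It is not the project's code: the function names (build_query_url, first_doi, lookup_doi) are hypothetical, and taking the top-ranked result is an assumption made for illustration. It relies only on the Crossref /works endpoint with its query.bibliographic parameter.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import urlopen

# Public Crossref REST API endpoint for searching works.
CROSSREF_WORKS = "https://api.crossref.org/works"


def build_query_url(citation_text: str, rows: int = 1) -> str:
    """Build a Crossref search URL for a free-text citation string."""
    params = {"query.bibliographic": citation_text, "rows": rows}
    return f"{CROSSREF_WORKS}?{urlencode(params)}"


def first_doi(payload: dict) -> Optional[str]:
    """Return the DOI of the top-ranked item in a Crossref response, if any."""
    items = payload.get("message", {}).get("items", [])
    return items[0].get("DOI") if items else None


def lookup_doi(citation_text: str) -> Optional[str]:
    """Query Crossref over the network and return the best-matching DOI.

    Assumption: the top hit is accepted as-is. A real pipeline would
    presumably check the match (title, year, authors) before using it.
    """
    with urlopen(build_query_url(citation_text), timeout=10) as resp:
        return first_doi(json.load(resp))
```

Calling lookup_doi with the raw text of a reference would return a candidate DOI or None. Because the top result is not guaranteed to be the cited work, the candidate should be treated as a suggestion to verify rather than a confirmed identifier.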