Mining and analysing invoice data from Elsevier relative to hybrid open access | SUB Goettingen, Scholarly Communication Analytics blog
flavoursofopenscience's bookmarks 2020-10-19
Summary:
Jahn, "Scholarly Communication Analytics: Mining and analysing invoice data from Elsevier relative to hybrid open access", Scholarly Communication Analytics Blog, 2019
Publishers rarely make publication fee spending for hybrid journals transparent. Elsevier is a remarkable exception, as the publisher provides open and machine-readable data relative to its central invoicing with funding bodies and fee waivers at the article level. This blogpost illustrates how to mine Elsevier full-texts for these data with the data science tool R and presents new insights by analysing the resulting dataset: of 70,657 articles published open access in 1,753 hybrid journals from 2015 to date, around one third of the publication fees were paid through central agreements. Nevertheless, the majority of funding sources for hybrid open access remains unclear.