Monitoring Open Access at a national level: French case study
openacrs's bookmarks 2019-10-18
Eric Jeangirard. Monitoring Open Access at a national level: French case study. ELPUB 2019 23rd edition of the International Conference on Electronic Publishing, Jun 2019, Marseille, France. ⟨hal-02141819⟩
After the launch of multiple plans for Open Science, there is now a need for an accurate method or tool to monitor the Open Science trends and in particular Open Access (OA) trends. We address this requirement with a methodology that we developed and tested for France, but that could be extended to other countries. Only open data and information available on the Web are used, leveraging as much as we can large-scale systems such as Unpaywall, HAL (the main open repository in France, part of the CNRS), ORCID and IDRef (referential for French Higher Education and Research). We used rule-based and machine learning techniques to enrich the metadata of the publications. We estimate that the overall OA rate for French affiliated publications ranges from 39% to 42% between 2013 and 2017. The trend is slightly up, except for the last year, but we gather evidence that shows this is a consequence of the moving nature of the OA status. Therefore these figures should be seen as a snapshot rather than definitive. For the last observed year (2017), we show that the OA rate varies according to the publication type, the publisher and the discipline (more than 60% in Mathematics while it is about 30% in Medical research which represents the largest share in the number of publications). We describe the main challenges of our method (detection of the publications with a French affiliation, metadata enrichment with machine learning, open access status) and evaluate the errors of each step. Most of the method is not country-specific and could be applied for another perimeter. Our implementation is open sourced on the repository https://github.com/dataesr/publications.