PubMed Computed Authors in 2024: an open resource of disambiguated author names in biomedical literature | Bioinformatics | Oxford Academic

peter.suber's bookmarks 2024-11-29

Summary:

Abstract:  Over 55% of author names in PubMed are ambiguous: the same name is shared by different individual researchers. This poses significant challenges on precise literature retrieval for author name queries, a common behavior in biomedical literature search. In response, we present a comprehensive dataset of disambiguated authors. Specifically, we complement the automatic PubMed Computed Authors algorithm with the latest ORCID data for improved accuracy. As a result, the enhanced algorithm achieves high performance in author name disambiguation, and subsequently our dataset contains more than 21 million disambiguated authors for over 35 million PubMed articles and is incrementally updated on a weekly basis. More importantly, we make the dataset publicly available for the community such that it can be utilized in a wide variety of potential applications beyond assisting PubMed’s author name queries. Finally, we propose a set of guidelines for best practices of authors pertaining to use of their names.

 

Link:

https://academic.oup.com/bioinformatics/article/40/11/btae672/7888882

From feeds:

Open Access Tracking Project (OATP) » peter.suber's bookmarks

Tags:

oa.new oa.pubmed oa.medicine oa.orcid oa.authors oa.scholcomm oa.recommendations oa.best_practices

Date tagged:

11/29/2024, 11:21

Date published:

11/29/2024, 06:21