Increasing metadata coverage of SRA BioSample entries using deep learning-based named entity recognition

Database (Oxford) 2022-01-31

Summary:

High-quality metadata annotations for data hosted in large public repositories are essential for research reproducibility and for conducting fast, powerful and scalable meta-analyses. Currently, a majority of sequencing samples in the National Center for Biotechnology Information's Sequence Read Archive (SRA) are missing metadata across several categories. In an effort to improve the metadata coverage of these samples, we leveraged almost 44 million attribute-value pairs from SRA BioSample to...

Link:

https://pubmed.ncbi.nlm.nih.gov/33914028/?utm_source=Other&utm_medium=rss&utm_campaign=journals&utm_content=101517697&fc=None&ff=20220131034121&v=2.17.5

From feeds:

📚BioDBS Bibliography » Database (Oxford)

Tags:

Authors:

Adam Klie, Brian Y Tsui, Shamim Mollah, Dylan Skola, Michelle Dow, Chun-Nan Hsu, Hannah Carter

Date tagged:

01/31/2022, 03:44

Date published:

04/29/2021, 06:00