BacDive in 2025: the core database for prokaryotic strain data

(database[TitleAbstract]) AND (Nucleic acids research[Journal]) 2024-11-11

Nucleic Acids Res. 2024 Oct 29:gkae959. doi: 10.1093/nar/gkae959. Online ahead of print.

ABSTRACT

In 2025, the bacterial diversity database BacDive is the leading database for strain-level bacterial and archaeal information. It has been selected as an ELIXIR Core Data Resource as well as a Global Core Biodata Resource. Since its initial release more than ten years ago, BacDive (https://bacdive.dsmz.de) has grown tremendously in content and functionalities, and is a comprehensive resource covering the phenotypic diversity of prokaryotes with data on taxonomy, morphology, physiology, cultivation, and more. The current release (2023.2) contains 2.6 million data points on 97 334 strains, reflecting an increase by 52% since the previous publication in 2021. This remarkable growth can largely be attributed to the integration of the world-wide largest collection of Analytical Profile Index (API) test results, which are now fully integrated into the database and searchable. A novel BacDive knowledge graph provides powerful search options through a SPARQL endpoint, including the possibility for federated searches across multiple data sources. The high-quality data provided by BacDive is increasingly being used for the training of artificial intelligence models and resulting genome-based predictions with high confidence are now used to fill content gaps in the database.

PMID:39470737 | DOI:10.1093/nar/gkae959