SoIR: a comprehensive Solanaceae information resource for comparative and functional genomic study
(database[TitleAbstract]) AND (Nucleic acids research[Journal]) 2024-11-15
Nucleic Acids Res. 2024 Nov 11:gkae1040. doi: 10.1093/nar/gkae1040. Online ahead of print.
ABSTRACT
The Solanaceae family, which includes economically important crops such as tomatoes, potatoes and peppers, has experienced a rapid expansion in genomic data due to advancements in sequencing technologies. However, existing databases are limited by incomplete species representation, a lack of comprehensive comparative genomic tools and the absence of systematic pan-genomic analyses. To address these gaps, we developed the Solanaceae Information Resource (SoIR, https://soir.bio2db.com), a comprehensive genomics database for the Solanaceae family. SoIR integrates genomic data from 81 species and transcriptomic data from 41 species, encompassing a total of 3 908 408 gene annotations derived from Gene Ontology, nonredundant protein, Pfam, Swiss-Prot and TrEMBL databases. The resource also includes 3 437 115 CRISPR guide sequences, 212 395 transcription factors and 19 086 genes associated with methylation modification. In addition to species-specific analyses, SoIR provides extensive bioinformatics tools for investigating gene family evolution, phylogenetic relationships and karyotype reconstruction across 25 fully sequenced genomes. With advanced tools such as Blast, Synteny and Sequence Alignment, the platform provides users with interactive and intuitive visualizations for conducting cross-species comparative genomics. As the first comprehensive pan-genomic resource for the entire Solanaceae family, SoIR facilitates in-depth cross-species analysis, supporting global research initiatives in plant evolution, functional genomics and crop improvement.
PMID:39526372 | DOI:10.1093/nar/gkae1040