Comprehensive analysis of genomic diversity of SARS-CoV-2 in different geographic regions of India: An endeavour to classify Indian SARS-CoV-2 strains on the basis of co-existing mutations | bioRxiv preprints (not peer reviewed)

Zotero / K4D COVID-19 Health Evidence Summaries Group / Top-Level Items 2020-09-23

Type Journal Article Author Rakesh Sarkar Author Suvrotoa Mitra Author Pritam Chandra Author Priyanka Saha Author Anindita Banerjee Author Shanta Dutta Author Mamta Chawla-Sarkar URL https://www.biorxiv.org/content/10.1101/2020.07.14.203463v1 Rights © 2020, Posted by Cold Spring Harbor Laboratory. The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. Pages 2020.07.14.203463 Publication bioRxiv Date 15/07/2020 Extra Publisher: Cold Spring Harbor Laboratory Section: New Results DOI 10.1101/2020.07.14.203463 Library Catalog www.biorxiv.org Language en Abstract Accumulation of mutations within the genome is the primary driving force for viral evolution within an endemic setting. This inherent feature often leads to altered virulence, infectivity and transmissibility as well as antigenic shift to escape host immunity, which might compromise the efficacy of vaccines and antiviral drugs. Therefore, we aimed at genome-wide analyses of circulating SARS-CoV-2 viruses for the emergence of novel co-existing mutations and trace their spatial distribution within India. Comprehensive analysis of whole genome sequences of 441 Indian SARS-CoV-2 strains revealed the occurrence of 33 different mutations, 21 being distinctive to India. Emergence of novel mutations were observed in S glycoprotein (7/33), NSP3 (6/33), RdRp/NSP12 (4/33), NSP2 (2/33) and N (2/33). Non-synonymous mutations were found to be 3.4 times more prevalent than synonymous mutations. We classified the Indian isolates into 22 groups based on the co-existing mutations. Phylogenetic analyses revealed that representative strain of each group divided themselves into various sub-clades within their respective clades, based on the presence of unique co-existing mutations. India was dominated by A2a clade (55.60%) followed by A3 (37.38%) and B (7%), but exhibited heterogeneous distribution among various geographical regions. The A2a clade mostly predominated in East India, Western India and Central India, whereas A3 clade prevailed in South and North India. In conclusion, this study highlights the divergent evolution of SARS-CoV-2 strains and co-circulation of multiple clades in India. Monitoring of the emerging mutations would pave ways for vaccine formulation and designing of antiviral drugs. Short Title Comprehensive analysis of genomic diversity of SARS-CoV-2 in different geographic regions of India