Large scale genomic analysis of 3067 SARS-CoV-2 genomes reveals a clonal geodistribution and a rich genetic variations of hotspots mutations

58Citations
Citations of this article
177Readers
Mendeley users who have this article in their library.

Abstract

In late December 2019, an emerging viral infection COVID-19 was identified in Wuhan, China, and became a global pandemic. Characterization of the genetic variants of SARS-CoV-2 is crucial in following and evaluating it spread across countries. In this study, we collected and analyzed 3,067 SARS-CoV-2 genomes isolated from 55 countries during the first three months after the onset of this virus. Using comparative genomics analysis, we traced the profiles of the whole-genome mutations and compared the frequency of each mutation in the studied population. The accumulation of mutations during the epidemic period with their geographic locations was also monitored. The results showed 782 variants sites, of which 512 (65.47%) had a non-synonymous effect. Frequencies of mutated alleles revealed the presence of 68 recurrent mutations, including ten hotspot non-synonymous mutations with a prevalence higher than 0.10 in this population and distributed in six SARS-CoV-2 genes. The distribution of these recurrent mutations on the world map revealed that certain genotypes are specific to geographic locations. We also identified co-occurring mutations resulting in the presence of several haplotypes. Moreover, evolution over time has shown a mechanism of mutation co-accumulation which might affect the severity and spread of the SARS-CoV-2. The phylogentic analysis identified two major Clades C1 and C2 harboring mutations L3606F and G614D, respectively and both emerging for the first time in China. On the other hand, analysis of the selective pressure revealed the presence of negatively selected residues that could be taken into considerations as therapeutic targets. We have also created an inclusive unified database (http://covid-19.medbiotech.ma) that lists all of the genetic variants of the SARS-CoV-2 genomes found in this study with phylogeographic analysis around the world.

References Powered by Scopus

The Sequence Alignment/Map format and SAMtools

41210Citations
N/AReaders
Get full text

MAFFT multiple sequence alignment software version 7: Improvements in performance and usability

31114Citations
N/AReaders
Get full text

MEGA X: Molecular evolutionary genetics analysis across computing platforms

28479Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Sars-cov-2 m<sup>pro</sup>: A potential target for peptidomimetics and small-molecule inhibitors

125Citations
N/AReaders
Get full text

Stability of SARS-CoV-2 phylogenies

60Citations
N/AReaders
Get full text

SARS-CoV-2 main protease suppresses type I interferon production by preventing nuclear translocation of phosphorylated IRF3

56Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Laamarti, M., Alouane, T., Kartti, S., Chemao-Elfihri, M. W., Hakmi, M., Essabbar, A., … Ibrahimi, A. (2020). Large scale genomic analysis of 3067 SARS-CoV-2 genomes reveals a clonal geodistribution and a rich genetic variations of hotspots mutations. PLoS ONE, 15(11 November). https://doi.org/10.1371/journal.pone.0240345

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 47

48%

Researcher 33

34%

Professor / Associate Prof. 12

12%

Lecturer / Post doc 6

6%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 34

40%

Medicine and Dentistry 23

27%

Agricultural and Biological Sciences 18

21%

Immunology and Microbiology 11

13%

Article Metrics

Tooltip
Mentions
News Mentions: 2
Social Media
Shares, Likes & Comments: 131

Save time finding and organizing research with Mendeley

Sign up for free