Selected article for: "GC content and GC GC content"

Author: Satyam, Rohit; Jha, Niraj Kumar; Kar, Rohan; Jha, Saurabh Kumar; Sharma, Ankur; Kumar, Dhruv; Nand, Parma; Ruokolainen, Janne; Kesari, Kavindra Kumar; Kamal, Mohammad Amjad
Title: Deciphering the SSR incidences across viral members of Coronaviridae family
  • Cord-id: dtvlo0kz
  • Document date: 2020_9_21
  • ID: dtvlo0kz
    Snippet: Presence of Simple Sequence Repeats (SSRs), both in genic and intergenic regions, have been widely studied in eukaryotes, prokaryotes, and viruses. In the current study, we undertook a survey to analyze the frequency and distribution of microsatellites or SSRs in multiple genomes of Coronaviridae members. We successfully identified 919 SSRs with length≥12 bp across 55 reference genomes majority of which (838 S SRs) were found abundant in genic regions. The in-silico analysis further identified
    Document: Presence of Simple Sequence Repeats (SSRs), both in genic and intergenic regions, have been widely studied in eukaryotes, prokaryotes, and viruses. In the current study, we undertook a survey to analyze the frequency and distribution of microsatellites or SSRs in multiple genomes of Coronaviridae members. We successfully identified 919 SSRs with length≥12 bp across 55 reference genomes majority of which (838 S SRs) were found abundant in genic regions. The in-silico analysis further identified the preferential abundance of hexameric SSRs than any other size-based motif class. Our analysis shows that the genome size and GC content of the genome had a weak influence on SSR frequency and density. However, we find a positive correlation of SSRs GC content with genomic GC content. We also report relatively low abundances of all theoretically possible 501 repeat motif classes in all the genomes of Coronaviridae. The majority of SSRs were AT-rich. Overall, we see an underrepresentation of SSRs across the genomes of Coronaviridae. Besides, our integrative study highlights the presence of SSRs in ORF1ab (nsp3, nsp4, nsp5A_3CLpro and nsp5B_3CLpro, nsp6, nsp10, nsp12, nsp13, & nsp15 domains), S, ORF3a, ORF7a, N & 3′ UTR regions of SARS-CoV-2 and harbours multiple mutations (3′UTR and ORF1ab SSRs serving as major mutational hotspots). This indicates the genic SSRs are under selection pressure against mutations that might alter the reading frame and at the same time responsible for rapid protein evolution. Our preliminary results indicate the significance of the limited repertoire of SSRs in the genomes of Coronaviridae.

    Search related documents:
    Co phrase search for related documents
    • acute respiratory and additional file: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • acute respiratory and long repeat: 1, 2, 3
    • acute respiratory and low reliability: 1, 2, 3
    • acute respiratory syndrome and adaptive advantage: 1
    • acute respiratory syndrome and additional file: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • acute respiratory syndrome and long repeat: 1, 2, 3
    • acute respiratory syndrome and low reliability: 1, 2, 3