Selected article for: "sequence identity and taxonomic classification"

Author: Babaian, Artem; Edgar, Robert C.
Title: Ribovirus classification by a polymerase barcode sequence
  • Cord-id: svnffhtk
  • Document date: 2021_3_3
  • ID: svnffhtk
    Snippet: RNA viruses encoding a polymerase gene (riboviruses) dominate the known eukaryotic virome. Next-generation sequencing is revealing a wealth of new riboviruses with uncharacterised phenotypes, precluding classification by traditional taxonomic methods. These are often classified on the basis of polymerase sequence identity, but standardised methods to support this approach are currently lacking. To address this need, we describe the polymerase palmprint, a well-defined segment of the palm sub-dom
    Document: RNA viruses encoding a polymerase gene (riboviruses) dominate the known eukaryotic virome. Next-generation sequencing is revealing a wealth of new riboviruses with uncharacterised phenotypes, precluding classification by traditional taxonomic methods. These are often classified on the basis of polymerase sequence identity, but standardised methods to support this approach are currently lacking. To address this need, we describe the polymerase palmprint, a well-defined segment of the palm sub-domain delineated by well-conserved catalytic motifs. We present a novel algorithm, Palmscan, which identifies palmprints in nucleotide and amino acid sequences. We describe PALMdb, a reference database of palmprints derived from public sequence databases. Palmscan source code and PALMdb data are deposited at https://github.com/rcedgar/palmscan and https://github.com/rcedgar/palmdb, respectively.

    Search related documents:
    Co phrase search for related documents
    • aa sequence and active site: 1
    • aa sequence and long sequence: 1
    • aa sequence and low sequence identity: 1
    • acid polymerase and active site: 1, 2, 3, 4
    • acid polymerase and long open reading frame: 1
    • acid polymerase and low frequency: 1
    • acid residue and active site: 1, 2, 3, 4, 5, 6, 7, 8, 9
    • acid residue and low frequency: 1
    • active site and long sequence: 1
    • active site and low sequence identity: 1
    • active site sequence and low sequence identity: 1
    • local alignment and long sequence: 1