Selected article for: "homologous protein and related homologous protein"

Author: Sabath, Niv; Wagner, Andreas; Karlin, David
Title: Evolution of Viral Proteins Originated De Novo by Overprinting
  • Document date: 2012_7_19
  • ID: 629fwmgk_47
    Snippet: To identify de novo gene candidates, we applied the criterion of monophyly stated in the introduction: one of the genes in an overlapping pair-the ancestral gene-must occur in each member of a viral clade, whereas the other gene-the de novo gene-must be restricted to a single subclade, the focal clade ( fig. 1b) . To find genes that meet this criterion, we first identified, for each gene pair, homologous protein products in related genomes. (We f.....
    Document: To identify de novo gene candidates, we applied the criterion of monophyly stated in the introduction: one of the genes in an overlapping pair-the ancestral gene-must occur in each member of a viral clade, whereas the other gene-the de novo gene-must be restricted to a single subclade, the focal clade ( fig. 1b) . To find genes that meet this criterion, we first identified, for each gene pair, homologous protein products in related genomes. (We found no evidence of duplicated genes in the genomes under study, hence all our homologs are orthologs.) We aligned the homologous protein sequences of the ancestral protein (which is more phylogenetically widespread) and constructed their phylogenetic tree. By manual examination of these trees, we identified 12 cases of de novo origin (tables 1 and 3) that met the criterion of monophyly. We scanned the related genomes of the focal clade for unannotated ORFs to overcome missing genes due to fault annotation. These trees also allowed us to infer the internal node of a tree closest to the origin of the de novo gene. We call this node the last common ancestor (LCA, marked with a blue circle in the hypothetical example of fig. 1b ). To ensure that the identification of the LCA is not biased by genome annotation, we manually examined the related genomes for presence of homologous unannotated ORFs. We provide detailed explanations of the challenges in de novo gene and LCA identification in the supplementary information, Supplementary Material online (supplementary figs. S1-S4, Supplementary Material online). The phylogenetic tree and the corresponding genomic maps of the 12 cases are presented in supplementary figure S5, Supplementary Material online. For all gene pairs that met the monophyly criterion, we also determined the DNA sequence alignments corresponding to the amino acid sequence alignments (of the ancestral proteins), to enable the calculations described later.

    Search related documents:
    Co phrase search for related documents
    • amino acid and ancestral protein: 1, 2, 3, 4
    • amino acid and common ancestor: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • amino acid and de novo gene: 1, 2, 3, 4, 5
    • amino acid and de novo gene gene: 1, 2, 3, 4
    • amino acid and de novo gene origin: 1, 2
    • ancestral gene and common ancestor: 1, 2
    • ancestral gene and de novo gene: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
    • ancestral gene and de novo gene gene: 1, 2, 3, 4
    • ancestral gene and de novo gene origin: 1, 2
    • ancestral protein and common ancestor: 1, 2
    • ancestral protein and de novo gene: 1, 2, 3
    • ancestral protein and de novo gene gene: 1
    • ancestral protein and de novo gene origin: 1