Selected article for: "protein length and total number"

Author: Wang, Shiliang; Sundaram, Jaideep P.; Stockwell, Timothy B.
Title: VIGOR extended to annotate genomes for additional 12 different viruses
  • Document date: 2012_6_4
  • ID: wd3ir3wg_26
    Snippet: Twenty-seven complete MPV genomes were used to evaluate VIGOR. A total of 243 genes were detected by VIGOR, the same number of genes were annotated in GenBank. VIGOR predictions completely agreed with GenBank annotations for 235 genes. VIGOR detected internal stop codons in the coding regions of four genes and the truncated proteins were shorter than 95% of the reference protein length. VIGOR, therefore, categorized these four genes as pseudogene.....
    Document: Twenty-seven complete MPV genomes were used to evaluate VIGOR. A total of 243 genes were detected by VIGOR, the same number of genes were annotated in GenBank. VIGOR predictions completely agreed with GenBank annotations for 235 genes. VIGOR detected internal stop codons in the coding regions of four genes and the truncated proteins were shorter than 95% of the reference protein length. VIGOR, therefore, categorized these four genes as pseudogenes. These genes were annotated as functional genes in GenBank. The start codons are the same as these predicted by VIGOR. The stop codons in GenBank of these genes are same as the internal stop codons detected by VIGOR. The other four cases, in which VIGOR gene predictions were not same as GenBank annotations, are the M2 ORF1 gene (M2-1). Two versions of M2-1 protein exist in GenBank protein database. The longer protein has five additional amino acids at the N-terminus. VIGOR always selects the longer protein to be the reference sequence. The upstream ATG was therefore selected as the start codon in these four genes. In the corresponding GenBank annotations, the downstream ATG were annotated as the start codons for these four genes.

    Search related documents:
    Co phrase search for related documents
    • amino acid and stop codon: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • code region and start codon: 1
    • functional gene and start codon: 1, 2
    • functional gene and stop codon: 1, 2, 3
    • GenBank annotation and start codon: 1, 2, 3
    • GenBank annotation and stop codon: 1
    • GenBank protein and start codon: 1, 2
    • GenBank protein database and start codon: 1, 2
    • GenBank protein database exist and start codon: 1, 2
    • GenBank stop codon and stop codon: 1
    • gene number and stop codon: 1
    • gene prediction and start codon: 1, 2
    • gene prediction and stop codon: 1, 2
    • gene start codon and start codon: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
    • gene start codon and stop codon: 1, 2
    • internal stop codon and start codon: 1
    • internal stop codon and stop codon: 1, 2, 3, 4
    • long protein and start codon: 1
    • long protein and stop codon: 1