Author: Wang, Shiliang; Sundaram, Jaideep P.; Stockwell, Timothy B.
Title: VIGOR extended to annotate genomes for additional 12 different viruses
Document date: 2012_6_4
ID: wd3ir3wg_26
Document: Twenty-seven complete MPV genomes were used to evaluate VIGOR. VIGOR detected a total of 243 genes, the same number annotated in GenBank. VIGOR predictions agreed completely with the GenBank annotations for 235 genes. In four genes, VIGOR detected internal stop codons in the coding regions, and the truncated proteins were shorter than 95% of the reference protein length. VIGOR therefore categorized these four genes as pseudogenes, whereas GenBank annotates them as functional genes. Their start codons in GenBank are the same as those predicted by VIGOR, and their stop codons in GenBank are the same as the internal stop codons detected by VIGOR. The other four cases in which VIGOR gene predictions differed from the GenBank annotations all involve the M2 ORF1 gene (M2-1). Two versions of the M2-1 protein exist in the GenBank protein database, and the longer one has five additional amino acids at the N-terminus. VIGOR always selects the longer protein as the reference sequence, so it selected the upstream ATG as the start codon in these four genes. In the corresponding GenBank annotations, the downstream ATG was annotated as the start codon.
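The pseudogene rule described above can be illustrated with a minimal Python sketch. It is not VIGOR's implementation: the function names, the return labels, and the use of the standard stop codons (TAA, TAG, TGA) are assumptions made for illustration. Only the 95% length threshold comes from the text.

```python
from typing import Optional

# Standard genetic code stop codons (assumed here; DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_codon_index(cds: str) -> Optional[int]:
    """Return the 0-based codon index of the first in-frame stop codon, or None."""
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


def classify_gene(cds: str, reference_protein_length: int,
                  min_fraction: float = 0.95) -> str:
    """Hypothetical classifier: a gene is labelled a pseudogene when the protein
    translated up to the first in-frame stop codon is shorter than
    min_fraction of the reference protein length."""
    stop_index = first_stop_codon_index(cds)
    if stop_index is None:
        return "no_stop_found"
    # Codons before the first stop equal the number of translated amino acids.
    translated_length = stop_index
    if translated_length < min_fraction * reference_protein_length:
        return "pseudogene"
    return "functional"


# Toy sequences (invented for illustration, not taken from any MPV genome).
full_length = "ATG" + "GCT" * 9 + "TAA"                   # 10 amino acids
truncated = "ATG" + "GCT" * 4 + "TAA" + "GCT" * 5 + "TAA"  # stop after 5 amino acids

print(classify_gene(full_length, reference_protein_length=10))
print(classify_gene(truncated, reference_protein_length=10))
```

Under this rule, a GenBank record that ends its coding region at the internal stop codon would still be flagged, because the comparison is against the reference protein length rather than against the annotated coordinates. The counts reported in the text are also consistent with each other: 235 agreements plus 4 pseudogene calls plus 4 M2-1 start codon differences give 243 genes.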