Author: Huang, Yi; Lau, Susanna K. P.; Woo, Patrick C. Y.; Yuen, Kwok-yung
Title: CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes Document date: 2007_10_2
ID: ujhgb3b0_19
Snippet: Rapid and accurate batch sequence retrieval is both the cornerstone and bottleneck for comparative gene or genome analysis. During the process of complete genome sequencing and comparative analysis of the various novel human and animal coronavirus genomes in the past 2 years, we have developed a comprehensive The first column is CoVDB gene id. In the Uniq column, 'Uniq' will be shown if there is no other identical sequence in CoVDB. Otherwise, ge.....
Document: Rapid and accurate batch sequence retrieval is both the cornerstone and bottleneck for comparative gene or genome analysis. During the process of complete genome sequencing and comparative analysis of the various novel human and animal coronavirus genomes in the past 2 years, we have developed a comprehensive The first column is CoVDB gene id. In the Uniq column, 'Uniq' will be shown if there is no other identical sequence in CoVDB. Otherwise, gene id of the sequences identical to it will be shown. database, CoVDB, of annotated coronavirus genes and genomes, which offers efficient batch sequence retrieval and analysis. As shown by our experience in using CoVDB for comparative genome analysis of novel coronaviruses we have discovered (4, 13, 16, 18, 19) , we find that CoVDB is more rapid and efficient than other existing coronavirus databases for batch sequence retrieval for the following reasons. First, we have performed annotation on all non-structural proteins in the polyprotein encoded by orf1ab of every single sequence. Second, annotation was performed for the non-structural proteins encoded by ORFs downstream to orf1ab using a standardized system, with some exceptions given to some names that have been used for a long time so as to minimize confusion. Third, all sequences with identical nucleotide sequences were labeled where one can choose to show or not to show strains with identical sequences. Fourth, CoVDB contains not only complete coronavirus genome sequences, but also incomplete genomes and their genes. Some genes of coronaviruses, such as pol, spike and nucleocapsid are sequenced much more frequently than others because they are either most conserved or least conserved. These gene sequences are particularly important for evolutionary analysis, single nucleotide polymorphism studies and design of primers for RT-PCR or quantitative RT-PCR amplification.
Search related documents:
Co phrase search for related documents- accurate rapid batch sequence retrieval and CoVDB database: 1
- analysis batch sequence retrieval and coronavirus database: 1, 2, 3
- analysis batch sequence retrieval and coronavirus gene: 1
- analysis batch sequence retrieval and coronavirus genome sequence: 1
- analysis batch sequence retrieval and CoVDB database: 1, 2
- animal human and coronavirus database: 1
- animal human and coronavirus gene: 1, 2, 3, 4, 5
- animal human and coronavirus genome sequence: 1, 2
- animal human and CoVDB database: 1
- animal human coronavirus genome and coronavirus genome sequence: 1
- annotated coronavirus gene and coronavirus gene: 1
- batch sequence and coronavirus database: 1, 2, 3
- batch sequence and coronavirus gene: 1, 2
- batch sequence and coronavirus genome sequence: 1
- batch sequence and CoVDB database: 1, 2
- batch sequence retrieval and coronavirus database: 1, 2, 3
- batch sequence retrieval and coronavirus gene: 1, 2
- batch sequence retrieval and coronavirus genome sequence: 1
- batch sequence retrieval and CoVDB database: 1, 2
Co phrase search for related documents, hyperlinks ordered by date