Selected article for: "commonly sequence and gene sequence"

Author: Jabado, Omar J.; Liu, Yang; Conlan, Sean; Quan, P. Lan; Hegyi, Hédi; Lussier, Yves; Briese, Thomas; Palacios, Gustavo; Lipkin, W. I.
Title: Comprehensive viral oligonucleotide probe design using conserved protein regions
  • Document date: 2007_12_13
  • ID: xfzhn1n1_24
    Snippet: The viral sequence database is dominated by gene fragments We queried the EMBL viral database to determine the frequencies of coding sequences and full genomic sequences. The majority of viral sequences were <1 kb 1982 1984 1986 1988 1990 1992 1994 1996 1998 A commonly used method to reduce sequence complexity is generation of a non-redundant sequence set by clustering (33) . We grouped sequences at the 98% identity level and selected the longest.....
    Document: The viral sequence database is dominated by gene fragments We queried the EMBL viral database to determine the frequencies of coding sequences and full genomic sequences. The majority of viral sequences were <1 kb 1982 1984 1986 1988 1990 1992 1994 1996 1998 A commonly used method to reduce sequence complexity is generation of a non-redundant sequence set by clustering (33) . We grouped sequences at the 98% identity level and selected the longest sequences as unique representatives of each group. This method was used to assess the growth of sequence diversity between January 2000 and the current release of July 2007. The database grew 600% in the 7-year period; doubling every three years. Unique sequences decreased as a proportion of the database, from 42% to 27%; overall growth of unique sequences was 378% (Figure 1b) . The current database comprised 74 044 unique sequence representatives at the 98% similarity level. Thus, the growth in the number of sequences in the viral database has been rapid, while growth in diversity has been more modest.

    Search related documents:
    Co phrase search for related documents
    • sequence database and viral database: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
    • sequence database and viral sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
    • sequence database and viral sequence database: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
    • sequence diversity and unique representative: 1
    • sequence diversity and unique sequence: 1
    • sequence diversity and viral database: 1
    • sequence diversity and viral sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28
    • sequence group and unique representative: 1
    • sequence group and unique sequence: 1
    • sequence group and viral sequence: 1, 2, 3, 4
    • sequence number and unique sequence: 1, 2, 3
    • sequence number and viral database: 1, 2, 3
    • sequence number and viral sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17
    • sequence number and viral sequence database: 1, 2
    • sequence number and viral sequence majority: 1
    • sequence representative and viral sequence: 1, 2, 3
    • similarity level and viral sequence: 1
    • unique representative and viral sequence: 1
    • unique sequence and viral sequence: 1, 2, 3, 4, 5, 6, 7, 8