Selected article for: "amino acid and low complexity"

Author: Kistler, Amy L; Gancz, Ady; Clubb, Susan; Skewes-Cox, Peter; Fischer, Kael; Sorber, Katherine; Chiu, Charles Y; Lublin, Avishai; Mechani, Sara; Farnoushi, Yigal; Greninger, Alexander; Wen, Christopher C; Karlene, Scott B; Ganem, Don; DeRisi, Joseph L
Title: Recovery of divergent avian bornaviruses from cases of proventricular dilatation disease: Identification of a candidate etiologic agent
  • Document date: 2008_7_31
  • ID: 17qoax09_56
    Snippet: Reads sharing 100% identity to each other or the Solexa amplification primers were filtered, reducing our initial set of 1.4 million reads to a working set of 600,000 unique reads. In order to quickly assess the homology of this set of reads to different sequence databases, we employed an iterative strategy using ELAND (Efficient Local Alignment of Nucleotide Data) and BLAST analyses. To filter reads from our analysis potentially derived from psi.....
    Document: Reads sharing 100% identity to each other or the Solexa amplification primers were filtered, reducing our initial set of 1.4 million reads to a working set of 600,000 unique reads. In order to quickly assess the homology of this set of reads to different sequence databases, we employed an iterative strategy using ELAND (Efficient Local Alignment of Nucleotide Data) and BLAST analyses. To filter reads from our analysis potentially derived from psittacine host tissue, the working set of reads were aligned to a database of all Aves sequences from NCBI (n = 918,511) using ELAND, which tolerates no more than 2 base mismatches, and discards both low quality reads and reads with low sequence complexity. Reads that did not align to the Aves database by ELAND analysis were next re-aligned to the Aves database for high stringency blastn analysis (e = 10 -7 , word size = 11), followed by progressively lower stringencies (down to e = 10-2, word size = 8), corresponding to reads containing only 22 nucleotide identities to sequences in the Aves database. To identify reads with some homology to Bornaviridae sequences in the resulting set of 322,790 host-filtered reads, we re-implemented the ELAND/iterative blastn analysis strategy (down to ≥ 15 nucleotides identity) using a database of all NCBI BDV sequences (n = 207) augmented by our previously recovered ABV sequences (n = 5). An additional iterative tblastx analysis was incorporated to capture distantly related reads that shared similarity to the known BDV sequences only at the level of predicted amino acid sequence (down to ≥ 6 amino acid identity).

    Search related documents:
    Co phrase search for related documents
    • ABV sequence and amino acid sequence: 1
    • ABV sequence and BDV sequence: 1, 2, 3, 4
    • ABV sequence and blast analysis: 1
    • ABV sequence and Bornaviridae sequence: 1
    • ABV sequence and host filter: 1
    • ABV sequence and initial set: 1
    • ABV sequence and nucleotide identity: 1, 2
    • ABV sequence and sequence database: 1
    • amino acid and amplification primer: 1, 2
    • amino acid and analysis strategy: 1, 2, 3
    • amino acid and base mismatch: 1, 2
    • amino acid and blast analysis: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
    • amino acid and different sequence database: 1
    • amino acid and high stringency: 1
    • amino acid and host tissue: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
    • amino acid and initial set: 1, 2
    • amino acid and low quality: 1, 2
    • amino acid and low sequence complexity: 1, 2, 3, 4, 5, 6, 7
    • amino acid and nucleotide identity: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75