Selected article for: "acid amino nucleotide and amino acid"

Author: Zuo, Guanghong; Xu, Zhao; Yu, Hongjie; Hao, Bailin
Title: Jackknife and Bootstrap Tests of the Composition Vector Trees
  • Document date: 2011_3_5
  • ID: vm5zjr64_7
    Snippet: As CV method does not use sequence alignment, statistical re-sampling cannot be carried out in the usual way of random choice of nucleotide or amino acid sites with replacement. Instead, we pick up proteins at random from the pool of all proteins in the genome of an organism. We used four datasets of protein sequences encoded in the genome: On each of these datasets, jackknife and bootstrap tests are performed in the following way. In the CV meth.....
    Document: As CV method does not use sequence alignment, statistical re-sampling cannot be carried out in the usual way of random choice of nucleotide or amino acid sites with replacement. Instead, we pick up proteins at random from the pool of all proteins in the genome of an organism. We used four datasets of protein sequences encoded in the genome: On each of these datasets, jackknife and bootstrap tests are performed in the following way. In the CV method, a species is represented by a composition vector made of overlapping K-residues, designated as "K-peptides" hereafter, from all proteins in the genome. To do jackknife tests, we first take randomly 90% of proteins from the whole protein pool. This is done for all species and a CVTree is constructed by carrying out the crucial "subtraction procedure" (1). The topological distance between this tree and the original CVTree inferred from the whole protein pool is calculated. This re-sampling is performed 100 times and the average topological distance between these 100 trees and the original 100% CVTree at the same K is taken. Then the protein fraction is decreased to 80%, 70%, …, 10% and the average topological distance at a given K is plotted against the protein fraction (Figure 1) .

    Search related documents:
    Co phrase search for related documents
    • amino acid and composition vector: 1, 2, 3, 4, 5
    • amino acid and cv method: 1
    • amino acid and genome encode: 1, 2, 3, 4
    • amino acid and genome protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • amino acid and jackknife test: 1
    • amino acid nucleotide and bootstrap test: 1, 2
    • amino acid nucleotide and genome protein: 1, 2, 3, 4, 5, 6, 7
    • bootstrap test and composition vector: 1
    • composition vector and cv method: 1, 2, 3
    • composition vector and jackknife test: 1
    • cv method and jackknife test: 1