Author: Zuo, Guanghong; Xu, Zhao; Yu, Hongjie; Hao, Bailin
                    Title: Jackknife and Bootstrap Tests of the Composition Vector Trees  Document date: 2011_3_5
                    ID: vm5zjr64_7
                    
                    Snippet: As CV method does not use sequence alignment, statistical re-sampling cannot be carried out in the usual way of random choice of nucleotide or amino acid sites with replacement. Instead, we pick up proteins at random from the pool of all proteins in the genome of an organism. We used four datasets of protein sequences encoded in the genome: On each of these datasets, jackknife and bootstrap tests are performed in the following way. In the CV meth.....
                    
                    
                    
                     
                    
                    
                    
                    
                        
                            
                                Document: As CV method does not use sequence alignment, statistical re-sampling cannot be carried out in the usual way of random choice of nucleotide or amino acid sites with replacement. Instead, we pick up proteins at random from the pool of all proteins in the genome of an organism. We used four datasets of protein sequences encoded in the genome: On each of these datasets, jackknife and bootstrap tests are performed in the following way. In the CV method, a species is represented by a composition vector made of overlapping K-residues, designated as "K-peptides" hereafter, from all proteins in the genome. To do jackknife tests, we first take randomly 90% of proteins from the whole protein pool. This is done for all species and a CVTree is constructed by carrying out the crucial "subtraction procedure" (1). The topological distance between this tree and the original CVTree inferred from the whole protein pool is calculated. This re-sampling is performed 100 times and the average topological distance between these 100 trees and the original 100% CVTree at the same K is taken. Then the protein fraction is decreased to 80%, 70%, …, 10% and the average topological distance at a given K is plotted against the protein fraction (Figure 1) .
 
  Search related documents: 
                                Co phrase  search for related documents- amino acid and composition vector: 1, 2, 3, 4, 5
- amino acid and cv method: 1
- amino acid and genome encode: 1, 2, 3, 4
- amino acid and genome protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70
- amino acid and jackknife test: 1
- amino acid and organism genome: 1, 2
- amino acid and protein fraction: 1, 2, 3, 4, 5
- amino acid and protein pick: 1
- amino acid and protein pool: 1
- amino acid and protein sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83
- amino acid and protein sequence dataset: 1, 2
- amino acid and random choice: 1
- amino acid and sequence alignment: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77
- amino acid and topological distance: 1
- amino acid nucleotide and bootstrap test: 1, 2
- amino acid nucleotide and genome protein: 1, 2, 3, 4, 5, 6, 7
- amino acid nucleotide and protein sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
- amino acid nucleotide and sequence alignment: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
- amino acid nucleotide site and sequence alignment: 1
 
                                Co phrase  search for related documents, hyperlinks ordered by date