Author: Kirillova, Svetlana; Kumar, Suresh; Carugo, Oliviero
Title: Protein Domain Boundary Predictions: A Structural Biology Perspective
Document date: 2009_1_21
ID: qrnhp1ek_26
Document: It is thus easy to select a threshold value t and to predict that a protein contains only one domain if it has fewer than t residues and that it is a multi-domain protein if it has more than t residues. Table 2 shows the mcc values [see equation (1)] observed at various threshold values and validated with a jackknife procedure for the proteins examined in the CASP7 experiment. The mcc values are smaller for very small or very large values of the threshold. In contrast, they are rather large (>0.6) for intermediate threshold values, and the highest mcc (0.628) is observed with a threshold of 200 residues. This prediction approach is clearly very naive. It simply assumes that a protein domain is unlikely to be very large and, as a consequence, that larger proteins are more likely to contain two or more domains. A protein is predicted to contain a single domain if it contains fewer residues than t, and it is predicted to contain more than one domain if it has a number of residues larger than t. Data are taken from the proteins examined in the CASP7 experiment.
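The length-threshold rule described above can be illustrated with a minimal Python sketch. It assumes that mcc denotes the standard Matthews correlation coefficient, since equation (1) is not reproduced in this excerpt. The function names and the toy protein lengths are hypothetical and are not CASP7 data. The text does not say how a protein of exactly t residues is classified, so the sketch treats it as single-domain. The jackknife validation is not reproduced here.

```python
import math

def predict_multi_domain(length, t):
    """Predict multi-domain (True) if the protein has more than t residues.

    Assumption: a protein of exactly t residues is treated as single-domain.
    """
    return length > t

def mcc(predicted, actual):
    """Matthews correlation coefficient for two equal-length boolean lists."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    tn = sum(1 for p, a in pairs if not p and not a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: return 0.0 when the coefficient is undefined
    # (for example, when every protein receives the same prediction).
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom

# Hypothetical toy data: sequence lengths and whether each protein
# truly contains more than one domain.
lengths = [90, 150, 180, 220, 310, 450]
is_multi = [False, False, True, False, True, True]

# Scan several thresholds, as done for Table 2 in the source text.
for t in (50, 100, 200, 300, 500):
    predictions = [predict_multi_domain(n, t) for n in lengths]
    print(t, round(mcc(predictions, is_multi), 3))
```

On such data, extreme thresholds place every protein in the same class, which drives the coefficient to zero. This matches the observation that mcc is small for very small or very large thresholds.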