Selected article for: "low sequence similarity and model test"

Author: Alguwaizani, Saud; Park, Byungkyu; Zhou, Xiang; Huang, De-Shuang; Han, Kyungsook
Title: Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids
  • Document date: 2018_5_9
  • ID: 0dxrai3j_34
    Snippet: We tested the prediction model on independent datasets of virus-host PPIs, which were not used in training the model and have a very low sequence similarity to any protein in training datasets of the model. Despite a low sequence similarity between proteins in training datasets and target proteins in test datasets, the prediction model showed a high performance comparable to the best performance of other methods for single virus-host PPIs. In com.....
    Document: We tested the prediction model on independent datasets of virus-host PPIs, which were not used in training the model and have a very low sequence similarity to any protein in training datasets of the model. Despite a low sequence similarity between proteins in training datasets and target proteins in test datasets, the prediction model showed a high performance comparable to the best performance of other methods for single virus-host PPIs. In comparison of our method with others using same datasets, our method outperformed the others. Experimental results demonstrate that the repeat patterns and composition of amino acids are simple, yet powerful features for predicting virus-host PPIs. e method can be used to find potential PPIs of new viruses or hosts, for which little information is known. TR2 TS6 TR2 TS7 TR2 TS8 TR2 TS9 TR2 TS10 #PPIs 689 191 689 125 689 86 689 57 689 78 #Virus proteins 35 116 35 34 35 24 35 10 35 27 #Host proteins 522 141 522 87 522 79 522 38 522 64 #Virus proteins common to TR and TS 9 (7.8%) 1 (2.9%) 4 (16.7%) 0 (0.0%) 0 (0.0%) e numbers in parentheses represent the proportion of common proteins to proteins in test datasets.

    Search related documents:
    Co phrase search for related documents
    • amino acid and experimental result: 1, 2, 3, 4
    • amino acid and good performance: 1, 2, 3, 4
    • amino acid and high performance: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
    • amino acid and host new virus: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
    • amino acid and host protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • amino acid and independent dataset: 1, 2, 3, 4, 5, 6, 7, 8
    • amino acid and little information: 1, 2, 3, 4, 5
    • amino acid and method comparison: 1, 2, 3
    • amino acid and method outperform: 1
    • common protein and high performance: 1, 2
    • common protein and host protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
    • experimental result and high performance: 1
    • experimental result and host protein: 1
    • good performance and high performance: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
    • good performance and host protein: 1
    • good performance and independent dataset: 1
    • good performance and little information: 1, 2
    • good performance and method comparison: 1
    • good performance and model train: 1, 2