Author: Alguwaizani, Saud; Park, Byungkyu; Zhou, Xiang; Huang, De-Shuang; Han, Kyungsook
Title: Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids
  • Document date: 2018_5_9
  • ID: 0dxrai3j_13
    Document: Machine learning-based approaches to PPI prediction require both positive and negative PPI data, but negative data are not available in databases. Constructing a negative dataset of PPIs is not straightforward because there are no experimentally verified noninteracting pairs [17]. Eid et al. [9], for example, used negative sampling for their negative dataset. In our study, we constructed a negative dataset from human proteins whose sequence similarity to every human protein in the positive dataset is lower than 40%, as determined by running CD-HIT [18]. Our negative dataset includes 2,819 interactions between 90 virus proteins and 2,819 human proteins. The training and test datasets constructed in this study are available in Additional files 1 and 2.
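
    The negative-set construction described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the study used CD-HIT for the similarity filter, whereas the sketch substitutes Python's difflib.SequenceMatcher as a toy stand-in so that it runs without external tools. The function names, the example sequences, and the random assignment of a virus protein to each retained human protein are all assumptions; the excerpt does not state how virus partners were chosen.

```python
import random
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Toy similarity in [0, 1]. A stand-in only: NOT the identity CD-HIT computes."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def filter_dissimilar(candidates: dict, positives: dict, threshold: float = 0.40) -> dict:
    """Keep candidate human proteins whose similarity to EVERY human protein
    in the positive set is below the threshold (40% in the excerpt)."""
    return {
        pid: seq
        for pid, seq in candidates.items()
        if all(similarity(seq, pos) < threshold for pos in positives.values())
    }


def build_negative_pairs(virus_ids, human_ids, seed: int = 0):
    """Pair each retained human protein with one virus protein.
    Random assignment is an assumption made for this sketch."""
    rng = random.Random(seed)
    return [(rng.choice(virus_ids), h) for h in sorted(human_ids)]


# Hypothetical example data (made-up IDs and sequences).
positive_humans = {"P1": "MKTAYIAKQRQISFVKSHFSRQ"}
candidate_humans = {
    "C1": "MKTAYIAKQRQISFVKSHFSRA",  # nearly identical to P1, so it is dropped
    "C2": "GGGGGGGGGGPPPPPPPPPP",    # shares no residues with P1, so it is kept
}
virus_proteins = ["V1", "V2"]

kept = filter_dissimilar(candidate_humans, positive_humans)
negative_pairs = build_negative_pairs(virus_proteins, kept)
print(negative_pairs)
```

    Because each retained human protein is used exactly once, the number of negative pairs equals the number of retained human proteins, which matches the 2,819 interactions over 2,819 human proteins reported in the excerpt.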
