Author: Kshirsagar, Meghana; Carbonell, Jaime; Klein-Seetharaman, Judith
Title: Multitask learning for host–pathogen protein interactions Document date: 2013_7_1
ID: sdgt2ms5_49
Snippet: For each protein pair, we compute features similar to the work in Kshirsagar et al. (2012) . Some features use both proteins in the pair, while some others are based on either the host protein or the pathogen protein. While the features used for S.typhi were obtained directly from the authors, those for the other three datasets were derived from the following attributes of proteins available in public databases: protein sequences from Uniprot (Un.....
Document: For each protein pair, we compute features similar to the work in Kshirsagar et al. (2012) . Some features use both proteins in the pair, while some others are based on either the host protein or the pathogen protein. While the features used for S.typhi were obtained directly from the authors, those for the other three datasets were derived from the following attributes of proteins available in public databases: protein sequences from Uniprot (UniProt Consortium, 2011), gene ontology from GO database (Ashburner et al., 2000) , gene expression from GEO (Barrett et al., 2011) , properties of human proteins in the human PPI network. Owing to the lack of space, we briefly mention only some of the prominent features here, and encourage the readers to refer to the supplementary for details. The sequence features count the frequency of amino acid-based n-grams or n-mers (for n ¼ 2, 3, 4, 5) in the protein sequence. The GO features count the co-occurrence of host-pathogen GO term combinations. The human PPI network-based features compute various graph properties like node-degree, betweenness-centrality, clustering coefficient of the human protein.
Search related documents:
Co phrase search for related documents- acid base and gene expression: 1
- acid base and protein sequence: 1, 2
- available protein and betweenness centrality: 1
- available protein and gene expression: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
- available protein and gene ontology: 1, 2, 3, 4
- available protein and host pathogen: 1, 2, 3, 4, 5
- available protein and host protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
- available protein and human protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31
- available protein and pathogen protein: 1, 2, 3
- available protein and PPI network: 1
- available protein and protein sequence: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34
Co phrase search for related documents, hyperlinks ordered by date