Author: Kshirsagar, Meghana; Carbonell, Jaime; Klein-Seetharaman, Judith
Title: Multitask learning for host–pathogen protein interactions Document date: 2013_7_1
ID: sdgt2ms5_49
Snippet: For each protein pair, we compute features similar to the work in Kshirsagar et al. (2012) . Some features use both proteins in the pair, while some others are based on either the host protein or the pathogen protein. While the features used for S.typhi were obtained directly from the authors, those for the other three datasets were derived from the following attributes of proteins available in public databases: protein sequences from Uniprot (Un.....
Document: For each protein pair, we compute features similar to the work in Kshirsagar et al. (2012) . Some features use both proteins in the pair, while some others are based on either the host protein or the pathogen protein. While the features used for S.typhi were obtained directly from the authors, those for the other three datasets were derived from the following attributes of proteins available in public databases: protein sequences from Uniprot (UniProt Consortium, 2011), gene ontology from GO database (Ashburner et al., 2000) , gene expression from GEO (Barrett et al., 2011) , properties of human proteins in the human PPI network. Owing to the lack of space, we briefly mention only some of the prominent features here, and encourage the readers to refer to the supplementary for details. The sequence features count the frequency of amino acid-based n-grams or n-mers (for n ¼ 2, 3, 4, 5) in the protein sequence. The GO features count the co-occurrence of host-pathogen GO term combinations. The human PPI network-based features compute various graph properties like node-degree, betweenness-centrality, clustering coefficient of the human protein.
Search related documents:
Co phrase search for related documents- acid base and gene expression: 1
- available protein and betweenness centrality: 1
- available protein and gene expression: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
- available protein and gene ontology: 1, 2, 3, 4
- available protein and host pathogen: 1, 2, 3, 4, 5
- available protein and host protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
- betweenness centrality and gene expression: 1
- betweenness centrality and gene ontology: 1, 2
- betweenness centrality and host pathogen: 1
- betweenness centrality and host protein: 1, 2
- database gene ontology and gene expression: 1, 2, 3, 4, 5
- database gene ontology and gene ontology: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17
- gene expression and host pathogen: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
- gene expression and host protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
- gene ontology and host pathogen: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
- gene ontology and host protein: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
Co phrase search for related documents, hyperlinks ordered by date