Author: Klinger, J. E.; Ravarani, C. N. J.; Baukmann, H. A.; Cope, J. L.; Boettinger, E. P.; Konigorski, S.; Schmidt, M. F.
Title: Interaction-based feature selection algorithm outperforms polygenic risk score in predicting Parkinson's Disease status Cord-id: uqj5e20t Document date: 2021_7_23
ID: uqj5e20t
Snippet: Polygenic risk scores (PRS) aggregating results from genome-wide association studies are state of the art to predict the susceptibility to complex traits or diseases. Novel machine learning algorithms that use large amounts of data promise to find gene-gene interactions in order to build models with better predictive performance than PRS. Here, we present a data preprocessing step by using data-mining of contextual information to reduce the number of features, enabling machine learning algorithm
Document: Polygenic risk scores (PRS) aggregating results from genome-wide association studies are state of the art to predict the susceptibility to complex traits or diseases. Novel machine learning algorithms that use large amounts of data promise to find gene-gene interactions in order to build models with better predictive performance than PRS. Here, we present a data preprocessing step by using data-mining of contextual information to reduce the number of features, enabling machine learning algorithms to identify gene-gene interactions. We applied our approach to the Parkinson's Progression Markers Initiative (PPMI) dataset, an observational clinical study of 471 genotyped subjects (368 cases and 152 controls). With an AUC of 0.85 (95% CI = [0.72; 0.96]), the interaction-based prediction model outperforms the PRS (AUC of 0.58 (95% CI = [0.42; 0.81])). Furthermore, feature importance analysis of the model provided insights into the mechanism of Parkinson's Disease. For instance, the model revealed an interaction of previously described drug target candidate genes TMEM175 and GAPDHP25. These results demonstrate that interaction-based machine learning models can improve genetic prediction models and might provide an answer to the missing heritability problem.
Search related documents:
Co phrase search for related documents- additive model and logistic regression classifier: 1
- logistic regression and machine learn: 1
Co phrase search for related documents, hyperlinks ordered by date