Selected article for: "absolute lasso selection shrinkage operator and addition important"

Author: Davagdorj, Khishigsuren; Pham, Van Huy; Theera-Umpon, Nipon; Ryu, Keun Ho
Title: XGBoost-Based Framework for Smoking-Induced Noncommunicable Disease Prediction
  • Cord-id: onmjty6s
  • Document date: 2020_9_7
  • ID: onmjty6s
    Snippet: Smoking-induced noncommunicable diseases (SiNCDs) have become a significant threat to public health and cause of death globally. In the last decade, numerous studies have been proposed using artificial intelligence techniques to predict the risk of developing SiNCDs. However, determining the most significant features and developing interpretable models are rather challenging in such systems. In this study, we propose an efficient extreme gradient boosting (XGBoost) based framework incorporated w
    Document: Smoking-induced noncommunicable diseases (SiNCDs) have become a significant threat to public health and cause of death globally. In the last decade, numerous studies have been proposed using artificial intelligence techniques to predict the risk of developing SiNCDs. However, determining the most significant features and developing interpretable models are rather challenging in such systems. In this study, we propose an efficient extreme gradient boosting (XGBoost) based framework incorporated with the hybrid feature selection (HFS) method for SiNCDs prediction among the general population in South Korea and the United States. Initially, HFS is performed in three stages: (I) significant features are selected by t-test and chi-square test; (II) multicollinearity analysis serves to obtain dissimilar features; (III) final selection of best representative features is done based on least absolute shrinkage and selection operator (LASSO). Then, selected features are fed into the XGBoost predictive model. The experimental results show that our proposed model outperforms several existing baseline models. In addition, the proposed model also provides important features in order to enhance the interpretability of the SiNCDs prediction model. Consequently, the XGBoost based framework is expected to contribute for early diagnosis and prevention of the SiNCDs in public health concerns.

    Search related documents:
    Co phrase search for related documents
    • absolute lasso selection shrinkage operator and logistic regression: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45
    • absolute lasso selection shrinkage operator and lr logistic regression: 1, 2
    • absolute lasso selection shrinkage operator and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21
    • absolute lasso selection shrinkage operator and machine learning model: 1, 2, 3, 4, 5, 6
    • access memory and logistic regression: 1
    • access memory and low performance: 1
    • access memory and machine learning: 1, 2, 3
    • accuracy measure and activation function: 1, 2
    • accuracy measure and local global: 1
    • accuracy measure and logistic regression: 1, 2, 3, 4
    • accuracy measure and low performance: 1
    • accuracy measure and lr logistic regression: 1
    • accuracy measure and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
    • accuracy measure and machine learning model: 1, 2
    • accuracy score and activation function: 1, 2, 3
    • accuracy score and actual difference: 1
    • accuracy score and logistic regression: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38
    • accuracy score and low performance: 1
    • accuracy score and lr logistic regression: 1, 2, 3, 4