Selected article for: "activity study and machine learning"

Author: Jaganathan, Keerthana; Tayara, Hilal; Chong, Kil To
Title: Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets
  • Cord-id: mogmtg7g
  • Document date: 2021_7_28
  • ID: mogmtg7g
    Snippet: Drug-induced liver toxicity is one of the significant safety challenges for the patient’s health and the pharmaceutical industry. It causes termination of drug candidates in clinical trials and also the retractions of approved drugs from the market. Thus, it is essential to identify hepatotoxic compounds in the initial stages of drug development process. The purpose of this study is to construct quantitative structure activity relationship models using machine learning algorithms and systemati
    Document: Drug-induced liver toxicity is one of the significant safety challenges for the patient’s health and the pharmaceutical industry. It causes termination of drug candidates in clinical trials and also the retractions of approved drugs from the market. Thus, it is essential to identify hepatotoxic compounds in the initial stages of drug development process. The purpose of this study is to construct quantitative structure activity relationship models using machine learning algorithms and systematical feature selection methods for molecular descriptor sets. The models were built from a large and diverse set of 1253 drug compounds and were validated internally with 10-fold cross-validation. In this study, we applied a variety of feature selection techniques to extract the optimal subset of descriptors as modeling features to improve the prediction performance. Experimental results suggested that the support vector machine-based classifier had achieved a better classification accuracy with reduced molecular descriptors. The final optimal model provides an accuracy of 0.811, a sensitivity of 0.840, a specificity of 0.783 and Mathew’s correlation coefficient of 0.623 with an internal validation set. Furthermore, this model outperformed the prior studies while evaluated in both the internal and external test sets. The utilization of distinct optimal molecular descriptors as modeling features produce an in silico model with a superior performance.

    Search related documents:
    Co phrase search for related documents
    • activity relationship and logistic regression: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
    • activity relationship and low correlation: 1, 2, 3
    • additive explanations and logistic regression: 1, 2, 3, 4, 5, 6
    • logistic regression and low dimensional: 1