Author: Chlioui, Imane; Abnane, Ibtissam; Idri, Ali
Title: Comparing Statistical and Machine Learning Imputation Techniques in Breast Cancer Classification
  • Cord-id: qgf5iub5
  • Document date: 2020_8_19
    Document: Missing data imputation is an important task when dealing with crucial data that cannot be discarded, such as medical data. This study evaluates and compares the impact of two statistical and two machine learning imputation techniques on the classification of breast cancer patients, using several evaluation metrics. Mean, Expectation-Maximization (EM), Support Vector Regression (SVR) and K-Nearest Neighbor (KNN) imputation were applied to the two Wisconsin datasets, in which 18% of the values were missing completely at random (MCAR). We then empirically evaluated these four imputation techniques with five classifiers: decision tree (C4.5), Case Based Reasoning (CBR), Random Forest (RF), Support Vector Machine (SVM) and Multi-Layer Perceptron (MLP). In total, 1380 experiments were conducted. The findings confirmed that classification using machine learning-based imputation outperformed classification using statistical imputation. Moreover, our experiments showed that SVR was the best imputation method for breast cancer classification.
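
The setup described in the abstract (removing values completely at random, then filling them in with a statistical or a machine learning imputer) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`mask_mcar`, `impute_mean`, `impute_knn`), the distance rescaling, and the default `k` are assumptions made for illustration, and only mean and KNN imputation are shown. EM, SVR and the five classifiers are omitted.

```python
import math
import random


def mask_mcar(rows, rate, seed=0):
    """Return a copy of rows with about `rate` of all cells set to None.

    Cells are drawn uniformly at random, independently of their values,
    which is what "missing completely at random" (MCAR) means.
    """
    rng = random.Random(seed)
    cells = [(i, j) for i in range(len(rows)) for j in range(len(rows[0]))]
    masked = [list(r) for r in rows]
    for i, j in rng.sample(cells, round(rate * len(cells))):
        masked[i][j] = None
    return masked


def impute_mean(rows):
    """Statistical baseline: fill each None with its column's observed mean."""
    means = []
    for j in range(len(rows[0])):
        observed = [r[j] for r in rows if r[j] is not None]
        # Assumption: a column with no observed value falls back to 0.0.
        means.append(sum(observed) / len(observed) if observed else 0.0)
    return [[means[j] if v is None else v for j, v in enumerate(r)] for r in rows]


def _distance(a, b):
    """Euclidean distance over features observed in both rows.

    The squared sum is rescaled by (total features / shared features) so that
    rows sharing few features are not favoured. Returns inf if none are shared.
    """
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(squared * len(a) / len(shared))


def impute_knn(rows, k=3):
    """Fill each None with the mean of that feature over the k nearest rows."""
    fallback = impute_mean(rows)  # used when no comparable donor row exists
    result = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            donors = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                d = _distance(row, other)
                if d != math.inf:
                    donors.append((d, other[j]))
            donors.sort(key=lambda pair: pair[0])
            nearest = donors[:k]
            if nearest:
                result[i][j] = sum(v for _, v in nearest) / len(nearest)
            else:
                result[i][j] = fallback[i][j]
    return result
```

Calling `mask_mcar(data, 0.18)` on a complete numeric table and passing the result to either imputer returns a table with no missing cells. Comparing the two outputs on the same mask mirrors, in miniature, the statistical versus machine learning comparison the study reports.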