
Author: Almeida, Rui Jorge; Adriaans, Greetje; Shapovalova, Yuliya
Title: Graphical Causal Models and Imputing Missing Data: A Preliminary Study
  • Cord-id: 4l3ztamw
  • Document date: 2020_5_18
    Document: Real-world datasets often contain many missing values, for a variety of reasons. This is usually a problem because many learning algorithms require complete datasets. In some cases, constraints in the real-world problem make it difficult to observe all data continuously. In this paper, we investigate whether graphical causal models can be used to impute missing values and to derive additional information on the uncertainty of the imputed values. Our goal is to use the information from a complete dataset, in the form of graphical causal models, to impute missing values in an incomplete dataset. This assumes that both datasets share the same data-generating process. Furthermore, we calculate the probability that each missing data value belongs to a specified percentile. We present a preliminary study of the proposed method using synthetic data, where we can control the causal relations and the missing values.
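
The idea in the abstract can be pictured with a minimal sketch. It assumes a two-variable causal graph X -> Y with a linear Gaussian relation, which is an illustrative choice and not necessarily the model the authors use. The function names, the synthetic coefficients, and the quartile bands are all hypothetical. The edge is fitted on a complete dataset, then used to impute a missing Y from an observed X and to estimate the probability that the missing value falls in each percentile band of the complete data.

```python
import random
from statistics import NormalDist, mean


def fit_linear_edge(xs, ys):
    """Least-squares fit of y = a + b*x + noise; returns (a, b, residual_sd)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
    sd = (sum(r * r for r in residuals) / (len(xs) - 2)) ** 0.5
    return a, b, sd


def percentile_cutoffs(values, fractions):
    """Empirical cutoffs of the complete data (simple nearest-rank rule)."""
    ordered = sorted(values)
    return [ordered[min(len(ordered) - 1, int(f * len(ordered)))] for f in fractions]


def impute_with_uncertainty(x, params, cutoffs):
    """Impute Y given X = x and return the probability of each percentile band."""
    a, b, sd = params
    conditional = NormalDist(mu=a + b * x, sigma=sd)
    # CDF at each cutoff, padded with 0 and 1 so that differences give band masses.
    cdf = [0.0] + [conditional.cdf(c) for c in cutoffs] + [1.0]
    band_probs = [hi - lo for lo, hi in zip(cdf, cdf[1:])]
    return conditional.mean, band_probs


# Synthetic complete dataset with a known causal relation (assumed: Y = 2 + 3X + noise).
rng = random.Random(0)
complete_x = [rng.gauss(0.0, 1.0) for _ in range(500)]
complete_y = [2.0 + 3.0 * x + rng.gauss(0.0, 0.5) for x in complete_x]

params = fit_linear_edge(complete_x, complete_y)
cutoffs = percentile_cutoffs(complete_y, [0.25, 0.50, 0.75])

# A record from the incomplete dataset where X is observed and Y is missing.
imputed_y, band_probs = impute_with_uncertainty(1.0, params, cutoffs)
```

Here `imputed_y` is the conditional mean of Y given the observed X, and `band_probs` lists the estimated probabilities that the missing value lies in each quartile of the complete data. Both rely on the assumption, stated in the abstract, that the complete and incomplete datasets share the same data-generating process.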
