Selected article for: "accurately predict and low dimensional"

Author: Bjerre-Nielsen, Andreas; Kassarnig, Valentin; Lassen, David Dreyer; Lehmann, Sune
Title: Task-specific information outperforms surveillance-style big data in predictive analytics
  • Cord-id: 0oz06jv0
  • Document date: 2021_4_6
  • ID: 0oz06jv0
    Snippet: Increasingly, human behavior can be monitored through the collection of data from digital devices revealing information on behaviors and locations. In the context of higher education, a growing number of schools and universities collect data on their students with the purpose of assessing or predicting behaviors and academic performance, and the COVID-19–induced move to online education dramatically increases what can be accumulated in this way, raising concerns about students’ privacy. We f
    Document: Increasingly, human behavior can be monitored through the collection of data from digital devices revealing information on behaviors and locations. In the context of higher education, a growing number of schools and universities collect data on their students with the purpose of assessing or predicting behaviors and academic performance, and the COVID-19–induced move to online education dramatically increases what can be accumulated in this way, raising concerns about students’ privacy. We focus on academic performance and ask whether predictive performance for a given dataset can be achieved with less privacy-invasive, but more task-specific, data. We draw on a unique dataset on a large student population containing both highly detailed measures of behavior and personality and high-quality third-party reported individual-level administrative data. We find that models estimated using the big behavioral data are indeed able to accurately predict academic performance out of sample. However, models using only low-dimensional and arguably less privacy-invasive administrative data perform considerably better and, importantly, do not improve when we add the high-resolution, privacy-invasive behavioral data. We argue that combining big behavioral data with “ground truth” administrative registry data can ideally allow the identification of privacy-preserving task-specific features that can be employed instead of current indiscriminate troves of behavioral data, with better privacy and better prediction resulting.

    Search related documents:
    Co phrase search for related documents
    • academic outcome and logistic regression: 1
    • academic performance and logistic regression: 1, 2, 3, 4, 5, 6, 7
    • academic performance and machine learning: 1, 2, 3, 4, 5
    • accuracy measure and logistic regression: 1, 2, 3, 4
    • accuracy measure and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
    • accuracy measure and machine learning approach: 1