Author: Murray, Benjamin; Kerfoot, Eric; Graham, Mark S.; Sudre, Carole H.; Molteni, Erika; Canas, Liane S.; Antonelli, Michela; Visconti, Alessia; Chan, Andrew T.; Franks, Paul W.; Davies, Richard; Wolf, Jonathan; Spector, Tim; Steves, Claire J.; Modat, Marc; Ourselin, Sebastien
Title: Accessible Data Curation and Analytics for International-Scale Citizen Science Datasets Cord-id: 25cbus1m Document date: 2020_11_2
ID: 25cbus1m
Snippet: The Covid Symptom Study, a smartphone-based surveillance study on COVID-19 symptoms in the population, is an exemplar of big data citizen science. Over 4.7 million participants and 189 million unique assessments have been logged since its introduction in March 2020. The success of the Covid Symptom Study creates technical challenges around effective data curation for two reasons. Firstly, the scale of the dataset means that it can no longer be easily processed using standard software on commodit
Document: The Covid Symptom Study, a smartphone-based surveillance study on COVID-19 symptoms in the population, is an exemplar of big data citizen science. Over 4.7 million participants and 189 million unique assessments have been logged since its introduction in March 2020. The success of the Covid Symptom Study creates technical challenges around effective data curation for two reasons. Firstly, the scale of the dataset means that it can no longer be easily processed using standard software on commodity hardware. Secondly, the size of the research group means that replicability and consistency of key analytics used across multiple publications becomes an issue. We present ExeTera, an open source data curation software designed to address scalability challenges and to enable reproducible research across an international research group for datasets such as the Covid Symptom Study dataset.
Search related documents:
Co phrase search for related documents- Try single phrases listed below for: 1
Co phrase search for related documents, hyperlinks ordered by date