Author: Moritz Mercker; Uwe Betzin; Dennis Wilken
Title: What influences COVID-19 infection rates: A statistical approach to identify promising factors applied to infection data from Germany Document date: 2020_4_17
ID: 09nvausz_18
Snippet: In order to evaluate possible factors locally driving infection rates, we applied the 'least absolute shrinkage and selection operator' (LASSO) method [14, 15] in combination with cross-validation to the data. LASSO is known to reliably perform model selection even if the total number of possible predictors is high [9, 16] . In particular, we applied LASSO separately to three different types of models (using β 1 , β 2 , respectively β 3 as the.....
Document: In order to evaluate possible factors locally driving infection rates, we applied the 'least absolute shrinkage and selection operator' (LASSO) method [14, 15] in combination with cross-validation to the data. LASSO is known to reliably perform model selection even if the total number of possible predictors is high [9, 16] . In particular, we applied LASSO separately to three different types of models (using β 1 , β 2 , respectively β 3 as the outcome variable, where for β 1 and β 2 we used the normal probability distribution. Since the used software (the R-package glmnet, c.f. below) is not able to perform Betaregression, we reformulated for the LASSO-step the value β 2 into a binomial variable instead ('smaller one' vs. 'one'), using a binomial error distribution. We tested the following variables: On the ADlevel we introduced Longitude, Latitude (as well as their interaction term), the date of the first reported COVID-19 infection (first infection), the local percentage of people older than 65 years (Age) (data from https://www-genesis.destatis.de), the population density (Density), and the local infection intensity (percentage of people infected by COVID-19 -infection density). On the coarser level of states, we introduced human mobility data. In particular, we used mobility data from Google collected by the type of activity, where activity (visits and length of stay) has been quantified at different types of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential -compared to a baseline level before the COVID-19 pandemic (variables have been named All rights reserved. No reuse allowed without permission.
Search related documents:
Co phrase search for related documents- pharmacy grocery and recreation retail: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21
- pharmacy grocery and residential workplace: 1, 2, 3, 4, 5, 6
- pharmacy grocery and residential workplace transit station: 1
- pharmacy grocery and transit station: 1
- pharmacy grocery recreation retail and population density: 1
- pharmacy grocery recreation retail and recreation retail: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
- pharmacy grocery recreation retail and residential workplace: 1, 2, 3, 4, 5, 6
- pharmacy grocery recreation retail and residential workplace transit station: 1
- pharmacy grocery recreation retail and transit station: 1
- population density and possible factor: 1, 2
- population density and probability distribution: 1, 2, 3
- population density and recreation retail: 1
- population density and residential workplace: 1
- population density and selection absolute shrinkage: 1
- population density and selection absolute shrinkage operator: 1
- population density and total number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56
- possible factor and total number: 1
- possible predictor and total number: 1
- probability distribution and total number: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Co phrase search for related documents, hyperlinks ordered by date