Full Citation
Title: Modeling Racial/Ethnic Differences in COVID-19 Incidence with Covariates Subject to Non-Random Missingness
Citation Type: Miscellaneous
Publication Year: 2022
ISBN:
ISSN:
DOI:
NSFID:
PMCID:
PMID:
Abstract: Covariates like age, sex, and race/ethnicity provide invaluable insight to public health authorities trying to interpret surveillance data collected during a public health emergency such as the COVID-19 pandemic. However, the utility of such data is limited when many cases are missing key covariates. This problem is most concerning when this missingness likely depends on the values of missing covariates, i.e. they are not missing at random (NMAR). We propose a Bayesian parametric model that leverages joint information on spatial variation in the disease and covariate missingness processes and can accommodate both MAR and NMAR missingness. We show that the model is locally identifiable when the spatial distribution of the population covari-ates is known and observed cases can be associated with a spatial unit of observation. We also use a simulation study to investigate the model's finite-sample performance. We compare our model's performance on NMAR data against complete-case analysis and multiple imputation (MI), both of which are commonly used by public health researchers when confronted with missing categorical covariates. Finally, we model spatial variation in cumulative COVID-19 incidence in Wayne County, Michigan using data from the Michi-gan Department and Health and Human Services. The analysis suggests that population relative risk estimates by race during the early part of the COVID-19 pandemic in Michigan were understated for non-white residents compared to white residents when cases missing race were dropped or had these values imputed using MI.
Url: https://arxiv.org/pdf/2206.08161.pdf
User Submitted?: No
Authors: Trangucci, Rob; Chen, Yang; Zelner, Jon
Publisher:
Data Collections: IPUMS NHGIS
Topics: Health, Methodology and Data Collection, Population Data Science
Countries: