IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Improving Bayesian Mixture Models for Multiple Imputation of Missing Data Using Focused Clustering

Citation Type: Journal Article

Publication Year: 2018

Abstract: We present a joint modeling approach for multiple imputation of missing continuous and categorical variables using Bayesian mixture models. The approach extends the idea of focused clustering, in which one separates variables into two sets before estimating the mixture model. Focus variables include variables with high rates of missingness and possibly other variables that could help improve the quality of the imputations. Non-focus variables include the remainder. In this way, one can use a rich sub-model for the focus set and a simpler model for the non-focus set, thereby concentrating fitting power on the variables with the highest rates of missingness. We present a procedure for specifying which variables with low rates of missingness to include in the focus set. We examine the performance of the imputation procedure using simulation studies based on artificial data and on data from the American Community Survey.

Url: https://www.ine.pt/revstat/pdf/REVSTAT_v16-n2-3.pdf

User Submitted?: No

Authors: Wei, Lan; Reiter, Jerome, P

Periodical (Full): REVSTAT Statistical Journal

Issue: 2

Volume: 16

Pages: 213-230

Data Collections: IPUMS USA

Topics: Methodology and Data Collection, Other

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop