Published in

SAGE Publications, The Stata Journal, 2(15), p. 437-456, 2015

DOI: 10.1177/1536867x1501500206

Links

Tools

Export citation

Search in Google Scholar

Multiple imputation of covariates by substantive-model compatible fully conditional specification

Journal article published in 2015 by Jonathan W. Bartlett ORCID, Tim P. Morris
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Multiple imputation is a practical, principled approach to handling missing data. When used to impute missing values in covariates of regression models, imputation models may be misspecified if they are not compatible with the substantive model of interest for the outcome. In this article, we introduce the smcfcs command, which imputes covariates by substantive-model compatible fully conditional specification. This modifies the popular fully conditional specification or chained-equations approach to multiple imputation by imputing each covariate compatibly with a user-specified substantive model. We compare the smcfcs command with standard fully conditional specification imputation using mi impute chained in a simulation study and illustrative analysis of data from a study investigating time to tumor recurrence in breast cancer.