首页 | 本学科首页   官方微博 | 高级检索  
     


A Comparison of Methods for Creating Multiple Imputations of Nominal Variables
Authors:Kyle M. Lang  Wei Wu
Affiliation:1. Institute for Measurement, Methodology, Analysis, and Policy, Texas Tech University;2. Department of Psychology, University of Kansas
Abstract:Many variables that are analyzed by social scientists are nominal in nature. When missing data occur on these variables, optimal recovery of the analysis model's parameters is a challenging endeavor. One of the most popular methods to deal with missing nominal data is multiple imputation (MI). This study evaluated the capabilities of five MI methods that can be used to treat incomplete nominal variables: multiple imputation with chained equations (MICE) using polytomous regression as the elementary imputation method; MICE based on classification and regression trees (CART); MICE based on nested logistic regressions; the ranking procedure described by Allison (2002 Allison, P. D. (2002). Missing data. Thousand Oaks, CA: Sage Publications. https://doi.org/10.4135/9780857020994.n4[Crossref] [Google Scholar]); and a joint modeling approach based on the general location model. We first motivate our inquiry with an applied example and then present the results of a Monte Carlo simulation study that compared the performance of the five imputation methods under conditions of varying sample size, percentage of missing data, and number of nominal response categories. We found that MICE with polytomous regression was the strongest performer while the Allison (2002 Allison, P. D. (2002). Missing data. Thousand Oaks, CA: Sage Publications. https://doi.org/10.4135/9780857020994.n4[Crossref] [Google Scholar]) ranking procedure and MICE with CART performed poorly in most conditions.
Keywords:General location model  missing data  multiple imputation  multiple imputation with chained equations  nominal variables
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号