The Use of Multiple Imputation to Handle Missing Data in Secondary Datasets: Suggested Approaches when Missing Data Results from the Survey Structure

Secondary datasets are used in healthcare research because of its cost advantages, its convenience, and the size of the datasets. However, missing data can cause problems that are difficult to resolve. This manuscript reviews possible causes for missing data, and how to address them. Many researcher...

Full description

Saved in:
Bibliographic Details
Main Author: Soojung Jo PhD, RN (Author)
Format: Book
Published: SAGE Publishing, 2022-05-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_77f900d9ed4144beb3c11d0ce099e2c4
042 |a dc 
100 1 0 |a Soojung Jo PhD, RN  |e author 
245 0 0 |a The Use of Multiple Imputation to Handle Missing Data in Secondary Datasets: Suggested Approaches when Missing Data Results from the Survey Structure 
260 |b SAGE Publishing,   |c 2022-05-01T00:00:00Z. 
500 |a 0046-9580 
500 |a 1945-7243 
500 |a 10.1177/00469580221088627 
520 |a Secondary datasets are used in healthcare research because of its cost advantages, its convenience, and the size of the datasets. However, missing data can cause problems that are difficult to resolve. This manuscript reviews possible causes for missing data, and how to address them. Many researchers use multiple imputation as a solution, which consists of three phases: (a) the imputation phase, (b) the analysis phase, and (c) the pooling phase. When missing data is caused by a refusal to answer or by insufficient knowledge, multiple imputation works well. However, difficulties arise when there are problems with screening questions. If respondents do not answer a screening question, possible answers could be either "yes" or "no." This paper suggests identifying "yes" responses on the screening question, and setting them aside for use in the analysis. The reasons for this approach are the impossibility of conducting multiple imputation twice, the problem of imputation based on the population after sample weight, and the difficulty of producing logical errors on the estimation in imputation phase. This manuscript uses as an example the techniques used to address missing data from screening questions in a national US dataset. These techniques of multiple imputation using examples from the dataset could be used by researchers in future healthcare research that relies on secondary datasets. 
546 |a EN 
690 |a Public aspects of medicine 
690 |a RA1-1270 
655 7 |a article  |2 local 
786 0 |n Inquiry: The Journal of Health Care Organization, Provision, and Financing, Vol 59 (2022) 
787 0 |n https://doi.org/10.1177/00469580221088627 
787 0 |n https://doaj.org/toc/0046-9580 
787 0 |n https://doaj.org/toc/1945-7243 
856 4 1 |u https://doaj.org/article/77f900d9ed4144beb3c11d0ce099e2c4  |z Connect to this object online.