Details der Publikation - Analyses using multiple imputation need to consider missing data in auxiliary variables

Analyses using multiple imputation need to consider missing data in auxiliary variables

Abstract Auxiliary variables are used in multiple imputation (MI) to reduce bias and increase efficiency. These variables may often themselves be incomplete. We explored how missing data in auxiliary variables influenced estimates obtained from MI. We implemented a simulation study with three different missing data mechanisms for the outcome. We then examined the impact of increasing proportions of missing data and different missingness mechanisms for the auxiliary variable on bias of an unadjusted linear regression coefficient and the fraction of missing information. We illustrate our findings with an applied example in the Avon Longitudinal Study of Parents and Children. We found that where complete records analyses were biased, increasing proportions of missing data in auxiliary variables, under any missing data mechanism, reduced the ability of MI including the auxiliary variable to mitigate this bias. Where there was no bias in the complete records analysis, inclusion of a missing not at random auxiliary variable in MI introduced bias of potentially important magnitude (up to 17% of the effect size in our simulation). Careful consideration of the quantity and nature of missing data in auxiliary variables needs to be made when selecting them for use in MI models..

Medienart:	Preprint

Erscheinungsjahr:	2023
Erschienen:	2023

Enthalten in:	bioRxiv.org - (2023) vom: 14. Dez. Zur Gesamtaufnahme - year:2023

Sprache:	Englisch

Beteiligte Personen:	Madley-Dowd, Paul [VerfasserIn] Curnow, Elinor [VerfasserIn] Hughes, Rachael A. [VerfasserIn] Cornish, Rosie [VerfasserIn] Tilling, Kate [VerfasserIn] Heron, Jon [VerfasserIn]

Links:	Volltext [kostenfrei]

Themen:	570 Biology

doi:	10.1101/2023.12.11.23299810

funding:
Förderinstitution / Projekttitel:

PPN (Katalog-ID):	XBI041848993

Internformat


LEADER	01000caa a22002652 4500
001	XBI041848993
003	DE-627
005	20231215091159.0
007	cr uuu---uuuuu
008	231213s2023 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1101/2023.12.11.23299810 \|2 doi
035			\|a (DE-627)XBI041848993
035			\|a (biorXiv)10.1101/2023.12.11.23299810
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Madley-Dowd, Paul \|e verfasserin \|0 (orcid)0000-0003-2932-9486 \|4 aut
245	1	0	\|a Analyses using multiple imputation need to consider missing data in auxiliary variables
264		1	\|c 2023
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
520			\|a Abstract Auxiliary variables are used in multiple imputation (MI) to reduce bias and increase efficiency. These variables may often themselves be incomplete. We explored how missing data in auxiliary variables influenced estimates obtained from MI. We implemented a simulation study with three different missing data mechanisms for the outcome. We then examined the impact of increasing proportions of missing data and different missingness mechanisms for the auxiliary variable on bias of an unadjusted linear regression coefficient and the fraction of missing information. We illustrate our findings with an applied example in the Avon Longitudinal Study of Parents and Children. We found that where complete records analyses were biased, increasing proportions of missing data in auxiliary variables, under any missing data mechanism, reduced the ability of MI including the auxiliary variable to mitigate this bias. Where there was no bias in the complete records analysis, inclusion of a missing not at random auxiliary variable in MI introduced bias of potentially important magnitude (up to 17% of the effect size in our simulation). Careful consideration of the quantity and nature of missing data in auxiliary variables needs to be made when selecting them for use in MI models.
650		4	\|a Biology \|7 (dpeaa)DE-84
650		4	\|a 570 \|7 (dpeaa)DE-84
700	1		\|a Curnow, Elinor \|0 (orcid)0000-0002-3109-3647 \|4 aut
700	1		\|a Hughes, Rachael A. \|0 (orcid)0000-0003-0766-1410 \|4 aut
700	1		\|a Cornish, Rosie \|0 (orcid)0000-0002-2874-7646 \|4 aut
700	1		\|a Tilling, Kate \|0 (orcid)0000-0002-1010-8926 \|4 aut
700	1		\|a Heron, Jon \|0 (orcid)0000-0001-6199-5644 \|4 aut
773	0	8	\|i Enthalten in \|t bioRxiv.org \|g (2023) vom: 14. Dez.
773	1	8	\|g year:2023 \|g day:14 \|g month:12
856	4	0	\|u http://dx.doi.org/10.1101/2023.12.11.23299810 \|z kostenfrei \|3 Volltext
912			\|a GBV_XBI
951			\|a AR
952			\|j 2023 \|b 14 \|c 12

Analyses using multiple imputation need to consider missing data in auxiliary variables

Zugang & Verfügbarkeit

Zugehörige Publikationen/Bände