Doubly robust inference when combining probability and non-probability samples with high dimensional data

We consider integrating a non-probability sample with a probability sample which provides high dimensional representative covariate information of the target population. We propose a two-step approach for variable selection and finite population inference. In the first step, we use penalized estimating equations with folded concave penalties to select important variables and show selection consistency for general samples. In the second step, we focus on a doubly robust estimator of the finite population mean and re-estimate the nuisance model parameters by minimizing the asymptotic squared bias of the doubly robust estimator. This estimating strategy mitigates the possible first-step selection error and renders the doubly robust estimator root n consistent if either the sampling probability or the outcome model is correctly specified.

Medienart:

E-Artikel

Erscheinungsjahr:

2020

Erschienen:

2020

Enthalten in:

Zur Gesamtaufnahme - volume:82

Enthalten in:

Journal of the Royal Statistical Society. Series B, Statistical methodology - 82(2020), 2 vom: 01. Apr., Seite 445-465

Sprache:

Englisch

Beteiligte Personen:

Yang, Shu [VerfasserIn]
Kim, Jae Kwang [VerfasserIn]
Song, Rui [VerfasserIn]

Links:

Volltext

Themen:

Data integration
Double robustness
Generalizability
Journal Article
Penalized estimating equation
Variable selection

Anmerkungen:

Date Revised 13.11.2020

published: Print-Electronic

Citation Status PubMed-not-MEDLINE

doi:

10.1111/rssb.12354

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM317305522