Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)

ABSTRACT Objective To define pregnancy episodes and estimate gestational aging within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C).Materials and Methods We developed a comprehensive approach, named <jats:underline>H</jats:underline>ierarchy and rule-based pregnancy episode <jats:underline>I</jats:underline>nference integrated with <jats:underline>P</jats:underline>regnancy <jats:underline>P</jats:underline>rogression <jats:underline>S</jats:underline>ignatures (HIPPS) and applied it to EHR data in the N3C from 1 January 2018 to 7 April 2022. HIPPS combines: 1) an extension of a previously published pregnancy episode algorithm, 2) a novel algorithm to detect gestational aging-specific signatures of a progressing pregnancy for further episode support, and 3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated three types of pregnancy cohorts based on the level of precision for gestational aging and pregnancy outcomes for comparison of COVID-19 and other characteristics.Results We identified 628,165 pregnant persons with 816,471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, spontaneous abortions), and 23.3% had unknown outcomes. We were able to estimate start dates within one week of precision for 431,173 (52.8%) episodes. 66,019 (8.1%) episodes had incident COVID-19 during pregnancy. Across varying COVID-19 cohorts, patient characteristics were generally similar though pregnancy outcomes differed.Discussion HIPPS provides support for pregnancy-related variables based on EHR data for researchers to define pregnancy cohorts. Our approach performed well based on clinician validation.Conclusion We have developed a novel and robust approach for inferring pregnancy episodes and gestational aging that addresses data inconsistency and missingness in EHR data..

Medienart:

Preprint

Erscheinungsjahr:

2024

Erschienen:

2024

Enthalten in:

bioRxiv.org - (2024) vom: 23. Apr. Zur Gesamtaufnahme - year:2024

Sprache:

Englisch

Beteiligte Personen:

Jones, Sara [VerfasserIn]
Bradwell, Katie R. [VerfasserIn]
Chan, Lauren E. [VerfasserIn]
Olson-Chen, Courtney [VerfasserIn]
Tarleton, Jessica [VerfasserIn]
Wilkins, Kenneth J. [VerfasserIn]
Qin, Qiuyuan [VerfasserIn]
Faherty, Emily Groene [VerfasserIn]
Lau, Yan Kwan [VerfasserIn]
Xie, Catherine [VerfasserIn]
Kao, Yu-Han [VerfasserIn]
Liebman, Michael N. [VerfasserIn]
Mariona, Federico [VerfasserIn]
Challa, Anup [VerfasserIn]
Li, Li [VerfasserIn]
Ratcliffe, Sarah J. [VerfasserIn]
McMurry, Julie A. [VerfasserIn]
Haendel, Melissa A. [VerfasserIn]
Patel, Rena C. [VerfasserIn]
Hill, Elaine L. [VerfasserIn]

Links:

Volltext [lizenzpflichtig]
Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2022.08.04.22278439

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI036804789