Using Multi-Modal Electronic Health Record Data for the Development and Validation of Risk Prediction Models for Long COVID Using the Super Learner Algorithm

BACKGROUND: Post-Acute Sequelae of COVID-19 (PASC) have emerged as a global public health and healthcare challenge. This study aimed to uncover predictive factors for PASC from multi-modal data to develop a predictive model for PASC diagnoses.

METHODS: We analyzed electronic health records from 92,301 COVID-19 patients, covering medical phenotypes, medications, and lab results. We used a Super Learner-based prediction approach to identify predictive factors. We integrated the model outputs into individual and composite risk scores and evaluated their predictive performance.

RESULTS: Our analysis identified several factors predictive of diagnoses of PASC, including being overweight/obese and the use of HMG CoA reductase inhibitors prior to COVID-19 infection, and respiratory system symptoms during COVID-19 infection. We developed a composite risk score with a moderate discriminatory ability for PASC (covariate-adjusted AUC (95% confidence interval): 0.66 (0.63, 0.69)) by combining the risk scores based on phenotype and medication records. The combined risk score could identify 10% of individuals with a 2.2-fold increased risk for PASC.

CONCLUSIONS: We identified several factors predictive of diagnoses of PASC and integrated the information into a composite risk score for PASC prediction, which could contribute to the identification of individuals at higher risk for PASC and inform preventive efforts.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:12

Enthalten in:

Journal of clinical medicine - 12(2023), 23 vom: 25. Nov.

Sprache:

Englisch

Beteiligte Personen:

Jin, Weijia [VerfasserIn]
Hao, Wei [VerfasserIn]
Shi, Xu [VerfasserIn]
Fritsche, Lars G [VerfasserIn]
Salvatore, Maxwell [VerfasserIn]
Admon, Andrew J [VerfasserIn]
Friese, Christopher R [VerfasserIn]
Mukherjee, Bhramar [VerfasserIn]

Links:

Volltext

Themen:

COVID-19
Electronic health records
Journal Article
Phenotype risk score
Post-acute sequelae of SARS-CoV-2 infection
Predictive models

Anmerkungen:

Date Revised 10.02.2024

published: Electronic

Citation Status PubMed-not-MEDLINE

doi:

10.3390/jcm12237313

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM365591815