Using Multi-Modal Electronic Health Record Data for the Development and Validation of Risk Prediction Models for Long COVID Using the Super Learner Algorithm
BACKGROUND: Post-Acute Sequelae of COVID-19 (PASC) have emerged as a global public health and healthcare challenge. This study aimed to uncover predictive factors for PASC from multi-modal data to develop a predictive model for PASC diagnoses.
METHODS: We analyzed electronic health records from 92,301 COVID-19 patients, covering medical phenotypes, medications, and lab results. We used a Super Learner-based prediction approach to identify predictive factors. We integrated the model outputs into individual and composite risk scores and evaluated their predictive performance.
RESULTS: Our analysis identified several factors predictive of diagnoses of PASC, including being overweight/obese and the use of HMG CoA reductase inhibitors prior to COVID-19 infection, and respiratory system symptoms during COVID-19 infection. We developed a composite risk score with a moderate discriminatory ability for PASC (covariate-adjusted AUC (95% confidence interval): 0.66 (0.63, 0.69)) by combining the risk scores based on phenotype and medication records. The combined risk score could identify 10% of individuals with a 2.2-fold increased risk for PASC.
CONCLUSIONS: We identified several factors predictive of diagnoses of PASC and integrated the information into a composite risk score for PASC prediction, which could contribute to the identification of individuals at higher risk for PASC and inform preventive efforts.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:12 |
---|---|
Enthalten in: |
Journal of clinical medicine - 12(2023), 23 vom: 25. Nov. |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Jin, Weijia [VerfasserIn] |
---|
Links: |
---|
Themen: |
COVID-19 |
---|
Anmerkungen: |
Date Revised 10.02.2024 published: Electronic Citation Status PubMed-not-MEDLINE |
---|
doi: |
10.3390/jcm12237313 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM365591815 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM365591815 | ||
003 | DE-627 | ||
005 | 20240210232945.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.3390/jcm12237313 |2 doi | |
028 | 5 | 2 | |a pubmed24n1287.xml |
035 | |a (DE-627)NLM365591815 | ||
035 | |a (NLM)38068365 | ||
035 | |a (PII)7313 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Jin, Weijia |e verfasserin |4 aut | |
245 | 1 | 0 | |a Using Multi-Modal Electronic Health Record Data for the Development and Validation of Risk Prediction Models for Long COVID Using the Super Learner Algorithm |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Revised 10.02.2024 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status PubMed-not-MEDLINE | ||
520 | |a BACKGROUND: Post-Acute Sequelae of COVID-19 (PASC) have emerged as a global public health and healthcare challenge. This study aimed to uncover predictive factors for PASC from multi-modal data to develop a predictive model for PASC diagnoses | ||
520 | |a METHODS: We analyzed electronic health records from 92,301 COVID-19 patients, covering medical phenotypes, medications, and lab results. We used a Super Learner-based prediction approach to identify predictive factors. We integrated the model outputs into individual and composite risk scores and evaluated their predictive performance | ||
520 | |a RESULTS: Our analysis identified several factors predictive of diagnoses of PASC, including being overweight/obese and the use of HMG CoA reductase inhibitors prior to COVID-19 infection, and respiratory system symptoms during COVID-19 infection. We developed a composite risk score with a moderate discriminatory ability for PASC (covariate-adjusted AUC (95% confidence interval): 0.66 (0.63, 0.69)) by combining the risk scores based on phenotype and medication records. The combined risk score could identify 10% of individuals with a 2.2-fold increased risk for PASC | ||
520 | |a CONCLUSIONS: We identified several factors predictive of diagnoses of PASC and integrated the information into a composite risk score for PASC prediction, which could contribute to the identification of individuals at higher risk for PASC and inform preventive efforts | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a COVID-19 | |
650 | 4 | |a electronic health records | |
650 | 4 | |a phenotype risk score | |
650 | 4 | |a post-acute sequelae of SARS-CoV-2 infection | |
650 | 4 | |a predictive models | |
700 | 1 | |a Hao, Wei |e verfasserin |4 aut | |
700 | 1 | |a Shi, Xu |e verfasserin |4 aut | |
700 | 1 | |a Fritsche, Lars G |e verfasserin |4 aut | |
700 | 1 | |a Salvatore, Maxwell |e verfasserin |4 aut | |
700 | 1 | |a Admon, Andrew J |e verfasserin |4 aut | |
700 | 1 | |a Friese, Christopher R |e verfasserin |4 aut | |
700 | 1 | |a Mukherjee, Bhramar |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of clinical medicine |d 2012 |g 12(2023), 23 vom: 25. Nov. |w (DE-627)NLM230666310 |x 2077-0383 |7 nnns |
773 | 1 | 8 | |g volume:12 |g year:2023 |g number:23 |g day:25 |g month:11 |
856 | 4 | 0 | |u http://dx.doi.org/10.3390/jcm12237313 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 12 |j 2023 |e 23 |b 25 |c 11 |