Homogeneous ensemble models for predicting infection levels and mortality of COVID-19 patients : Evidence from China

© The Author(s) 2022..

Background: Persistence of long-term COVID-19 pandemic is putting high pressure on healthcare services worldwide for several years. This article aims to establish models to predict infection levels and mortality of COVID-19 patients in China.

Methods: Machine learning models and deep learning models have been built based on the clinical features of COVID-19 patients. The best models are selected by area under the receiver operating characteristic curve (AUC) scores to construct two homogeneous ensemble models for predicting infection levels and mortality, respectively. The first-hand clinical data of 760 patients are collected from Zhongnan Hospital of Wuhan University between 3 January and 8 March 2020. We preprocess data with cleaning, imputation, and normalization.

Results: Our models obtain AUC = 0.7059 and Recall (Weighted avg) = 0.7248 in predicting infection level, while AUC=0.8436 and Recall (Weighted avg) = 0.8486 in predicting mortality ratio. This study also identifies two sets of essential clinical features. One is C-reactive protein (CRP) or high sensitivity C-reactive protein (hs-CRP) and the other is chest tightness, age, and pleural effusion.

Conclusions: Two homogeneous ensemble models are proposed to predict infection levels and mortality of COVID-19 patients in China. New findings of clinical features for benefiting the machine learning models are reported. The evaluation of an actual dataset collected from January 3 to March 8, 2020 demonstrates the effectiveness of the models by comparing them with state-of-the-art models in prediction.

Medienart:

E-Artikel

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

Zur Gesamtaufnahme - volume:8

Enthalten in:

Digital health - 8(2022) vom: 28. Jan., Seite 20552076221133692

Sprache:

Englisch

Beteiligte Personen:

Wang, Jiafeng [VerfasserIn]
Zhou, Xianlong [VerfasserIn]
Hou, Zhitian [VerfasserIn]
Xu, Xiaoya [VerfasserIn]
Zhao, Yueyue [VerfasserIn]
Chen, Shanshan [VerfasserIn]
Zhang, Jun [VerfasserIn]
Shao, Lina [VerfasserIn]
Yan, Rong [VerfasserIn]
Wang, Mingshan [VerfasserIn]
Ge, Minghua [VerfasserIn]
Hao, Tianyong [VerfasserIn]
Tu, Yuexing [VerfasserIn]
Huang, Haijun [VerfasserIn]

Links:

Volltext

Themen:

COVID-19
Electronic health records
Ensemble model
Journal Article
Machine learning
Prediction models

Anmerkungen:

Date Revised 08.11.2022

published: Electronic-eCollection

Citation Status PubMed-not-MEDLINE

doi:

10.1177/20552076221133692

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM348537867