Development and Validation of a Robust and Interpretable Early Triaging Support System for Patients Hospitalized With COVID-19 : Predictive Algorithm Modeling and Interpretation Study
©Sangwon Baek, Yeon joo Jeong, Yun-Hyeon Kim, Jin Young Kim, Jin Hwan Kim, Eun Young Kim, Jae-Kwang Lim, Jungok Kim, Zero Kim, Kyunga Kim, Myung Jin Chung. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 11.01.2024..
BACKGROUND: Robust and accurate prediction of severity for patients with COVID-19 is crucial for patient triaging decisions. Many proposed models were prone to either high bias risk or low-to-moderate discrimination. Some also suffered from a lack of clinical interpretability and were developed based on early pandemic period data. Hence, there has been a compelling need for advancements in prediction models for better clinical applicability.
OBJECTIVE: The primary objective of this study was to develop and validate a machine learning-based Robust and Interpretable Early Triaging Support (RIETS) system that predicts severity progression (involving any of the following events: intensive care unit admission, in-hospital death, mechanical ventilation required, or extracorporeal membrane oxygenation required) within 15 days upon hospitalization based on routinely available clinical and laboratory biomarkers.
METHODS: We included data from 5945 hospitalized patients with COVID-19 from 19 hospitals in South Korea collected between January 2020 and August 2022. For model development and external validation, the whole data set was partitioned into 2 independent cohorts by stratified random cluster sampling according to hospital type (general and tertiary care) and geographical location (metropolitan and nonmetropolitan). Machine learning models were trained and internally validated through a cross-validation technique on the development cohort. They were externally validated using a bootstrapped sampling technique on the external validation cohort. The best-performing model was selected primarily based on the area under the receiver operating characteristic curve (AUROC), and its robustness was evaluated using bias risk assessment. For model interpretability, we used Shapley and patient clustering methods.
RESULTS: Our final model, RIETS, was developed based on a deep neural network of 11 clinical and laboratory biomarkers that are readily available within the first day of hospitalization. The features predictive of severity included lactate dehydrogenase, age, absolute lymphocyte count, dyspnea, respiratory rate, diabetes mellitus, c-reactive protein, absolute neutrophil count, platelet count, white blood cell count, and saturation of peripheral oxygen. RIETS demonstrated excellent discrimination (AUROC=0.937; 95% CI 0.935-0.938) with high calibration (integrated calibration index=0.041), satisfied all the criteria of low bias risk in a risk assessment tool, and provided detailed interpretations of model parameters and patient clusters. In addition, RIETS showed potential for transportability across variant periods with its sustainable prediction on Omicron cases (AUROC=0.903, 95% CI 0.897-0.910).
CONCLUSIONS: RIETS was developed and validated to assist early triaging by promptly predicting the severity of hospitalized patients with COVID-19. Its high performance with low bias risk ensures considerably reliable prediction. The use of a nationwide multicenter cohort in the model development and validation implicates generalizability. The use of routinely collected features may enable wide adaptability. Interpretations of model parameters and patients can promote clinical applicability. Together, we anticipate that RIETS will facilitate the patient triaging workflow and efficient resource allocation when incorporated into a routine clinical practice.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:26 |
---|---|
Enthalten in: |
Journal of medical Internet research - 26(2024) vom: 11. Jan., Seite e52134 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Baek, Sangwon [VerfasserIn] |
---|
Links: |
---|
Anmerkungen: |
Date Completed 12.01.2024 Date Revised 28.01.2024 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.2196/52134 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM36697260X |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM36697260X | ||
003 | DE-627 | ||
005 | 20240128231933.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240114s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.2196/52134 |2 doi | |
028 | 5 | 2 | |a pubmed24n1273.xml |
035 | |a (DE-627)NLM36697260X | ||
035 | |a (NLM)38206673 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Baek, Sangwon |e verfasserin |4 aut | |
245 | 1 | 0 | |a Development and Validation of a Robust and Interpretable Early Triaging Support System for Patients Hospitalized With COVID-19 |b Predictive Algorithm Modeling and Interpretation Study |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 12.01.2024 | ||
500 | |a Date Revised 28.01.2024 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a ©Sangwon Baek, Yeon joo Jeong, Yun-Hyeon Kim, Jin Young Kim, Jin Hwan Kim, Eun Young Kim, Jae-Kwang Lim, Jungok Kim, Zero Kim, Kyunga Kim, Myung Jin Chung. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 11.01.2024. | ||
520 | |a BACKGROUND: Robust and accurate prediction of severity for patients with COVID-19 is crucial for patient triaging decisions. Many proposed models were prone to either high bias risk or low-to-moderate discrimination. Some also suffered from a lack of clinical interpretability and were developed based on early pandemic period data. Hence, there has been a compelling need for advancements in prediction models for better clinical applicability | ||
520 | |a OBJECTIVE: The primary objective of this study was to develop and validate a machine learning-based Robust and Interpretable Early Triaging Support (RIETS) system that predicts severity progression (involving any of the following events: intensive care unit admission, in-hospital death, mechanical ventilation required, or extracorporeal membrane oxygenation required) within 15 days upon hospitalization based on routinely available clinical and laboratory biomarkers | ||
520 | |a METHODS: We included data from 5945 hospitalized patients with COVID-19 from 19 hospitals in South Korea collected between January 2020 and August 2022. For model development and external validation, the whole data set was partitioned into 2 independent cohorts by stratified random cluster sampling according to hospital type (general and tertiary care) and geographical location (metropolitan and nonmetropolitan). Machine learning models were trained and internally validated through a cross-validation technique on the development cohort. They were externally validated using a bootstrapped sampling technique on the external validation cohort. The best-performing model was selected primarily based on the area under the receiver operating characteristic curve (AUROC), and its robustness was evaluated using bias risk assessment. For model interpretability, we used Shapley and patient clustering methods | ||
520 | |a RESULTS: Our final model, RIETS, was developed based on a deep neural network of 11 clinical and laboratory biomarkers that are readily available within the first day of hospitalization. The features predictive of severity included lactate dehydrogenase, age, absolute lymphocyte count, dyspnea, respiratory rate, diabetes mellitus, c-reactive protein, absolute neutrophil count, platelet count, white blood cell count, and saturation of peripheral oxygen. RIETS demonstrated excellent discrimination (AUROC=0.937; 95% CI 0.935-0.938) with high calibration (integrated calibration index=0.041), satisfied all the criteria of low bias risk in a risk assessment tool, and provided detailed interpretations of model parameters and patient clusters. In addition, RIETS showed potential for transportability across variant periods with its sustainable prediction on Omicron cases (AUROC=0.903, 95% CI 0.897-0.910) | ||
520 | |a CONCLUSIONS: RIETS was developed and validated to assist early triaging by promptly predicting the severity of hospitalized patients with COVID-19. Its high performance with low bias risk ensures considerably reliable prediction. The use of a nationwide multicenter cohort in the model development and validation implicates generalizability. The use of routinely collected features may enable wide adaptability. Interpretations of model parameters and patients can promote clinical applicability. Together, we anticipate that RIETS will facilitate the patient triaging workflow and efficient resource allocation when incorporated into a routine clinical practice | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Multicenter Study | |
650 | 4 | |a Validation Study | |
650 | 4 | |a COVID-19 | |
650 | 4 | |a Omicron | |
650 | 4 | |a SARS-CoV-2 | |
650 | 4 | |a SHAP | |
650 | 4 | |a Shapley | |
650 | 4 | |a biomarker | |
650 | 4 | |a biomarkers | |
650 | 4 | |a clustering | |
650 | 4 | |a coronavirus | |
650 | 4 | |a deep learning | |
650 | 4 | |a early triaging | |
650 | 4 | |a emergency | |
650 | 4 | |a hospital admission | |
650 | 4 | |a hospital admissions | |
650 | 4 | |a hospitalization | |
650 | 4 | |a hospitalizations | |
650 | 4 | |a hospitalize | |
650 | 4 | |a interpretability | |
650 | 4 | |a machine learning | |
650 | 4 | |a neural network | |
650 | 4 | |a neural networks | |
650 | 4 | |a predict | |
650 | 4 | |a prediction | |
650 | 4 | |a prediction model | |
650 | 4 | |a predictive | |
650 | 4 | |a prognosis | |
650 | 4 | |a prognostic | |
650 | 4 | |a prognostics | |
650 | 4 | |a severity | |
650 | 4 | |a triage | |
650 | 4 | |a triaging | |
650 | 7 | |a Biomarkers |2 NLM | |
700 | 1 | |a Jeong, Yeon Joo |e verfasserin |4 aut | |
700 | 1 | |a Kim, Yun-Hyeon |e verfasserin |4 aut | |
700 | 1 | |a Kim, Jin Young |e verfasserin |4 aut | |
700 | 1 | |a Kim, Jin Hwan |e verfasserin |4 aut | |
700 | 1 | |a Kim, Eun Young |e verfasserin |4 aut | |
700 | 1 | |a Lim, Jae-Kwang |e verfasserin |4 aut | |
700 | 1 | |a Kim, Jungok |e verfasserin |4 aut | |
700 | 1 | |a Kim, Zero |e verfasserin |4 aut | |
700 | 1 | |a Kim, Kyunga |e verfasserin |4 aut | |
700 | 1 | |a Chung, Myung Jin |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of medical Internet research |d 1999 |g 26(2024) vom: 11. Jan., Seite e52134 |w (DE-627)NLM116127104 |x 1438-8871 |7 nnns |
773 | 1 | 8 | |g volume:26 |g year:2024 |g day:11 |g month:01 |g pages:e52134 |
856 | 4 | 0 | |u http://dx.doi.org/10.2196/52134 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 26 |j 2024 |b 11 |c 01 |h e52134 |