Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
Copyright © 2021 Elsevier Ltd. All rights reserved.
In epidemiological COVID-19 research, artificial intelligence is a unique approach to predicting disease severity and managing COVID-19 patients. A limitation of artificial intelligence is, however, its high risk of bias. We investigated the ability of data mining and machine learning, two advanced forms of artificial intelligence, to predict severe COVID-19 pneumonia from routine laboratory tests. A sample of 4009 COVID-19 patients was divided into Severe (PaO2 < 60 mmHg, 489 cases) and Non-Severe (PaO2 ≥ 60 mmHg, 3520 cases) groups according to blood hypoxemia on admission, and their laboratory datasets were analyzed with the R software and the WEKA workbench. After curation, the data were processed to select the most influential features, including hemogram, pCO2, blood acid-base balance, prothrombin time, inflammation biomarkers, and glucose. The best fit of variables was confirmed both by the Multilayer Perceptron, a feedforward neural network algorithm that recognized severe COVID-19 with 96.5% precision, and by the C4.5 software, a supervised learning algorithm based on an objective-predefined variable (severity) that generated a decision tree with 89.4% precision. Finally, a complex bivariate Pearson's correlation matrix combined with advanced hierarchical clustering (dendrograms) was used for knowledge discovery. The hidden structure of the datasets revealed shift patterns related to the development of COVID-19-induced pneumonia that involved the lymphocyte-to-C-reactive protein and leukocyte-to-C-protein ratios, neutrophil %, pH and pCO2. Data mining approaches to the hematological fluctuations associated with severe COVID-19 pneumonia could not only anticipate adverse clinical outcomes, but also reveal putative therapeutic targets.
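As a rough illustration of the grouping rule and of the kind of quantities named in the abstract, the following is a minimal Python sketch. It is not the authors' pipeline, which used R and WEKA. Only the 60 mmHg PaO2 threshold comes from the abstract. The function names, the field names, and the three patient records are hypothetical and invented purely for illustration.

```python
from math import sqrt

# Threshold stated in the abstract: PaO2 < 60 mmHg on admission means Severe.
SEVERE_PAO2_MMHG = 60.0


def label_severity(pao2_mmhg: float) -> str:
    """Return 'Severe' when PaO2 is below 60 mmHg, otherwise 'Non-Severe'."""
    return "Severe" if pao2_mmhg < SEVERE_PAO2_MMHG else "Non-Severe"


def ratio(numerator: float, denominator: float) -> float:
    """Plain ratio, e.g. lymphocyte count over C-reactive protein."""
    if denominator == 0:
        raise ValueError("denominator must be non-zero")
    return numerator / denominator


def pearson(xs, ys) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sd_x * sd_y)


# Hypothetical records, NOT data from the study.
patients = [
    {"pao2": 55.0, "lymphocytes": 0.8, "crp": 120.0},
    {"pao2": 72.0, "lymphocytes": 1.6, "crp": 20.0},
    {"pao2": 64.0, "lymphocytes": 1.2, "crp": 45.0},
]

for p in patients:
    p["group"] = label_severity(p["pao2"])
    p["lymph_to_crp"] = ratio(p["lymphocytes"], p["crp"])

# One cell of a bivariate correlation matrix: PaO2 against the ratio.
r = pearson([p["pao2"] for p in patients],
            [p["lymph_to_crp"] for p in patients])
```

A full analysis would compute `pearson` for every pair of laboratory variables and then cluster the resulting matrix hierarchically. That step is omitted here because the abstract does not specify the linkage method or distance measure.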
Media type: |
E-article |
---|
Publication year: |
2021 |
---|---|
Published: |
2021 |
Contained in: |
To the parent record - volume:136 |
---|---|
Contained in: |
Computers in biology and medicine - 136(2021), 15 Sept., page 104738 |
Language: |
English |
---|
Contributors: |
Pulgar-Sánchez, Mary [author] |
---|
Links: |
---|
Subjects: |
Biomarkers |
---|
Notes: |
Date Completed 16.09.2021 Date Revised 05.12.2022 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1016/j.compbiomed.2021.104738 |
---|
funding: |
|
---|---|
Funding institution / project title: |
|
PPN (catalog ID): |
NLM329347543 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM329347543 | ||
003 | DE-627 | ||
005 | 20231225204922.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2021 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1016/j.compbiomed.2021.104738 |2 doi | |
028 | 5 | 2 | |a pubmed24n1097.xml |
035 | |a (DE-627)NLM329347543 | ||
035 | |a (NLM)34391001 | ||
035 | |a (PII)S0010-4825(21)00532-1 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Pulgar-Sánchez, Mary |e verfasserin |4 aut | |
245 | 1 | 0 | |a Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets |
264 | 1 | |c 2021 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 16.09.2021 | ||
500 | |a Date Revised 05.12.2022 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Copyright © 2021 Elsevier Ltd. All rights reserved. | ||
520 | |a In the epidemiological COVID-19 research, artificial intelligence is a unique approach to make predictions about disease severity to manage COVID-19 patients. A limitation of artificial intelligence is, however, the high risk of bias. We investigated the skill of data mining and machine learning, two advanced forms of artificial intelligence, to predict severe COVID-19 pneumonia based on routine laboratory tests. A sample of 4009 COVID-19 patients was divided into Severe (PaO2< 60 mmHg, 489 cases) and Non-Severe (PaO2 ≥ 60 mmHg, 3520 cases) groups according to blood hypoxemia on admission and their laboratory datasets analyzed by the R software and WEKA workbench. After curation, data were processed for the selection of the most influential features including hemogram, pCO2, blood acid-base balance, prothrombin time, inflammation biomarkers, and glucose. The best fit of variables was successfully confirmed by either the Multilayer Perceptron, a feedforward neural network algorithm that performed machine recognition of severe COVID-19 with 96.5% precision, or by the C4.5 software, a supervised learning algorithm based on an objective-predefined variable (severity) that generated a decision tree with 89.4% precision. Finally, a complex bivariate Pearson's correlation matrix combined with advanced hierarchical clustering (dendrograms) were conducted for knowledge discovery. The hidden structure of the datasets revealed shift patterns related to the development of COVID-19-induced pneumonia that involved the lymphocyte-to-C-reactive protein and leukocyte-to-C-protein ratios, neutrophil %, pH and pCO2. The data mining approaches to the hematological fluctuations associated with severe COVID-19 pneumonia could not only anticipate adverse clinical outcomes, but also reveal putative therapeutic targets | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Blood gas analyses | |
650 | 4 | |a COVID-19 | |
650 | 4 | |a Clinical laboratory techniques | |
650 | 4 | |a Data mining | |
650 | 4 | |a Machine learning | |
650 | 4 | |a Medical informatics applications | |
650 | 7 | |a Biomarkers |2 NLM | |
700 | 1 | |a Chamorro, Kevin |e verfasserin |4 aut | |
700 | 1 | |a Fors, Martha |e verfasserin |4 aut | |
700 | 1 | |a Mora, Francisco X |e verfasserin |4 aut | |
700 | 1 | |a Ramírez, Hégira |e verfasserin |4 aut | |
700 | 1 | |a Fernandez-Moreira, Esteban |e verfasserin |4 aut | |
700 | 1 | |a Ballaz, Santiago J |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Computers in biology and medicine |d 1970 |g 136(2021) vom: 15. Sept., Seite 104738 |w (DE-627)NLM000382272 |x 1879-0534 |7 nnns |
773 | 1 | 8 | |g volume:136 |g year:2021 |g day:15 |g month:09 |g pages:104738 |
856 | 4 | 0 | |u http://dx.doi.org/10.1016/j.compbiomed.2021.104738 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 136 |j 2021 |b 15 |c 09 |h 104738 |