Publication details

A novel stacking ensemble for detecting three types of diabetes mellitus using a Saudi Arabian dataset : Pre-diabetes, T1DM, and T2DM

Copyright © 2022 The Authors. Published by Elsevier Ltd. All rights reserved.

Glucose is the primary source of energy for cells, the building blocks of life. Insulin enables cells to take up glucose and thereby supports the metabolic processes that keep people alive. An imbalance in glucose level is a sign of diabetes mellitus (DM), a common chronic disease. DM leads to long-term complications, such as blindness, kidney failure, and heart disease, which impair quality of life. In Saudi Arabia, a ten-fold increase in diabetic cases has been documented within the last three years. DM is broadly categorized as Type 1 Diabetes (T1DM), Type 2 Diabetes (T2DM), and Pre-diabetes. Identifying the correct type is sometimes difficult for medical professionals, which complicates managing the progression of the illness. Intensive efforts have been made to predict T2DM; however, few studies focus on accurately identifying T1DM and Pre-diabetes. This study therefore uses Machine Learning (ML) to distinguish and predict the three types of diabetes, based on a dataset from a Saudi Arabian hospital, to help control their progression. Four experiments were conducted to obtain the best results, using several algorithms: Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (K-NN), Decision Tree (DT), Bagging, and Stacking. In experiments 2, 3, and 4, the Synthetic Minority Oversampling Technique (SMOTE) was applied to balance the dataset. The empirical results were promising for the novel Stacking model, which combined Bagging K-NN, Bagging DT, and K-NN with a K-NN meta-classifier and attained an accuracy, weighted recall, weighted precision, and Cohen's kappa score of 94.48%, 94.48%, 94.70%, and 0.9172, respectively. Permutation feature importance identified five principal features that significantly affect model accuracy: Education, AntiDiab, Insulin, Nutrition, and Sex.
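
The four evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `evaluate`, the class labels, and the toy predictions are hypothetical, and the weighting is assumed to be by class support (the usual meaning of "weighted" recall and precision). Under that assumption, weighted recall is always equal to accuracy, which is consistent with the two identical 94.48% figures reported above.

```python
from collections import Counter


def evaluate(y_true, y_pred):
    """Return accuracy, support-weighted recall/precision, and Cohen's kappa."""
    n = len(y_true)
    labels = sorted(set(y_true) | set(y_pred))
    # Correct predictions per class, true-class counts, predicted-class counts.
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    support = Counter(y_true)
    predicted = Counter(y_pred)

    accuracy = sum(hits.values()) / n
    # Each class's recall/precision is weighted by its share of true samples.
    weighted_recall = sum(
        (support[c] / n) * (hits[c] / support[c]) for c in labels if support[c]
    )
    weighted_precision = sum(
        (support[c] / n) * (hits[c] / predicted[c]) for c in labels if predicted[c]
    )
    # Cohen's kappa: observed agreement corrected for chance agreement.
    expected = sum(support[c] * predicted[c] for c in labels) / (n * n)
    # Degenerate single-class case: returning 0.0 is a convention chosen here.
    kappa = (accuracy - expected) / (1 - expected) if expected < 1 else 0.0

    return {
        "accuracy": accuracy,
        "weighted_recall": weighted_recall,
        "weighted_precision": weighted_precision,
        "kappa": kappa,
    }


# Hypothetical toy labels, not data from the study.
y_true = ["pre", "pre", "pre", "t1", "t1", "t1", "t2", "t2", "t2", "t2"]
y_pred = ["pre", "pre", "t2", "t1", "t1", "t1", "t2", "t2", "t2", "pre"]
metrics = evaluate(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Kappa is typically lower than accuracy because it subtracts the agreement that would be expected by chance from the class frequencies alone.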

Media type:	E-article

Year of publication:	2022
Published:	2022

Contained in:	Computers in biology and medicine - 147 (2022), 19 Aug., page 105757

Language:	English

Contributors:	Gollapalli, Mohammed [author]; Alansari, Aisha [author]; Alkhorasani, Heba [author]; Alsubaii, Meelaf [author]; Sakloua, Rasha [author]; Alzahrani, Reem [author]; Al-Hariri, Mohammed [author]; Alfares, Maiadah [author]; AlKhafaji, Dania [author]; Al Argan, Reem [author]; Albaker, Waleed [author]

Links:	Full text

Topics:	Glucose; IY9XDZ35W2; Insulins; Journal Article; Machine learning; Permutation feature importance; Pre-diabetes; Stacking; Type 1 diabetes; Type 2 diabetes

Notes:	Date Completed 13.07.2022; Date Revised 16.09.2022; published: Print-Electronic; Citation Status MEDLINE

doi:	10.1016/j.compbiomed.2022.105757

PPN (catalog ID):	NLM342978578

Internal format


LEADER	01000naa a22002652 4500
001	NLM342978578
003	DE-627
005	20231226015342.0
007	cr uuu---uuuuu
008	231226s2022 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1016/j.compbiomed.2022.105757 \|2 doi
028	5	2	\|a pubmed24n1143.xml
035			\|a (DE-627)NLM342978578
035			\|a (NLM)35777087
035			\|a (PII)S0010-4825(22)00531-5
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Gollapalli, Mohammed \|e verfasserin \|4 aut
245	1	2	\|a A novel stacking ensemble for detecting three types of diabetes mellitus using a Saudi Arabian dataset \|b Pre-diabetes, T1DM, and T2DM
264		1	\|c 2022
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 13.07.2022
500			\|a Date Revised 16.09.2022
500			\|a published: Print-Electronic
500			\|a Citation Status MEDLINE
520			\|a Copyright © 2022 The Authors. Published by Elsevier Ltd. All rights reserved.
520			\|a Glucose is the primary source of energy for cells, the building blocks of life. Insulin enables cells to take up glucose and thereby supports the metabolic processes that keep people alive. An imbalance in glucose level is a sign of diabetes mellitus (DM), a common chronic disease. DM leads to long-term complications, such as blindness, kidney failure, and heart disease, which impair quality of life. In Saudi Arabia, a ten-fold increase in diabetic cases has been documented within the last three years. DM is broadly categorized as Type 1 Diabetes (T1DM), Type 2 Diabetes (T2DM), and Pre-diabetes. Identifying the correct type is sometimes difficult for medical professionals, which complicates managing the progression of the illness. Intensive efforts have been made to predict T2DM; however, few studies focus on accurately identifying T1DM and Pre-diabetes. This study therefore uses Machine Learning (ML) to distinguish and predict the three types of diabetes, based on a dataset from a Saudi Arabian hospital, to help control their progression. Four experiments were conducted to obtain the best results, using several algorithms: Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (K-NN), Decision Tree (DT), Bagging, and Stacking. In experiments 2, 3, and 4, the Synthetic Minority Oversampling Technique (SMOTE) was applied to balance the dataset. The empirical results were promising for the novel Stacking model, which combined Bagging K-NN, Bagging DT, and K-NN with a K-NN meta-classifier and attained an accuracy, weighted recall, weighted precision, and Cohen's kappa score of 94.48%, 94.48%, 94.70%, and 0.9172, respectively. Permutation feature importance identified five principal features that significantly affect model accuracy: Education, AntiDiab, Insulin, Nutrition, and Sex.
650		4	\|a Journal Article
650		4	\|a Machine learning
650		4	\|a Permutation feature importance
650		4	\|a Pre-diabetes
650		4	\|a Stacking
650		4	\|a Type 1 diabetes
650		4	\|a Type 2 diabetes
650		7	\|a Insulins \|2 NLM
650		7	\|a Glucose \|2 NLM
650		7	\|a IY9XDZ35W2 \|2 NLM
700	1		\|a Alansari, Aisha \|e verfasserin \|4 aut
700	1		\|a Alkhorasani, Heba \|e verfasserin \|4 aut
700	1		\|a Alsubaii, Meelaf \|e verfasserin \|4 aut
700	1		\|a Sakloua, Rasha \|e verfasserin \|4 aut
700	1		\|a Alzahrani, Reem \|e verfasserin \|4 aut
700	1		\|a Al-Hariri, Mohammed \|e verfasserin \|4 aut
700	1		\|a Alfares, Maiadah \|e verfasserin \|4 aut
700	1		\|a AlKhafaji, Dania \|e verfasserin \|4 aut
700	1		\|a Al Argan, Reem \|e verfasserin \|4 aut
700	1		\|a Albaker, Waleed \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t Computers in biology and medicine \|d 1970 \|g 147(2022) vom: 19. Aug., Seite 105757 \|w (DE-627)NLM000382272 \|x 1879-0534 \|7 nnns
773	1	8	\|g volume:147 \|g year:2022 \|g day:19 \|g month:08 \|g pages:105757
856	4	0	\|u http://dx.doi.org/10.1016/j.compbiomed.2022.105757 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a GBV_NLM
951			\|a AR
952			\|d 147 \|j 2022 \|b 19 \|c 08 \|h 105757
