Details der Publikation - Value of machine learning algorithms for predicting diabetes risk

Value of machine learning algorithms for predicting diabetes risk : A subset analysis from a real-world retrospective cohort study

© 2022 The Authors. Journal of Diabetes Investigation published by Asian Association for the Study of Diabetes (AASD) and John Wiley & Sons Australia, Ltd..

AIMS/INTRODUCTION: To compare the application value of different machine learning (ML) algorithms for diabetes risk prediction.

MATERIALS AND METHODS: This is a 3-year retrospective cohort study with a total of 3,687 participants being included in the data analysis. Modeling variable screening and predictive model building were carried out using logistic regression (LR) analysis and 10-fold cross-validation, respectively. In total, six different ML algorithms, including random forests, light gradient boosting machine, extreme gradient boosting, adaptive boosting (AdaBoost), multi-layer perceptrons and gaussian naive bayes were used for model construction. Model performance was mainly evaluated by the area under the receiver operating characteristic curve. The best performing ML model was selected for comparison with the traditional LR model and visualized using Shapley additive explanations.

RESULTS: A total of eight risk factors most associated with the development of diabetes were identified by univariate and multivariate LR analysis, and they were visualized in the form of a nomogram. Among the six different ML models, the random forests model had the best predictive performance. After 10-fold cross-validation, its optimal model has an area under the receiver operating characteristic value of 0.855 (95% confidence interval [CI] 0.823-0.886) in the training set and 0.835 (95% CI 0.779-0.892) in the test set. In the traditional LR model, its area under the receiver operating characteristic value is 0.840 (95% CI 0.814-0.866) in the training set and 0.834 (95% CI 0.785-0.884) in the test set.

CONCLUSIONS: In the real-world epidemiological research, the combination of traditional variable screening and ML algorithm to construct a diabetes risk prediction model has satisfactory clinical application value.

Medienart:	E-Artikel

Erscheinungsjahr:	2023
Erschienen:	2023

Enthalten in:	Zur Gesamtaufnahme - volume:14
Enthalten in:	Journal of diabetes investigation - 14(2023), 2 vom: 08. Feb., Seite 309-320

Sprache:	Englisch

Beteiligte Personen:	Mao, Yaqian [VerfasserIn] Zhu, Zheng [VerfasserIn] Pan, Shuyao [VerfasserIn] Lin, Wei [VerfasserIn] Liang, Jixing [VerfasserIn] Huang, Huibin [VerfasserIn] Li, Liantao [VerfasserIn] Wen, Junping [VerfasserIn] Chen, Gang [VerfasserIn]

Links:	Volltext

Themen:	Diabetes Journal Article Machine learning algorithms Predictive model

Anmerkungen:	Date Completed 02.02.2023 Date Revised 03.02.2023 published: Print-Electronic Citation Status MEDLINE

doi:	10.1111/jdi.13937

funding:
Förderinstitution / Projekttitel:

PPN (Katalog-ID):	NLM348590105

Internformat


LEADER	01000naa a22002652 4500
001	NLM348590105
003	DE-627
005	20231226040621.0
007	cr uuu---uuuuu
008	231226s2023 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1111/jdi.13937 \|2 doi
028	5	2	\|a pubmed24n1161.xml
035			\|a (DE-627)NLM348590105
035			\|a (NLM)36345236
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Mao, Yaqian \|e verfasserin \|4 aut
245	1	0	\|a Value of machine learning algorithms for predicting diabetes risk \|b A subset analysis from a real-world retrospective cohort study
264		1	\|c 2023
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 02.02.2023
500			\|a Date Revised 03.02.2023
500			\|a published: Print-Electronic
500			\|a Citation Status MEDLINE
520			\|a © 2022 The Authors. Journal of Diabetes Investigation published by Asian Association for the Study of Diabetes (AASD) and John Wiley & Sons Australia, Ltd.
520			\|a AIMS/INTRODUCTION: To compare the application value of different machine learning (ML) algorithms for diabetes risk prediction
520			\|a MATERIALS AND METHODS: This is a 3-year retrospective cohort study with a total of 3,687 participants being included in the data analysis. Modeling variable screening and predictive model building were carried out using logistic regression (LR) analysis and 10-fold cross-validation, respectively. In total, six different ML algorithms, including random forests, light gradient boosting machine, extreme gradient boosting, adaptive boosting (AdaBoost), multi-layer perceptrons and gaussian naive bayes were used for model construction. Model performance was mainly evaluated by the area under the receiver operating characteristic curve. The best performing ML model was selected for comparison with the traditional LR model and visualized using Shapley additive explanations
520			\|a RESULTS: A total of eight risk factors most associated with the development of diabetes were identified by univariate and multivariate LR analysis, and they were visualized in the form of a nomogram. Among the six different ML models, the random forests model had the best predictive performance. After 10-fold cross-validation, its optimal model has an area under the receiver operating characteristic value of 0.855 (95% confidence interval [CI] 0.823-0.886) in the training set and 0.835 (95% CI 0.779-0.892) in the test set. In the traditional LR model, its area under the receiver operating characteristic value is 0.840 (95% CI 0.814-0.866) in the training set and 0.834 (95% CI 0.785-0.884) in the test set
520			\|a CONCLUSIONS: In the real-world epidemiological research, the combination of traditional variable screening and ML algorithm to construct a diabetes risk prediction model has satisfactory clinical application value
650		4	\|a Journal Article
650		4	\|a Diabetes
650		4	\|a Machine learning algorithms
650		4	\|a Predictive model
700	1		\|a Zhu, Zheng \|e verfasserin \|4 aut
700	1		\|a Pan, Shuyao \|e verfasserin \|4 aut
700	1		\|a Lin, Wei \|e verfasserin \|4 aut
700	1		\|a Liang, Jixing \|e verfasserin \|4 aut
700	1		\|a Huang, Huibin \|e verfasserin \|4 aut
700	1		\|a Li, Liantao \|e verfasserin \|4 aut
700	1		\|a Wen, Junping \|e verfasserin \|4 aut
700	1		\|a Chen, Gang \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t Journal of diabetes investigation \|d 2010 \|g 14(2023), 2 vom: 08. Feb., Seite 309-320 \|w (DE-627)NLM21944952X \|x 2040-1124 \|7 nnns
773	1	8	\|g volume:14 \|g year:2023 \|g number:2 \|g day:08 \|g month:02 \|g pages:309-320
856	4	0	\|u http://dx.doi.org/10.1111/jdi.13937 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a GBV_NLM
951			\|a AR
952			\|d 14 \|j 2023 \|e 2 \|b 08 \|c 02 \|h 309-320

Value of machine learning algorithms for predicting diabetes risk : A subset analysis from a real-world retrospective cohort study

Zugang & Verfügbarkeit

Zugehörige Publikationen/Bände