Patterns of high-risk drinking among medical students : A web-based survey with machine learning

Copyright © 2021 Elsevier Ltd. All rights reserved.

BACKGROUND: Prior studies have found increased rates of alcohol consumption among physicians and medical students. The present study aims to build machine learning (ML) models to identify patterns of high-risk drinking (HRD), including alcohol use disorder, within this population.

METHODS: We analyzed data collected through a web-based survey among Brazilian medical students. Variables included sociodemographic data, personal information, university status, and mental health. Stratification for HRD was carried out based on the AUDIT-C scores. Three ML algorithms were used to build classifiers to predict HRD among medical students: elastic net regularization, random forest, and artificial neural networks. Model interpretation techniques were adopted to assess the most influential predictors for models' decisions, which represent potential factors associated with HRD.
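The AUDIT-C stratification described in the Methods can be illustrated with a minimal Python sketch. The abstract does not state which cut-offs the study applied; the thresholds below (a score of 4 or more for men, 3 or more for women) are commonly cited values and are an assumption here, as are the function names.

```python
# Assumed cut-offs (commonly cited for AUDIT-C). The abstract does not say
# which thresholds the study actually used.
ASSUMED_CUTOFFS = {"male": 4, "female": 3}


def audit_c_score(items):
    """Sum the three AUDIT-C items, each scored 0-4 (total range 0-12)."""
    if len(items) != 3 or any(not 0 <= item <= 4 for item in items):
        raise ValueError("AUDIT-C needs exactly three items scored 0-4")
    return sum(items)


def is_high_risk(items, sex):
    """Return True if the score meets the assumed cut-off for this sex."""
    return audit_c_score(items) >= ASSUMED_CUTOFFS[sex]


# Hypothetical respondent: the same answers fall on different sides of the
# assumed sex-specific thresholds.
answers = [1, 1, 1]
print(audit_c_score(answers))           # 3
print(is_high_risk(answers, "female"))  # True
print(is_high_risk(answers, "male"))    # False
```

The resulting binary label (HRD versus LRD) is the kind of target a classifier such as those named in the Methods would be trained to predict.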

RESULTS: A total of 4840 medical students were included in the study. The prevalence of HRD was 53.03%. The three ML models distinguished individuals with HRD from those with low-risk drinking (LRD) with very similar performance. The average AUC scores in the cross-validation procedure were around 0.72, and this performance was replicated in the test set. The most important features for the ML models were the use of tobacco and cannabis, monthly family income, marital status, sexual orientation, and physical activities.
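An AUC of about 0.72 can be read as the probability that a randomly chosen student with HRD receives a higher predicted score than a randomly chosen student with LRD. A minimal sketch of that calculation in plain Python follows; the function name and the toy labels and scores are illustrative assumptions, not data from the study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example: 1 = HRD, 0 = LRD, with hypothetical model scores.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.4, 0.5, 0.2, 0.7, 0.6]
# 7 of the 9 positive/negative pairs are ranked correctly.
print(round(roc_auc(labels, scores), 3))  # 0.778
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, so 0.72 indicates moderate discrimination.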

CONCLUSIONS: This study proposes that ML models may serve as tools for initial screening of students regarding their susceptibility for at-risk drinking or alcohol use disorder. In addition, we identified several key factors associated with HRD that could be further investigated and explored for preventive and assistance measures.

Media type:

E-article

Year of publication:

2021

Published:

2021

Contained in:

Computers in biology and medicine - 136 (2021), 25 Sept., page 104747

Language:

English

Contributors:

Marcon, Grasiela [author]
de Ávila Pereira, Flávia [author]
Zimerman, Aline [author]
da Silva, Bruno Castro [author]
von Diemen, Lisia [author]
Passos, Ives Cavalcante [author]
Recamonde-Mendoza, Mariana [author]

Links:

Full text

Subjects:

Classification models
High-risk drinking
Journal Article
Machine learning
Medical students
Research Support, Non-U.S. Gov't

Notes:

Date Completed 11.10.2021

Date Revised 11.10.2021

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1016/j.compbiomed.2021.104747

PPN (catalog ID):

NLM329921401