Using gut microbiota as a diagnostic tool for colorectal cancer : machine learning techniques reveal promising results

Introduction. Increasing evidence suggests a correlation between gut microbiota and colorectal cancer (CRC).
Hypothesis/Gap Statement. However, few studies have used gut microbiota as a diagnostic biomarker for CRC.
Aim. The objective of this study was to explore whether a machine learning (ML) model based on gut microbiota could be used to diagnose CRC and identify key biomarkers in the model.
Methodology. We sequenced the 16S rRNA gene from faecal samples of 38 participants, including 17 healthy subjects and 21 CRC patients. Eight supervised ML algorithms were used to diagnose CRC based on faecal microbiota operational taxonomic units (OTUs), and the models were evaluated in terms of identification, calibration and clinical practicality for optimal modelling parameters. Finally, the key gut microbiota was identified using the random forest (RF) algorithm.
Results. We found that CRC was associated with the dysregulation of gut microbiota. Through a comprehensive evaluation of supervised ML algorithms, we found that different algorithms had significantly different prediction performance using faecal microbiomes. Different data screening methods played an important role in optimization of the prediction models. We found that naïve Bayes algorithms [NB, accuracy=0.917, area under the curve (AUC)=0.926], RF (accuracy=0.750, AUC=0.926) and logistic regression (LR, accuracy=0.750, AUC=0.889) had high predictive potential for CRC. Furthermore, important features in the model, namely s__metagenome_g__Lachnospiraceae_ND3007_group (AUC=0.814), s__Escherichia_coli_g__Escherichia-Shigella (AUC=0.784) and s__unclassified_g__Prevotella (AUC=0.750), could each be used as diagnostic biomarkers of CRC.
Conclusions. Our results suggested an association between gut microbiota dysregulation and CRC, and demonstrated the feasibility of the gut microbiota to diagnose cancer. The bacteria s__metagenome_g__Lachnospiraceae_ND3007_group, s__Escherichia_coli_g__Escherichia-Shigella and s__unclassified_g__Prevotella were key biomarkers for CRC.
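
The abstract reports both accuracy and AUC for each model and for single taxa. As a rough illustration of how those two metrics differ, the sketch below scores one feature against case/control labels. It is not the authors' pipeline: the function names, the threshold and the abundance values are hypothetical and are not data from the study.

```python
def auc_from_scores(scores, labels):
    """AUC as the fraction of (case, control) pairs in which the case
    has the higher score; tied pairs count as half (Mann-Whitney form)."""
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in cases:
        for h in controls:
            if c > h:
                wins += 1.0
            elif c == h:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def accuracy(scores, labels, threshold):
    """Share of samples classified correctly when score >= threshold
    is called a case. Unlike AUC, this depends on the chosen cut-off."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return correct / len(labels)


# Hypothetical relative abundances of a single taxon (illustrative only).
abundance = [0.02, 0.05, 0.01, 0.08, 0.12, 0.07, 0.15, 0.03]
is_crc = [0, 0, 0, 0, 1, 1, 1, 1]  # 0 = healthy, 1 = CRC

print(auc_from_scores(abundance, is_crc))   # 0.8125
print(accuracy(abundance, is_crc, 0.06))    # 0.75
```

AUC summarises ranking quality over all possible cut-offs, whereas accuracy is tied to one cut-off, which is one reason two models can share an AUC yet differ in accuracy. For a taxon that is depleted rather than enriched in cases, this function returns a value below 0.5, and 1 minus that value gives the equivalent discriminative strength.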

Media type:

E-article

Year of publication:

2023

Published:

2023

Contained in:

To the complete record - volume:72

Contained in:

Journal of medical microbiology - 72(2023), 6, dated 9 June

Language:

English

Contributors:

Lu, Fang [Author]
Lei, Ting [Author]
Zhou, Jie [Author]
Liang, Hao [Author]
Cui, Ping [Author]
Zuo, Taiping [Author]
Ye, Li [Author]
Chen, Hui [Author]
Huang, Jiegang [Author]

Subjects:

16S rRNA gene sequencing
Biomarker
Colorectal cancer
Diagnosis
Gut microbiome
Journal Article
Machine learning
RNA, Ribosomal, 16S

Notes:

Date Completed 09.06.2023

Date Revised 09.06.2023

published: Print

Citation Status MEDLINE

doi:

10.1099/jmm.0.001699

funding:

Funding institution / Project title:

PPN (catalog ID):

NLM357903889