Using gut microbiota as a diagnostic tool for colorectal cancer : machine learning techniques reveal promising results
Introduction. Increasing evidence suggests a correlation between gut microbiota and colorectal cancer (CRC).Hypothesis/Gap Statement. However, few studies have used gut microbiota as a diagnostic biomarker for CRC.Aim. The objective of this study was to explore whether a machine learning (ML) model based on gut microbiota could be used to diagnose CRC and identify key biomarkers in the model.Methodology. We sequenced the 16S rRNA gene from faecal samples of 38 participants, including 17 healthy subjects and 21 CRC patients. Eight supervised ML algorithms were used to diagnose CRC based on faecal microbiota operational taxonomic units (OTUs), and the models were evaluated in terms of identification, calibration and clinical practicality for optimal modelling parameters. Finally, the key gut microbiota was identified using the random forest (RF) algorithm.Results. We found that CRC was associated with the dysregulation of gut microbiota. Through a comprehensive evaluation of supervised ML algorithms, we found that different algorithms had significantly different prediction performance using faecal microbiomes. Different data screening methods played an important role in optimization of the prediction models. We found that naïve Bayes algorithms [NB, accuracy=0.917, area under the curve (AUC)=0.926], RF (accuracy=0.750, AUC=0.926) and logistic regression (LR, accuracy=0.750, AUC=0.889) had high predictive potential for CRC. Furthermore, important features in the model, namely s__metagenome_g__Lachnospiraceae_ND3007_group (AUC=0.814), s__Escherichia_coli_g__Escherichia-Shigella (AUC=0.784) and s__unclassified_g__Prevotella (AUC=0.750), could each be used as diagnostic biomarkers of CRC.Conclusions. Our results suggested an association between gut microbiota dysregulation and CRC, and demonstrated the feasibility of the gut microbiota to diagnose cancer. The bacteria s__metagenome_g__Lachnospiraceae_ND3007_group, s__Escherichia_coli_g__Escherichia-Shigella and s__unclassified_g__Prevotella were key biomarkers for CRC.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:72 |
---|---|
Enthalten in: |
Journal of medical microbiology - 72(2023), 6 vom: 09. Juni |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Lu, Fang [VerfasserIn] |
---|
Links: |
---|
Themen: |
16S rRNA gene sequencing |
---|
Anmerkungen: |
Date Completed 09.06.2023 Date Revised 09.06.2023 published: Print Citation Status MEDLINE |
---|
doi: |
10.1099/jmm.0.001699 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM357903889 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM357903889 | ||
003 | DE-627 | ||
005 | 20231226073608.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1099/jmm.0.001699 |2 doi | |
028 | 5 | 2 | |a pubmed24n1192.xml |
035 | |a (DE-627)NLM357903889 | ||
035 | |a (NLM)37288545 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Lu, Fang |e verfasserin |4 aut | |
245 | 1 | 0 | |a Using gut microbiota as a diagnostic tool for colorectal cancer |b machine learning techniques reveal promising results |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 09.06.2023 | ||
500 | |a Date Revised 09.06.2023 | ||
500 | |a published: Print | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Introduction. Increasing evidence suggests a correlation between gut microbiota and colorectal cancer (CRC).Hypothesis/Gap Statement. However, few studies have used gut microbiota as a diagnostic biomarker for CRC.Aim. The objective of this study was to explore whether a machine learning (ML) model based on gut microbiota could be used to diagnose CRC and identify key biomarkers in the model.Methodology. We sequenced the 16S rRNA gene from faecal samples of 38 participants, including 17 healthy subjects and 21 CRC patients. Eight supervised ML algorithms were used to diagnose CRC based on faecal microbiota operational taxonomic units (OTUs), and the models were evaluated in terms of identification, calibration and clinical practicality for optimal modelling parameters. Finally, the key gut microbiota was identified using the random forest (RF) algorithm.Results. We found that CRC was associated with the dysregulation of gut microbiota. Through a comprehensive evaluation of supervised ML algorithms, we found that different algorithms had significantly different prediction performance using faecal microbiomes. Different data screening methods played an important role in optimization of the prediction models. We found that naïve Bayes algorithms [NB, accuracy=0.917, area under the curve (AUC)=0.926], RF (accuracy=0.750, AUC=0.926) and logistic regression (LR, accuracy=0.750, AUC=0.889) had high predictive potential for CRC. Furthermore, important features in the model, namely s__metagenome_g__Lachnospiraceae_ND3007_group (AUC=0.814), s__Escherichia_coli_g__Escherichia-Shigella (AUC=0.784) and s__unclassified_g__Prevotella (AUC=0.750), could each be used as diagnostic biomarkers of CRC.Conclusions. Our results suggested an association between gut microbiota dysregulation and CRC, and demonstrated the feasibility of the gut microbiota to diagnose cancer. The bacteria s__metagenome_g__Lachnospiraceae_ND3007_group, s__Escherichia_coli_g__Escherichia-Shigella and s__unclassified_g__Prevotella were key biomarkers for CRC | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a 16S rRNA gene sequencing | |
650 | 4 | |a biomarker | |
650 | 4 | |a colorectal cancer | |
650 | 4 | |a diagnosis | |
650 | 4 | |a gut microbiome | |
650 | 4 | |a machine learning | |
650 | 7 | |a RNA, Ribosomal, 16S |2 NLM | |
700 | 1 | |a Lei, Ting |e verfasserin |4 aut | |
700 | 1 | |a Zhou, Jie |e verfasserin |4 aut | |
700 | 1 | |a Liang, Hao |e verfasserin |4 aut | |
700 | 1 | |a Cui, Ping |e verfasserin |4 aut | |
700 | 1 | |a Zuo, Taiping |e verfasserin |4 aut | |
700 | 1 | |a Ye, Li |e verfasserin |4 aut | |
700 | 1 | |a Chen, Hui |e verfasserin |4 aut | |
700 | 1 | |a Huang, Jiegang |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of medical microbiology |d 1968 |g 72(2023), 6 vom: 09. Juni |w (DE-627)NLM000052272 |x 1473-5644 |7 nnns |
773 | 1 | 8 | |g volume:72 |g year:2023 |g number:6 |g day:09 |g month:06 |
856 | 4 | 0 | |u http://dx.doi.org/10.1099/jmm.0.001699 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 72 |j 2023 |e 6 |b 09 |c 06 |