Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians
© 2022. The Author(s)..
Despite extensive efforts to address it, the vastness of uncharacterized 'dark matter' microbial genetic diversity can impact short-read sequencing based metagenomic studies. Population-specific biases in genomic reference databases can further compound this problem. Leveraging advances in hybrid assembly (using short and long reads) and Hi-C technologies in a cross-sectional survey, we deeply characterized 109 gut microbiomes from three ethnicities in Singapore to comprehensively reconstruct 4497 medium and high-quality metagenome assembled genomes, 1708 of which were missing in short-read only analysis and with >28× N50 improvement. Species-level clustering identified 70 (>10% of total) novel gut species out of 685, improved reference genomes for 363 species (53% of total), and discovered 3413 strains unique to these populations. Among the top 10 most abundant gut bacteria in our study, one of the species and >80% of strains were unrepresented in existing databases. Annotation of biosynthetic gene clusters (BGCs) uncovered more than 27,000 BGCs with a large fraction (36-88%) unrepresented in current databases, and with several unique clusters predicted to produce bacteriocins that could significantly alter microbiome community structure. These results reveal significant uncharacterized gut microbial diversity in Southeast Asian populations and highlight the utility of hybrid metagenomic references for bioprospecting and disease-focused studies.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:13 |
---|---|
Enthalten in: |
Nature communications - 13(2022), 1 vom: 13. Okt., Seite 6044 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Gounot, Jean-Sebastien [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 17.10.2022 Date Revised 22.12.2022 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1038/s41467-022-33782-z |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM347446140 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM347446140 | ||
003 | DE-627 | ||
005 | 20231226033837.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41467-022-33782-z |2 doi | |
028 | 5 | 2 | |a pubmed24n1158.xml |
035 | |a (DE-627)NLM347446140 | ||
035 | |a (NLM)36229545 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Gounot, Jean-Sebastien |e verfasserin |4 aut | |
245 | 1 | 0 | |a Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 17.10.2022 | ||
500 | |a Date Revised 22.12.2022 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2022. The Author(s). | ||
520 | |a Despite extensive efforts to address it, the vastness of uncharacterized 'dark matter' microbial genetic diversity can impact short-read sequencing based metagenomic studies. Population-specific biases in genomic reference databases can further compound this problem. Leveraging advances in hybrid assembly (using short and long reads) and Hi-C technologies in a cross-sectional survey, we deeply characterized 109 gut microbiomes from three ethnicities in Singapore to comprehensively reconstruct 4497 medium and high-quality metagenome assembled genomes, 1708 of which were missing in short-read only analysis and with >28× N50 improvement. Species-level clustering identified 70 (>10% of total) novel gut species out of 685, improved reference genomes for 363 species (53% of total), and discovered 3413 strains unique to these populations. Among the top 10 most abundant gut bacteria in our study, one of the species and >80% of strains were unrepresented in existing databases. Annotation of biosynthetic gene clusters (BGCs) uncovered more than 27,000 BGCs with a large fraction (36-88%) unrepresented in current databases, and with several unique clusters predicted to produce bacteriocins that could significantly alter microbiome community structure. These results reveal significant uncharacterized gut microbial diversity in Southeast Asian populations and highlight the utility of hybrid metagenomic references for bioprospecting and disease-focused studies | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 7 | |a Bacteriocins |2 NLM | |
700 | 1 | |a Chia, Minghao |e verfasserin |4 aut | |
700 | 1 | |a Bertrand, Denis |e verfasserin |4 aut | |
700 | 1 | |a Saw, Woei-Yuh |e verfasserin |4 aut | |
700 | 1 | |a Ravikrishnan, Aarthi |e verfasserin |4 aut | |
700 | 1 | |a Low, Adrian |e verfasserin |4 aut | |
700 | 1 | |a Ding, Yichen |e verfasserin |4 aut | |
700 | 1 | |a Ng, Amanda Hui Qi |e verfasserin |4 aut | |
700 | 1 | |a Tan, Linda Wei Lin |e verfasserin |4 aut | |
700 | 1 | |a Teo, Yik-Ying |e verfasserin |4 aut | |
700 | 1 | |a Seedorf, Henning |e verfasserin |4 aut | |
700 | 1 | |a Nagarajan, Niranjan |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Nature communications |d 2010 |g 13(2022), 1 vom: 13. Okt., Seite 6044 |w (DE-627)NLM199274525 |x 2041-1723 |7 nnns |
773 | 1 | 8 | |g volume:13 |g year:2022 |g number:1 |g day:13 |g month:10 |g pages:6044 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41467-022-33782-z |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 13 |j 2022 |e 1 |b 13 |c 10 |h 6044 |