BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis
© 2024. The Author(s)..
Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2024 |
---|---|
Erschienen: |
2024 |
Enthalten in: |
Zur Gesamtaufnahme - volume:15 |
---|---|
Enthalten in: |
Nature communications - 15(2024), 1 vom: 11. März, Seite 2179 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Qiu, Zhiguang [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 13.03.2024 Date Revised 15.03.2024 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1038/s41467-024-46539-7 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM36957415X |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM36957415X | ||
003 | DE-627 | ||
005 | 20240315233556.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240312s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41467-024-46539-7 |2 doi | |
028 | 5 | 2 | |a pubmed24n1330.xml |
035 | |a (DE-627)NLM36957415X | ||
035 | |a (NLM)38467684 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Qiu, Zhiguang |e verfasserin |4 aut | |
245 | 1 | 0 | |a BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 13.03.2024 | ||
500 | |a Date Revised 15.03.2024 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2024. The Author(s). | ||
520 | |a Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a basalt |2 NLM | |
650 | 7 | |a Silicates |2 NLM | |
700 | 1 | |a Yuan, Li |e verfasserin |4 aut | |
700 | 1 | |a Lian, Chun-Ang |e verfasserin |4 aut | |
700 | 1 | |a Lin, Bin |e verfasserin |4 aut | |
700 | 1 | |a Chen, Jie |e verfasserin |4 aut | |
700 | 1 | |a Mu, Rong |e verfasserin |4 aut | |
700 | 1 | |a Qiao, Xuejiao |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Liyu |e verfasserin |4 aut | |
700 | 1 | |a Xu, Zheng |e verfasserin |4 aut | |
700 | 1 | |a Fan, Lu |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Yunzeng |e verfasserin |4 aut | |
700 | 1 | |a Wang, Shanquan |e verfasserin |4 aut | |
700 | 1 | |a Li, Junyi |e verfasserin |4 aut | |
700 | 1 | |a Cao, Huiluo |e verfasserin |4 aut | |
700 | 1 | |a Li, Bing |e verfasserin |4 aut | |
700 | 1 | |a Chen, Baowei |e verfasserin |4 aut | |
700 | 1 | |a Song, Chi |e verfasserin |4 aut | |
700 | 1 | |a Liu, Yongxin |e verfasserin |4 aut | |
700 | 1 | |a Shi, Lili |e verfasserin |4 aut | |
700 | 1 | |a Tian, Yonghong |e verfasserin |4 aut | |
700 | 1 | |a Ni, Jinren |e verfasserin |4 aut | |
700 | 1 | |a Zhang, Tong |e verfasserin |4 aut | |
700 | 1 | |a Zhou, Jizhong |e verfasserin |4 aut | |
700 | 1 | |a Zhuang, Wei-Qin |e verfasserin |4 aut | |
700 | 1 | |a Yu, Ke |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Nature communications |d 2010 |g 15(2024), 1 vom: 11. März, Seite 2179 |w (DE-627)NLM199274525 |x 2041-1723 |7 nnns |
773 | 1 | 8 | |g volume:15 |g year:2024 |g number:1 |g day:11 |g month:03 |g pages:2179 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41467-024-46539-7 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 15 |j 2024 |e 1 |b 11 |c 03 |h 2179 |