NanoCLUST: a species-level analysis of 16S rRNA nanopore sequencing data
Abstract Summary NanoCLUST is an analysis pipeline for classification of amplicon-based full-length 16S rRNA nanopore reads. It is characterized by an unsupervised read clustering step, based on Uniform Manifold Approximation and Projection (UMAP), followed by the construction of a polished read and subsequent Blast classification. Here we demonstrate that NanoCLUST performs better than other state-of-the-art software in the characterization of two commercial mock communities, enabling accurate bacterial identification and abundance profile estimation at species level resolution.Availability and implementation Source code, test data and documentation of NanoCLUST is freely available at <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/genomicsITER/NanoCLUST">https://github.com/genomicsITER/NanoCLUST</jats:ext-link> under MIT License.Contact <jats:email>cfloresull.edu.es</jats:email>.
Medienart: |
Preprint |
---|
Erscheinungsjahr: |
2020 |
---|---|
Erschienen: |
2020 |
Enthalten in: |
bioRxiv.org - (2020) vom: 08. Dez. Zur Gesamtaufnahme - year:2020 |
---|
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Rodríguez-Pérez, Héctor [VerfasserIn] |
---|
Links: |
---|
doi: |
10.1101/2020.05.14.087353 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
XBI017915961 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | XBI017915961 | ||
003 | DE-627 | ||
005 | 20230429100347.0 | ||
007 | cr uuu---uuuuu | ||
008 | 200518s2020 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1101/2020.05.14.087353 |2 doi | |
035 | |a (DE-627)XBI017915961 | ||
035 | |a (DE-599)biorXiv10.1101/2020.05.14.087353 | ||
035 | |a (biorXiv)10.1101/2020.05.14.087353 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
082 | 0 | |a 570 |q DE-84 | |
100 | 1 | |a Rodríguez-Pérez, Héctor |e verfasserin |4 aut | |
245 | 1 | 0 | |a NanoCLUST: a species-level analysis of 16S rRNA nanopore sequencing data |
264 | 1 | |c 2020 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a Abstract Summary NanoCLUST is an analysis pipeline for classification of amplicon-based full-length 16S rRNA nanopore reads. It is characterized by an unsupervised read clustering step, based on Uniform Manifold Approximation and Projection (UMAP), followed by the construction of a polished read and subsequent Blast classification. Here we demonstrate that NanoCLUST performs better than other state-of-the-art software in the characterization of two commercial mock communities, enabling accurate bacterial identification and abundance profile estimation at species level resolution.Availability and implementation Source code, test data and documentation of NanoCLUST is freely available at <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/genomicsITER/NanoCLUST">https://github.com/genomicsITER/NanoCLUST</jats:ext-link> under MIT License.Contact <jats:email>cfloresull.edu.es</jats:email> | ||
700 | 1 | |a Ciuffreda, Laura |e verfasserin |4 aut | |
700 | 1 | |a Flores, Carlos |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t bioRxiv.org |g (2020) vom: 08. Dez. |
773 | 1 | 8 | |g year:2020 |g day:08 |g month:12 |
856 | 4 | 0 | |u https://doi.org/10.1093/bioinformatics/btaa900 |z lizenzpflichtig |3 Volltext |
856 | 4 | 0 | |u http://dx.doi.org/10.1101/2020.05.14.087353 |z kostenfrei |3 Volltext |
912 | |a GBV_XBI | ||
912 | |a SSG-OLC-PHA | ||
951 | |a AR | ||
952 | |j 2020 |b 08 |c 12 | ||
953 | |2 045F |a 570 |