MetaPhage: an automated pipeline for analyzing, annotating, and classifying bacteriophages in metagenomics sequencing data

Abstract In the last decades, a great interest has emerged in the study and characterisation of the microbiota, especially the human gut microbiota, demonstrating that commensal microorganisms play a pivotal role in normal anatomical development and physiological function of the human body. To better understand the complex bacterial dynamics that characterize different environments, bacteriophage predation and gene transfer need to be considered as well, as they are important factors that may contribute to controlling the density, diversity, and network interactions among bacterial communities. To date, a variety of bacteriophage identification tools have been developed, differing on phage mining strategies, input files requested and results produced; however, new users approaching the bacteriophage analysis might struggle in untangling the variety of methods and comparing the different results produced. Here we present MetaPhage, a comprehensive reads-to-report pipeline that streamlines the use of multiple miners and generates an exhaustive report to both summarize and visualize the key findings and to enable further exploration of specific results with interactive filterable tables. The pipeline is implemented in Nextflow, a widely adopted workflow manager, that enables an optimized parallelization of the tasks on different premises, from local server to the cloud, and ensures reproducible results using containerized packages. MetaPhage is designed to allow scalability, reproducibility and to be easily expanded with new miners and methods, in a field that is constantly expanding. MetaPhage is freely available under a GPL-3.0 license at <jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://github.com/MattiaPandolfoVR/MetaPhage">https://github.com/MattiaPandolfoVR/MetaPhage</jats:ext-link>..

Medienart:

Preprint

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

bioRxiv.org - (2022) vom: 28. Okt. Zur Gesamtaufnahme - year:2022

Sprache:

Englisch

Beteiligte Personen:

Pandolfo, Mattia [VerfasserIn]
Telatin, Andrea [VerfasserIn]
Lazzari, Gioele [VerfasserIn]
Adriaenssens, Evelien M. [VerfasserIn]
Vitulo, Nicola [VerfasserIn]

Links:

Volltext [lizenzpflichtig]
Volltext [kostenfrei]

Themen:

570
Biology

doi:

10.1101/2022.04.17.488583

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

XBI035780525