FRAGSION : ultra-fast protein fragment library generation by IOHMM sampling

© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissionsoup.com..

MOTIVATION: Speed, accuracy and robustness of building protein fragment library have important implications in de novo protein structure prediction since fragment-based methods are one of the most successful approaches in template-free modeling (FM). Majority of the existing fragment detection methods rely on database-driven search strategies to identify candidate fragments, which are inherently time-consuming and often hinder the possibility to locate longer fragments due to the limited sizes of databases. Also, it is difficult to alleviate the effect of noisy sequence-based predicted features such as secondary structures on the quality of fragment.

RESULTS: Here, we present FRAGSION, a database-free method to efficiently generate protein fragment library by sampling from an Input-Output Hidden Markov Model. FRAGSION offers some unique features compared to existing approaches in that it (i) is lightning-fast, consuming only few seconds of CPU time to generate fragment library for a protein of typical length (300 residues); (ii) can generate dynamic-size fragments of any length (even for the whole protein sequence) and (iii) offers ways to handle noise in predicted secondary structure during fragment sampling. On a FM dataset from the most recent Critical Assessment of Structure Prediction, we demonstrate that FGRAGSION provides advantages over the state-of-the-art fragment picking protocol of ROSETTA suite by speeding up computation by several orders of magnitude while achieving comparable performance in fragment quality.

AVAILABILITY AND IMPLEMENTATION: Source code and executable versions of FRAGSION for Linux and MacOS is freely available to non-commercial users at http://sysbio.rnet.missouri.edu/FRAGSION/ It is bundled with a manual and example data.

CONTACT: chengjimissouri.edu.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Medienart:

E-Artikel

Erscheinungsjahr:

2016

Erschienen:

2016

Enthalten in:

Zur Gesamtaufnahme - volume:32

Enthalten in:

Bioinformatics (Oxford, England) - 32(2016), 13 vom: 01. Juli, Seite 2059-61

Sprache:

Englisch

Beteiligte Personen:

Bhattacharya, Debswapna [VerfasserIn]
Adhikari, Badri [VerfasserIn]
Li, Jilong [VerfasserIn]
Cheng, Jianlin [VerfasserIn]

Links:

Volltext

Themen:

Journal Article
Peptide Library
Proteins

Anmerkungen:

Date Completed 21.08.2017

Date Revised 02.12.2018

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1093/bioinformatics/btw067

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM260106852