FRAGSION : ultra-fast protein fragment library generation by IOHMM sampling
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissionsoup.com..
MOTIVATION: Speed, accuracy and robustness of building protein fragment library have important implications in de novo protein structure prediction since fragment-based methods are one of the most successful approaches in template-free modeling (FM). Majority of the existing fragment detection methods rely on database-driven search strategies to identify candidate fragments, which are inherently time-consuming and often hinder the possibility to locate longer fragments due to the limited sizes of databases. Also, it is difficult to alleviate the effect of noisy sequence-based predicted features such as secondary structures on the quality of fragment.
RESULTS: Here, we present FRAGSION, a database-free method to efficiently generate protein fragment library by sampling from an Input-Output Hidden Markov Model. FRAGSION offers some unique features compared to existing approaches in that it (i) is lightning-fast, consuming only few seconds of CPU time to generate fragment library for a protein of typical length (300 residues); (ii) can generate dynamic-size fragments of any length (even for the whole protein sequence) and (iii) offers ways to handle noise in predicted secondary structure during fragment sampling. On a FM dataset from the most recent Critical Assessment of Structure Prediction, we demonstrate that FGRAGSION provides advantages over the state-of-the-art fragment picking protocol of ROSETTA suite by speeding up computation by several orders of magnitude while achieving comparable performance in fragment quality.
AVAILABILITY AND IMPLEMENTATION: Source code and executable versions of FRAGSION for Linux and MacOS is freely available to non-commercial users at http://sysbio.rnet.missouri.edu/FRAGSION/ It is bundled with a manual and example data.
CONTACT: chengjimissouri.edu.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2016 |
---|---|
Erschienen: |
2016 |
Enthalten in: |
Zur Gesamtaufnahme - volume:32 |
---|---|
Enthalten in: |
Bioinformatics (Oxford, England) - 32(2016), 13 vom: 01. Juli, Seite 2059-61 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Bhattacharya, Debswapna [VerfasserIn] |
---|
Links: |
---|
Themen: |
---|
Anmerkungen: |
Date Completed 21.08.2017 Date Revised 02.12.2018 published: Print-Electronic Citation Status MEDLINE |
---|
doi: |
10.1093/bioinformatics/btw067 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM260106852 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM260106852 | ||
003 | DE-627 | ||
005 | 20231224192657.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231224s2016 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1093/bioinformatics/btw067 |2 doi | |
028 | 5 | 2 | |a pubmed24n0867.xml |
035 | |a (DE-627)NLM260106852 | ||
035 | |a (NLM)27153697 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Bhattacharya, Debswapna |e verfasserin |4 aut | |
245 | 1 | 0 | |a FRAGSION |b ultra-fast protein fragment library generation by IOHMM sampling |
264 | 1 | |c 2016 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 21.08.2017 | ||
500 | |a Date Revised 02.12.2018 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissionsoup.com. | ||
520 | |a MOTIVATION: Speed, accuracy and robustness of building protein fragment library have important implications in de novo protein structure prediction since fragment-based methods are one of the most successful approaches in template-free modeling (FM). Majority of the existing fragment detection methods rely on database-driven search strategies to identify candidate fragments, which are inherently time-consuming and often hinder the possibility to locate longer fragments due to the limited sizes of databases. Also, it is difficult to alleviate the effect of noisy sequence-based predicted features such as secondary structures on the quality of fragment | ||
520 | |a RESULTS: Here, we present FRAGSION, a database-free method to efficiently generate protein fragment library by sampling from an Input-Output Hidden Markov Model. FRAGSION offers some unique features compared to existing approaches in that it (i) is lightning-fast, consuming only few seconds of CPU time to generate fragment library for a protein of typical length (300 residues); (ii) can generate dynamic-size fragments of any length (even for the whole protein sequence) and (iii) offers ways to handle noise in predicted secondary structure during fragment sampling. On a FM dataset from the most recent Critical Assessment of Structure Prediction, we demonstrate that FGRAGSION provides advantages over the state-of-the-art fragment picking protocol of ROSETTA suite by speeding up computation by several orders of magnitude while achieving comparable performance in fragment quality | ||
520 | |a AVAILABILITY AND IMPLEMENTATION: Source code and executable versions of FRAGSION for Linux and MacOS is freely available to non-commercial users at http://sysbio.rnet.missouri.edu/FRAGSION/ It is bundled with a manual and example data | ||
520 | |a CONTACT: chengjimissouri.edu | ||
520 | |a SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a Peptide Library |2 NLM | |
650 | 7 | |a Proteins |2 NLM | |
700 | 1 | |a Adhikari, Badri |e verfasserin |4 aut | |
700 | 1 | |a Li, Jilong |e verfasserin |4 aut | |
700 | 1 | |a Cheng, Jianlin |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Bioinformatics (Oxford, England) |d 1998 |g 32(2016), 13 vom: 01. Juli, Seite 2059-61 |w (DE-627)NLM094620342 |x 1367-4811 |7 nnns |
773 | 1 | 8 | |g volume:32 |g year:2016 |g number:13 |g day:01 |g month:07 |g pages:2059-61 |
856 | 4 | 0 | |u http://dx.doi.org/10.1093/bioinformatics/btw067 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 32 |j 2016 |e 13 |b 01 |c 07 |h 2059-61 |