Assigning secondary structure in proteins using AI
© 2021. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature..
Knowledge about protein structure assignment enriches the structural and functional understanding of proteins. Accurate and reliable structure assignment data is crucial for secondary structure prediction systems. Since the 1980s, various methods based on hydrogen bond analysis and atomic coordinate geometry, followed by machine learning, have been employed in protein structure assignment. However, the assignment process becomes challenging when missing atoms are present in the protein files. Our method proposed a multi-class classifier program named DLFSA for assigning protein secondary structure elements (SSE) using convolutional neural networks (CNNs). A fast and efficient GPU-based parallel procedure extracts fragments from protein files. The model implemented in this work is trained with a subset of the protein fragments and achieves 88.1% and 82.5% train and test accuracy, respectively. The model uses only Cα coordinates for secondary structure assignments. The model has been successfully tested on a few full-length proteins also. Results from the fragment-based studies demonstrate the feasibility of applying deep learning solutions for structure assignment problems.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2021 |
---|---|
Erschienen: |
2021 |
Enthalten in: |
Zur Gesamtaufnahme - volume:27 |
---|---|
Enthalten in: |
Journal of molecular modeling - 27(2021), 9 vom: 17. Aug., Seite 252 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Antony, Jisna Vellara [VerfasserIn] |
---|
Links: |
---|
Themen: |
Convolutional neural networks |
---|
Anmerkungen: |
Date Completed 24.01.2022 Date Revised 24.01.2022 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1007/s00894-021-04825-x |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM329465740 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM329465740 | ||
003 | DE-627 | ||
005 | 20231225205159.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2021 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1007/s00894-021-04825-x |2 doi | |
028 | 5 | 2 | |a pubmed24n1098.xml |
035 | |a (DE-627)NLM329465740 | ||
035 | |a (NLM)34402969 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Antony, Jisna Vellara |e verfasserin |4 aut | |
245 | 1 | 0 | |a Assigning secondary structure in proteins using AI |
264 | 1 | |c 2021 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 24.01.2022 | ||
500 | |a Date Revised 24.01.2022 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2021. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature. | ||
520 | |a Knowledge about protein structure assignment enriches the structural and functional understanding of proteins. Accurate and reliable structure assignment data is crucial for secondary structure prediction systems. Since the 1980s, various methods based on hydrogen bond analysis and atomic coordinate geometry, followed by machine learning, have been employed in protein structure assignment. However, the assignment process becomes challenging when missing atoms are present in the protein files. Our method proposed a multi-class classifier program named DLFSA for assigning protein secondary structure elements (SSE) using convolutional neural networks (CNNs). A fast and efficient GPU-based parallel procedure extracts fragments from protein files. The model implemented in this work is trained with a subset of the protein fragments and achieves 88.1% and 82.5% train and test accuracy, respectively. The model uses only Cα coordinates for secondary structure assignments. The model has been successfully tested on a few full-length proteins also. Results from the fragment-based studies demonstrate the feasibility of applying deep learning solutions for structure assignment problems | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Convolutional neural networks | |
650 | 4 | |a Deep learning | |
650 | 4 | |a Fragment library creation | |
650 | 4 | |a Multi-class classifier | |
650 | 4 | |a Protein fragments | |
650 | 4 | |a Protein secondary structures | |
650 | 4 | |a Protein structure assignment | |
650 | 7 | |a Proteins |2 NLM | |
700 | 1 | |a Madhu, Prayagh |e verfasserin |4 aut | |
700 | 1 | |a Balakrishnan, Jayaraj Pottekkattuvalappil |e verfasserin |4 aut | |
700 | 1 | |a Yadav, Hemant |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Journal of molecular modeling |d 1998 |g 27(2021), 9 vom: 17. Aug., Seite 252 |w (DE-627)NLM09744605X |x 0948-5023 |7 nnns |
773 | 1 | 8 | |g volume:27 |g year:2021 |g number:9 |g day:17 |g month:08 |g pages:252 |
856 | 4 | 0 | |u http://dx.doi.org/10.1007/s00894-021-04825-x |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 27 |j 2021 |e 9 |b 17 |c 08 |h 252 |