Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment
Glycosylation of proteins in eukaryotic cells is an important and complicated post-translational modification because of its pivotal role in, and association with, crucial physiological functions of most proteins. Identifying glycosylation sites in a polypeptide chain is difficult because of multiple impediments, and analytical identification of these sites is expensive and laborious. There is a pressing need for a reliable computational method that determines such sites precisely and saves researchers time and effort. Herein, we propose a novel predictor, iGlycoS-PseAAC, that integrates Chou's Pseudo Amino Acid Composition (PseAAC) with relative/absolute position-based features. In the self-consistency test on the benchmark dataset, the model predicts O-linked glycosylation at serine sites with an accuracy of 98.8 percent. The overall accuracy under 10-fold cross-validation, combining the positive and negative results, is 97.2 percent. The overall accuracy under the jackknife test, aggregating all prediction results, is 96.195 percent. Thus the proposed predictor can help predict O-linked glycosylated serine sites efficiently and accurately. The overall results show that the accuracy of iGlycoS-PseAAC is higher than that of existing tools.
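The three accuracy figures in the abstract come from three different evaluation protocols. The following is a minimal sketch of how they differ, not the authors' pipeline: the nearest-centroid classifier, the two-dimensional feature vectors, and all function names are hypothetical stand-ins, and the PseAAC and statistical-moment features of the paper are not reproduced here.

```python
from typing import Dict, List, Sequence, Tuple

# (feature vector, label): 1 = glycosylated serine site, 0 = non-site.
Sample = Tuple[Sequence[float], int]


def fit(train: List[Sample]) -> Dict[int, List[float]]:
    """Toy stand-in classifier: store the mean feature vector per class."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for x, y in train:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(model: Dict[int, List[float]], x: Sequence[float]) -> int:
    """Return the label whose centroid is closest to x."""
    def dist2(c: Sequence[float]) -> float:
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(model, key=lambda y: dist2(model[y]))


def self_consistency_accuracy(data: List[Sample]) -> float:
    """Train and test on the same samples (optimistic by construction)."""
    model = fit(data)
    return sum(predict(model, x) == y for x, y in data) / len(data)


def k_fold_accuracy(data: List[Sample], k: int) -> float:
    """Hold out every k-th sample in turn; pool correct predictions over folds."""
    n = len(data)
    correct = 0
    for fold in range(k):
        test_idx = set(range(fold, n, k))
        model = fit([s for i, s in enumerate(data) if i not in test_idx])
        correct += sum(predict(model, data[i][0]) == data[i][1] for i in test_idx)
    return correct / n


def jackknife_accuracy(data: List[Sample]) -> float:
    """Jackknife (leave-one-out): k-fold with one sample per fold."""
    return k_fold_accuracy(data, len(data))


# Made-up, well-separated toy data; real inputs would be encoded peptides.
data: List[Sample] = [
    ([0.9, 1.0], 1), ([1.1, 0.8], 1), ([1.0, 1.2], 1), ([0.8, 0.9], 1), ([1.2, 1.1], 1),
    ([0.1, 0.0], 0), ([0.0, 0.2], 0), ([-0.1, 0.1], 0), ([0.2, -0.1], 0), ([0.1, 0.1], 0),
]

print(self_consistency_accuracy(data), k_fold_accuracy(data, 5), jackknife_accuracy(data))
```

Self-consistency reuses the training samples for testing, so it usually gives the highest figure. The 10-fold and jackknife tests score each sample with a model that never saw it, which is consistent with the ordering of the three accuracies reported in the abstract.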
Media type: | E-article |
---|---|
Year of publication: | 2021 |
Published: | 2021 |
Contained in: | IEEE/ACM transactions on computational biology and bioinformatics - 18(2021), no. 5, 22 Sept., pages 2045-2056 |
Language: | English |
Contributors: | Akmal, Muhammad Aizaz [author] |
Notes: | Date Completed 21.01.2022; Date Revised 21.01.2022; published: Print-Electronic; Citation Status MEDLINE |
doi: | 10.1109/TCBB.2020.2968441 |
PPN (catalog ID): | NLM305819615 |
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM305819615 | ||
003 | DE-627 | ||
005 | 20231225122014.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231225s2021 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1109/TCBB.2020.2968441 |2 doi | |
028 | 5 | 2 | |a pubmed24n1019.xml |
035 | |a (DE-627)NLM305819615 | ||
035 | |a (NLM)31985438 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Akmal, Muhammad Aizaz |e verfasserin |4 aut | |
245 | 1 | 0 | |a Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment |
264 | 1 | |c 2021 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 21.01.2022 | ||
500 | |a Date Revised 21.01.2022 | ||
500 | |a published: Print-Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a Glycosylation of proteins in eukaryote cells is an important and complicated post-translation modification due to its pivotal role and association with crucial physiological functions within most of the proteins. Identification of glycosylation sites in a polypeptide chain is not an easy task due to multiple impediments. Analytical identification of these sites is expensive and laborious. There is a dire need to develop a reliable computational method for precise determination of such sites which can help researchers to save time and effort. Herein, we propose a novel predictor namely iGlycoS-PseAAC by integrating the Chou's Pseudo Amino Acid Composition (PseAAC) and relative/absolute position-based features. The self-consistency results show that the accuracy revealed by the model using the benchmark dataset for prediction of O-linked glycosylation having serine sites is 98.8 percent. The overall accuracy of predictor achieved through 10-fold cross validation by combining the positive and negative results is 97.2 percent. The overall accuracy achieved through Jackknife test is 96.195 percent by aggregating of all the prediction results. Thus the proposed predictor can help in predicting the O-linked glycosylated serine sites in an efficient and accurate way. The overall results show that the accuracy of the iGlycoS-PseAAC is higher than the existing tools | ||
650 | 4 | |a Journal Article | |
650 | 7 | |a Glycoproteins |2 NLM | |
650 | 7 | |a Serine |2 NLM | |
650 | 7 | |a 452VLY9402 |2 NLM | |
700 | 1 | |a Hussain, Waqar |e verfasserin |4 aut | |
700 | 1 | |a Rasool, Nouman |e verfasserin |4 aut | |
700 | 1 | |a Khan, Yaser Daanial |e verfasserin |4 aut | |
700 | 1 | |a Khan, Sher Afzal |e verfasserin |4 aut | |
700 | 1 | |a Chou, Kuo-Chen |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t IEEE/ACM transactions on computational biology and bioinformatics |d 2004 |g 18(2021), 5 vom: 22. Sept., Seite 2045-2056 |w (DE-627)NLM16601530X |x 1557-9964 |7 nnns |
773 | 1 | 8 | |g volume:18 |g year:2021 |g number:5 |g day:22 |g month:09 |g pages:2045-2056 |
856 | 4 | 0 | |u http://dx.doi.org/10.1109/TCBB.2020.2968441 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 18 |j 2021 |e 5 |b 22 |c 09 |h 2045-2056 |