Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment

Glycosylation of proteins in eukaryote cells is an important and complicated post-translation modification due to its pivotal role and association with crucial physiological functions within most of the proteins. Identification of glycosylation sites in a polypeptide chain is not an easy task due to multiple impediments. Analytical identification of these sites is expensive and laborious. There is a dire need to develop a reliable computational method for precise determination of such sites which can help researchers to save time and effort. Herein, we propose a novel predictor namely iGlycoS-PseAAC by integrating the Chou's Pseudo Amino Acid Composition (PseAAC) and relative/absolute position-based features. The self-consistency results show that the accuracy revealed by the model using the benchmark dataset for prediction of O-linked glycosylation having serine sites is 98.8 percent. The overall accuracy of predictor achieved through 10-fold cross validation by combining the positive and negative results is 97.2 percent. The overall accuracy achieved through Jackknife test is 96.195 percent by aggregating of all the prediction results. Thus the proposed predictor can help in predicting the O-linked glycosylated serine sites in an efficient and accurate way. The overall results show that the accuracy of the iGlycoS-PseAAC is higher than the existing tools.

Medienart:

E-Artikel

Erscheinungsjahr:

2021

Erschienen:

2021

Enthalten in:

Zur Gesamtaufnahme - volume:18

Enthalten in:

IEEE/ACM transactions on computational biology and bioinformatics - 18(2021), 5 vom: 22. Sept., Seite 2045-2056

Sprache:

Englisch

Beteiligte Personen:

Akmal, Muhammad Aizaz [VerfasserIn]
Hussain, Waqar [VerfasserIn]
Rasool, Nouman [VerfasserIn]
Khan, Yaser Daanial [VerfasserIn]
Khan, Sher Afzal [VerfasserIn]
Chou, Kuo-Chen [VerfasserIn]

Links:

Volltext

Themen:

452VLY9402
Glycoproteins
Journal Article
Serine

Anmerkungen:

Date Completed 21.01.2022

Date Revised 21.01.2022

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1109/TCBB.2020.2968441

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM305819615