Identification of potential driver mutations in glioblastoma using machine learning

© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissionsoup.com..

Glioblastoma is a fast and aggressively growing tumor in the brain and spinal cord. Mutation of amino acid residues in targets proteins, which are involved in glioblastoma, alters the structure and function and may lead to disease. In this study, we collected a set of 9386 disease-causing (drivers) mutations based on the recurrence in patient samples and experimentally annotated as pathogenic and 8728 as neutral (passenger) mutations. We observed that Arg is highly preferred at the mutant sites of drivers, whereas Met and Ile showed preferences in passengers. Inspecting neighboring residues at the mutant sites revealed that the motifs YP, CP and GRH, are preferred in drivers, whereas SI, IQ and TVI are dominant in neutral. In addition, we have computed other sequence-based features such as conservation scores, Position Specific Scoring Matrices (PSSM) and physicochemical properties, and developed a machine learning-based method, GBMDriver (GlioBlastoma Multiforme Drivers), for distinguishing between driver and passenger mutations. Our method showed an accuracy and AUC of 73.59% and 0.82, respectively, on 10-fold cross-validation and 81.99% and 0.87 in a blind set of 1809 mutants. The tool is available at https://web.iitm.ac.in/bioinfo2/GBMDriver/index.html. We envisage that the present method is helpful to prioritize driver mutations in glioblastoma and assist in identifying therapeutic targets.

Medienart:

E-Artikel

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

Zur Gesamtaufnahme - volume:23

Enthalten in:

Briefings in bioinformatics - 23(2022), 6 vom: 19. Nov.

Sprache:

Englisch

Beteiligte Personen:

Pandey, Medha [VerfasserIn]
Anoosha, P [VerfasserIn]
Yesudhas, Dhanusha [VerfasserIn]
Gromiha, M Michael [VerfasserIn]

Links:

Volltext

Themen:

Amino Acids
Cancer
Driver mutation
Glioblastoma
Journal Article
Machine learning
Motifs
Proteins
Research Support, Non-U.S. Gov't
Variants

Anmerkungen:

Date Completed 23.11.2022

Date Revised 12.12.2022

published: Print

Citation Status MEDLINE

doi:

10.1093/bib/bbac451

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM347808913