Identification of potential driver mutations in glioblastoma using machine learning
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissionsoup.com..
Glioblastoma is a fast and aggressively growing tumor in the brain and spinal cord. Mutation of amino acid residues in targets proteins, which are involved in glioblastoma, alters the structure and function and may lead to disease. In this study, we collected a set of 9386 disease-causing (drivers) mutations based on the recurrence in patient samples and experimentally annotated as pathogenic and 8728 as neutral (passenger) mutations. We observed that Arg is highly preferred at the mutant sites of drivers, whereas Met and Ile showed preferences in passengers. Inspecting neighboring residues at the mutant sites revealed that the motifs YP, CP and GRH, are preferred in drivers, whereas SI, IQ and TVI are dominant in neutral. In addition, we have computed other sequence-based features such as conservation scores, Position Specific Scoring Matrices (PSSM) and physicochemical properties, and developed a machine learning-based method, GBMDriver (GlioBlastoma Multiforme Drivers), for distinguishing between driver and passenger mutations. Our method showed an accuracy and AUC of 73.59% and 0.82, respectively, on 10-fold cross-validation and 81.99% and 0.87 in a blind set of 1809 mutants. The tool is available at https://web.iitm.ac.in/bioinfo2/GBMDriver/index.html. We envisage that the present method is helpful to prioritize driver mutations in glioblastoma and assist in identifying therapeutic targets.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2022 |
---|---|
Erschienen: |
2022 |
Enthalten in: |
Zur Gesamtaufnahme - volume:23 |
---|---|
Enthalten in: |
Briefings in bioinformatics - 23(2022), 6 vom: 19. Nov. |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Pandey, Medha [VerfasserIn] |
---|
Links: |
---|
Themen: |
Amino Acids |
---|
Anmerkungen: |
Date Completed 23.11.2022 Date Revised 12.12.2022 published: Print Citation Status MEDLINE |
---|
doi: |
10.1093/bib/bbac451 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM347808913 |
---|
LEADER | 01000naa a22002652 4500 | ||
---|---|---|---|
001 | NLM347808913 | ||
003 | DE-627 | ||
005 | 20231226034721.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2022 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1093/bib/bbac451 |2 doi | |
028 | 5 | 2 | |a pubmed24n1159.xml |
035 | |a (DE-627)NLM347808913 | ||
035 | |a (NLM)36266243 | ||
035 | |a (PII)bbac451 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Pandey, Medha |e verfasserin |4 aut | |
245 | 1 | 0 | |a Identification of potential driver mutations in glioblastoma using machine learning |
264 | 1 | |c 2022 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 23.11.2022 | ||
500 | |a Date Revised 12.12.2022 | ||
500 | |a published: Print | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissionsoup.com. | ||
520 | |a Glioblastoma is a fast and aggressively growing tumor in the brain and spinal cord. Mutation of amino acid residues in targets proteins, which are involved in glioblastoma, alters the structure and function and may lead to disease. In this study, we collected a set of 9386 disease-causing (drivers) mutations based on the recurrence in patient samples and experimentally annotated as pathogenic and 8728 as neutral (passenger) mutations. We observed that Arg is highly preferred at the mutant sites of drivers, whereas Met and Ile showed preferences in passengers. Inspecting neighboring residues at the mutant sites revealed that the motifs YP, CP and GRH, are preferred in drivers, whereas SI, IQ and TVI are dominant in neutral. In addition, we have computed other sequence-based features such as conservation scores, Position Specific Scoring Matrices (PSSM) and physicochemical properties, and developed a machine learning-based method, GBMDriver (GlioBlastoma Multiforme Drivers), for distinguishing between driver and passenger mutations. Our method showed an accuracy and AUC of 73.59% and 0.82, respectively, on 10-fold cross-validation and 81.99% and 0.87 in a blind set of 1809 mutants. The tool is available at https://web.iitm.ac.in/bioinfo2/GBMDriver/index.html. We envisage that the present method is helpful to prioritize driver mutations in glioblastoma and assist in identifying therapeutic targets | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a cancer | |
650 | 4 | |a driver mutation | |
650 | 4 | |a glioblastoma | |
650 | 4 | |a machine learning | |
650 | 4 | |a motifs | |
650 | 4 | |a variants | |
650 | 7 | |a Proteins |2 NLM | |
650 | 7 | |a Amino Acids |2 NLM | |
700 | 1 | |a Anoosha, P |e verfasserin |4 aut | |
700 | 1 | |a Yesudhas, Dhanusha |e verfasserin |4 aut | |
700 | 1 | |a Gromiha, M Michael |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Briefings in bioinformatics |d 2000 |g 23(2022), 6 vom: 19. Nov. |w (DE-627)NLM11366883X |x 1477-4054 |7 nnns |
773 | 1 | 8 | |g volume:23 |g year:2022 |g number:6 |g day:19 |g month:11 |
856 | 4 | 0 | |u http://dx.doi.org/10.1093/bib/bbac451 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 23 |j 2022 |e 6 |b 19 |c 11 |