Pharmacovariome scanning using whole pharmacogene resequencing coupled with deep computational analysis and machine learning for clinical pharmacogenomics

© 2023. The Author(s)..

BACKGROUND: This pilot study aims to identify and functionally assess pharmacovariants in whole exome sequencing data. While detection of known variants has benefited from pharmacogenomic-dedicated bioinformatics tools before, in this paper we have tested novel deep computational analysis in addition to artificial intelligence as possible approaches for functional analysis of unknown markers within less studied drug-related genes.

METHODS: Pharmacovariants from 1800 drug-related genes from 100 WES data files underwent (a) deep computational analysis by eight bioinformatic algorithms (overall containing 23 tools) and (b) random forest (RF) classifier as the machine learning (ML) approach separately. ML model efficiency was calculated by internal and external cross-validation during recursive feature elimination. Protein modelling was also performed for predicted highly damaging variants with lower frequencies. Genotype-phenotype correlations were implemented for top selected variants in terms of highest possibility of being damaging.

RESULTS: Five deleterious pharmacovariants in the RYR1, POLG, ANXA11, CCNH, and CDH23 genes identified in step (a) and subsequent analysis displayed high impact on drug-related phenotypes. Also, the utilization of recursive feature elimination achieved a subset of 175 malfunction pharmacovariants in 135 drug-related genes that were used by the RF model with fivefold internal cross-validation, resulting in an area under the curve of 0.9736842 with an average accuracy of 0.9818 (95% CI: 0.89, 0.99) on predicting whether a carrying individuals will develop adverse drug reactions or not. However, the external cross-validation of the same model indicated a possible false positive result when dealing with a low number of observations, as only 60 important variants in 49 genes were displayed, giving an AUC of 0.5384848 with an average accuracy of 0.9512 (95% CI: 0.83, 0.99).

CONCLUSION: While there are some technologies for functionally assess not-interpreted pharmacovariants, there is still an essential need for the development of tools, methods, and algorithms which are able to provide a functional prediction for every single pharmacovariant in both large-scale datasets and small cohorts. Our approaches may bring new insights for choosing the right computational assessment algorithms out of high throughput DNA sequencing data from small cohorts to be used for personalized drug therapy implementation.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:17

Enthalten in:

Human genomics - 17(2023), 1 vom: 14. Juli, Seite 62

Sprache:

Englisch

Beteiligte Personen:

Tafazoli, Alireza [VerfasserIn]
Mikros, John [VerfasserIn]
Khaghani, Faeze [VerfasserIn]
Alimardani, Maliheh [VerfasserIn]
Rafigh, Mahboobeh [VerfasserIn]
Hemmati, Mahboobeh [VerfasserIn]
Siamoglou, Stavroula [VerfasserIn]
Golińska, Agnieszka Kitlas [VerfasserIn]
Kamiński, Karol A [VerfasserIn]
Niemira, Magdalena [VerfasserIn]
Miltyk, Wojciech [VerfasserIn]
Patrinos, George P [VerfasserIn]

Links:

Volltext

Themen:

Artificial intelligence
Deep computational analysis
Functional assessment
High throughput DNA sequencing
Journal Article
Machine learning
Pharmacogenomics
Pharmacovariants

Anmerkungen:

Date Completed 17.07.2023

Date Revised 16.11.2023

published: Electronic

Citation Status MEDLINE

doi:

10.1186/s40246-023-00508-1

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM359530729