Publication details - On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme

On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme

Copyright © 2023 Wang, Zhu, Izu, Chen-Izu, Ono, Altaf-Ul-Amin, Kanaya and Huang.

Introduction: Because of its direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. Over the past decades, computational models based on the quantitative structure-activity relationship (QSAR) have been proposed to screen out cardiotoxic compounds and have shown promising results. Combining molecular fingerprints with machine learning models gives stable performance across a wide spectrum of problems; however, since the advent of the graph neural network (GNN) deep learning model and its variants (e.g., the graph transformer), GNNs have become the principal approach to QSAR-based modeling because of their high flexibility in feature extraction and decision rule generation. Despite this progress, the expressiveness of the GNN model (its ability to distinguish non-isomorphic graph structures) is bounded by the Weisfeiler-Lehman (WL) isomorphism test, and the choice of a suitable thresholding scheme, which relates directly to the sensitivity and credibility of a model, remains an open question. Methods: In this research, we further improved the expressiveness of the GNN model by introducing a substructure-aware bias through the graph subgraph transformer network model. Moreover, to identify the most appropriate thresholding scheme, we conducted a comprehensive comparison of thresholding schemes. Results: With these improvements, the best model attains 90.4% precision, 90.4% recall, and a 90.5% F1-score under a dual-threshold scheme (active: <1 μM; non-active: >30 μM). The improved pipeline (graph subgraph transformer network model and thresholding scheme) also shows advantages with respect to the activity cliff problem and model interpretability.
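The dual-threshold scheme named in the abstract (active: <1 μM; non-active: >30 μM) can be illustrated with a minimal Python sketch. Only the two cut-offs come from the abstract. Treating the measured quantity as an IC50-style potency value and leaving compounds between the two thresholds unlabeled are assumptions made here for illustration, and the function names `dual_threshold_label` and `label_dataset` are hypothetical, not taken from the paper.

```python
from typing import List, Optional, Tuple

ACTIVE_MAX_UM = 1.0     # active: potency < 1 uM (cut-off from the abstract)
INACTIVE_MIN_UM = 30.0  # non-active: potency > 30 uM (cut-off from the abstract)


def dual_threshold_label(potency_um: float) -> Optional[int]:
    """Return 1 (active), 0 (non-active), or None (between the thresholds)."""
    if potency_um < ACTIVE_MAX_UM:
        return 1
    if potency_um > INACTIVE_MIN_UM:
        return 0
    # Assumption: the ambiguous band from 1 to 30 uM is left unlabeled.
    return None


def label_dataset(potencies_um: List[float]) -> List[Tuple[float, int]]:
    """Keep only the compounds that receive a definite label."""
    labeled = []
    for value in potencies_um:
        label = dual_threshold_label(value)
        if label is not None:
            labeled.append((value, label))
    return labeled


if __name__ == "__main__":
    # Hypothetical potency values in uM, for illustration only.
    print(label_dataset([0.2, 5.0, 45.0]))
```

Compared with a single cut-off, a scheme of this kind trades dataset size for cleaner class separation, since borderline compounds do not contribute to training or evaluation labels.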

Media type:	E-article

Year of publication:	2023
Published:	2023

Contained in:	To the parent record - volume:14
Contained in:	Frontiers in physiology - 14(2023), day: 19, page 1156286

Language:	English

Contributors:	Wang, Huijia [author]; Zhu, Guangxian [author]; Izu, Leighton T [author]; Chen-Izu, Ye [author]; Ono, Naoaki [author]; Altaf-Ul-Amin, M D [author]; Kanaya, Shigehiko [author]; Huang, Ming [author]

Links:	Full text

Subjects:	Cardiotoxicity; Dual-threshold; Graph transformer neural network; hERG; Journal Article; Meta-path

Notes:	Date Revised 06.12.2023; published: Electronic-eCollection; Citation Status PubMed-not-MEDLINE

doi:	10.3389/fphys.2023.1156286


PPN (catalog ID):	NLM357311809

Internal format


LEADER	01000naa a22002652 4500
001	NLM357311809
003	DE-627
005	20231226072317.0
007	cr uuu---uuuuu
008	231226s2023 xx |||||o 00| ||eng c
024	7		|a 10.3389/fphys.2023.1156286 |2 doi
028	5	2	|a pubmed24n1190.xml
035			|a (DE-627)NLM357311809
035			|a (NLM)37228825
040			|a DE-627 |b ger |c DE-627 |e rakwb
041			|a eng
100	1		|a Wang, Huijia |e verfasserin |4 aut
245	1	0	|a On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme
264		1	|c 2023
336			|a Text |b txt |2 rdacontent
337			|a Computermedien |b c |2 rdamedia
338			|a Online-Ressource |b cr |2 rdacarrier
500			|a Date Revised 06.12.2023
500			|a published: Electronic-eCollection
500			|a Citation Status PubMed-not-MEDLINE
520			|a Copyright © 2023 Wang, Zhu, Izu, Chen-Izu, Ono, Altaf-Ul-Amin, Kanaya and Huang.
520			|a Introduction: Given the direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. In the past decades, computational models based on the quantitative structure-activity relationship have been proposed to screen out cardiotoxic compounds and have shown promising results. The combination of molecular fingerprint and the machine learning model shows stable performance for a wide spectrum of problems; however, not long after the advent of the graph neural network (GNN) deep learning model and its variant (e.g., graph transformer), it has become the principal way of quantitative structure-activity relationship-based modeling for its high flexibility in feature extraction and decision rule generation. Despite all these progresses, the expressiveness (the ability of a program to identify non-isomorphic graph structures) of the GNN model is bounded by the WL isomorphism test, and a suitable thresholding scheme that relates directly to the sensitivity and credibility of a model is still an open question. Methods: In this research, we further improved the expressiveness of the GNN model by introducing the substructure-aware bias by the graph subgraph transformer network model. Moreover, to propose the most appropriate thresholding scheme, a comprehensive comparison of the thresholding schemes was conducted. Results: Based on these improvements, the best model attains performance with 90.4% precision, 90.4% recall, and 90.5% F1-score with a dual-threshold scheme (active: <1μM; non-active: >30μM). The improved pipeline (graph subgraph transformer network model and thresholding scheme) also shows its advantages in terms of the activity cliff problem and model interpretability
650		4	|a Journal Article
650		4	|a cardiotoxicity
650		4	|a dual-threshold
650		4	|a graph transformer neural network
650		4	|a hERG
650		4	|a meta-path
700	1		|a Zhu, Guangxian |e verfasserin |4 aut
700	1		|a Izu, Leighton T |e verfasserin |4 aut
700	1		|a Chen-Izu, Ye |e verfasserin |4 aut
700	1		|a Ono, Naoaki |e verfasserin |4 aut
700	1		|a Altaf-Ul-Amin, M D |e verfasserin |4 aut
700	1		|a Kanaya, Shigehiko |e verfasserin |4 aut
700	1		|a Huang, Ming |e verfasserin |4 aut
773	0	8	|i Enthalten in |t Frontiers in physiology |d 2010 |g 14(2023) vom: 19., Seite 1156286 |w (DE-627)NLM205532799 |x 1664-042X |7 nnns
773	1	8	|g volume:14 |g year:2023 |g day:19 |g pages:1156286
856	4	0	|u http://dx.doi.org/10.3389/fphys.2023.1156286 |3 Volltext
912			|a GBV_USEFLAG_A
912			|a GBV_NLM
951			|a AR
952			|d 14 |j 2023 |b 19 |h 1156286
